Company
Date Published
Author
Chuan Li
Word count
934
Language
English
Hacker News points
None

Summary

The NVIDIA GeForce RTX 4090 is the newest GPU for gamers, creators, students, and researchers. Compared with its predecessor, the GeForce RTX 3090, it delivers significantly higher training throughput and better training throughput per dollar. Its training throughput per watt is comparable to the RTX 3090's, despite its high power consumption of 450 W. Multi-GPU training scales reasonably well on the RTX 4090: most models get close to double the training throughput with two GPUs, although some sub-optimal scaling was observed. The RTX 4090 consistently outperformed the RTX 3090 in multi-GPU tests. Although it draws more power and is physically larger than the RTX 3090, the RTX 4090 is a great option for deep learning workloads because of its superior performance and cost-effectiveness.
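
The three metrics in the summary (throughput per dollar, throughput per watt, and multi-GPU scaling) are simple ratios. The sketch below is one way to compute them. The function names are hypothetical. The throughput and price values are placeholders chosen for illustration, not measurements from the benchmark. Only the 450 W figure comes from the text.

```python
def throughput_per_dollar(throughput: float, price_usd: float) -> float:
    """Training throughput (e.g. samples/sec) divided by GPU price in USD."""
    if price_usd <= 0:
        raise ValueError("price_usd must be positive")
    return throughput / price_usd


def throughput_per_watt(throughput: float, power_watts: float) -> float:
    """Training throughput divided by GPU power draw in watts."""
    if power_watts <= 0:
        raise ValueError("power_watts must be positive")
    return throughput / power_watts


def scaling_efficiency(multi_gpu_throughput: float,
                       single_gpu_throughput: float,
                       num_gpus: int) -> float:
    """Fraction of ideal linear scaling achieved (1.0 means perfect scaling)."""
    if single_gpu_throughput <= 0 or num_gpus <= 0:
        raise ValueError("single_gpu_throughput and num_gpus must be positive")
    return multi_gpu_throughput / (single_gpu_throughput * num_gpus)


if __name__ == "__main__":
    # Placeholder values for illustration only, NOT benchmark results.
    single_gpu = 100.0   # samples/sec on one GPU (assumed)
    dual_gpu = 180.0     # samples/sec on two GPUs (assumed)
    price = 1600.0       # USD (assumed)
    power = 450.0        # watts, the RTX 4090 figure quoted in the summary

    print("throughput/$:", throughput_per_dollar(single_gpu, price))
    print("throughput/W:", throughput_per_watt(single_gpu, power))
    print("2-GPU scaling efficiency:", scaling_efficiency(dual_gpu, single_gpu, 2))
```

A scaling efficiency near 1.0 corresponds to "close to double" throughput with two GPUs. Noticeably lower values indicate the sub-optimal scaling the summary mentions.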