Tag: Deepseek
-

Engram, DeepSeek, and the return of “memory” as an architectural primitive
DeepSeek’s Engram adds conditional memory to MoE models, shifting routine local patterns to fast lookup—freeing compute as memory costs surge.
-

How DeepSeek’s Mathematical Optimizations Complement NVIDIA’s NCCL for Efficient AI Training
DeepSeek’s low-precision math and NCCL’s optimized inter-GPU collectives cut bandwidth, boost multi-GPU training efficiency and scalability.
-

Humanity’s Last Exam: The Ultimate Test for AI and the Future of Intelligence
Humanity’s Last Exam exposes AI limits: top models falter on reasoning. DeepSeek R1 shows promise but true AGI remains out of reach.
-

The AI Gold Rush Just Got Interesting: How China’s DeepSeek R1 is Giving Silicon Valley a Run for Its (Literal) Money
DeepSeek R1 slashes AI costs, democratizing powerful models and sparking innovation while challenging Silicon Valley’s AI dominance.