HN: Deepseek and great HN thread on it - must read

SoMaCoSF · January 26, 2025, 12:56am

we’ve been tracking the deepseek threads extensively in LS. related reads:

i consider the deepseek v3 paper required preread GitHub - deepseek-ai/DeepSeek-V3

R1 + Sonnet > R1 or O1 or R1+R1 or O1+Sonnet or any other combo R1+Sonnet set SOTA on aider’s polyglot benchmark | aider

independent repros: 1) Notion – The all-in-one workspace for your notes, tasks, wikis, and databases. 2) https://buttondown.com/ainews/archive/ainews-tinyzero-reprod… 3) x.com

R1 distillations are going to hit us every few days - because it’s ridiculously easy (<$400, <48hrs) to improve any base model with these chains of thought eg with Sky-T1 recipe (writeup https://buttondown.com/ainews/archive/ainews-bespoke-stratos… , 23min interview w team https://www.youtube.com/watch?v=jrf76uNs77k)

i probably have more resources but dont want to spam - seek out the latent space discord if you want the full stream i pulled these notes from

This article linked in the htread is great:

and the Latent Space mailings are top.

Fantastic thread.

SoMaCoSF · January 26, 2025, 1:17am

NUTS:

koltanl · January 27, 2025, 4:23pm

This should hopefully massively accelerate everything for everyone then! Woot!

Topic		Replies	Views
Deepseek R1 on HN - GoodReads Discussions	0	153	January 20, 2025
Deepseek R1 beats o1 and sonnet 3.5 Feature Requests	6	2153	January 21, 2025
Fantasic thread on HN Discussions	1	51	January 25, 2025
O3 beat r1 again on coding Discussions	7	1072	February 1, 2025
Deepseek v3 beats sonnet 3.5 Feature Requests	25	7860	January 22, 2025

HN: Deepseek and great HN thread on it - must read

Related topics