I’m often asked for a way to deal with the barrage of new models/capabilities/evals/buzzwords in AI. This post answers that.
Keep in mind that my focus is on stuff that has a relevant and immediate application to industry, which means that there should be a model I can run, an API I can call, or something of the sort. I’m not personally interested in staying up to date with academic developments; if you need to improve your signal-to-noise ratio on arXiv, this post will not be super helpful.
X.com, the everything app, should be your main source of news. Everyone relevant in the space is there making announcements, discussing stuff, or just generally shitposting. I don’t have a definitive list of people to follow, but if you follow the accounts below, the algorithm should curate your timeline over time.
In no particular order:
- @giffmana
- @main_horse
- @AlpinDale
- @skalskip92
- @tomaarsen
- @menhguin
- @Dorialexander
- @willccbb
- @_xjdr
- @Tim_Dettmers
- @casper_hansen_
- @vikhyatk
- @ggerganov
- @RisingSayak
- @jeremyphoward
- @jobergum
- @charles_irl
- @wightmanr
- @mervenoyann
- @rasbt
- @karpathy
- @HamelHusain
- @kalomaze
Depending on your focus, r/StableDiffusion (image generation and editing) and r/LocalLLaMA (LLMs and VLMs) are very good subreddits to follow. They mostly focus on smaller models that can be run on consumer hardware, tools and libraries, finetunes, etc., though lately they have also started allowing discussions of commercial (no open weights) models.
I only subscribe to one newsletter. smol.ai gathers news from various sources (including Discord) and sends you a nice email every weekday, which is great if you don’t have the time for X or Reddit.
With these sources I’m very rarely out of the loop on new developments.