Resources to Get a Job in ML, Post-Transformer Architectures, What's in Store for 2025, and More
Society's Backend Reading List 12-30-2024
Here's a comprehensive AI reading list from this past week. Thanks to all the incredible authors for creating these helpful articles and learning resources.
This one is particularly important, so the full reading list is available to ALL subscribers. It includes many great resources for getting a job in ML and plenty of useful information about AI in 2024 and what's in store for 2025.
Society's Backend is reader-supported. You can support my work (these reading lists and standalone articles) for 80% off for the first year (just $1/mo). You'll also get the extended reading list each week.
A huge thanks to all supporters. 🙂
What Happened Last Week
The most important event from this past week was the release of DeepSeek-V3, an open model that competes with OpenAI’s strongest models while being trained on 10x less compute. This shows what’s possible when we don’t just try to scale up resources, but instead try to improve efficiency. I’ve been saying efficiency will be the focus of AI in 2025 and I stand by it. I have another article coming out later this week with more detail about it.
Other important happenings this past week were the release of year-end AI recaps and 2025 AI predictions. Check these out below.
Last Week's Reading List
In case you missed it, here are some highlights from last week:
Reading List
321 real-world gen AI use cases from the world's leading organizations
Organizations are increasingly using generative AI to enhance processes in areas like customer service, employee empowerment, and data analysis. Companies such as ADT, Alaska Airlines, and Best Buy are developing AI solutions to improve customer experiences and operational efficiency. Google Cloud technologies, including Vertex AI and Gemini, are central to these innovations across various industries.
Competitive Programming Changed My Life Forever
By Alberto Gonzalez
Competitive programming transformed Alberto Gonzalez's life by enhancing his problem-solving skills and leading him to a top-tier software engineering job in Sweden. He leveraged his expertise to earn income through writing, launching a newsletter, publishing a book, and offering coaching. Alberto encourages others to harness their skills and seize opportunities in the digital world.
Physics Professors Are Using AI Models as Physics Tutors
Many physics professors are now using AI models as personal tutors to enhance their understanding of the subject. This trend highlights a significant shift in how experts are leveraging AI to learn and uncover new insights in physics. As AI continues to advance, it is poised to play a crucial role in fields that were once considered uniquely human domains.
How To Focus On The Right Problems
Focusing on the right problems is crucial for effective problem-solving. Prioritize issues that significantly impact your goals and objectives. By addressing the most important challenges first, you can achieve better outcomes.
5 Beginner-Friendly Projects to Learn LLMs & RAG
Five beginner-friendly projects are outlined to help learners gain experience with Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). The projects range from building a simple Q&A chatbot to fine-tuning LLMs for specific tasks, gradually increasing in complexity. Completing these projects will boost confidence and prepare learners for more advanced AI challenges.
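If you want a feel for the core of a RAG project before diving in, here's a minimal retrieval sketch. The `embed` function below is a hypothetical stand-in for a real sentence-embedding model, so treat this as an outline of the pattern rather than a working pipeline:

```python
import numpy as np

# Hypothetical embedding function: a real project would call a sentence-
# embedding model (e.g. sentence-transformers or an embeddings API).
def embed(text: str) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(384)
    return v / np.linalg.norm(v)

documents = [
    "RAG retrieves relevant documents before generating an answer.",
    "Fine-tuning adapts a pretrained model to a specific task.",
    "Tokenization splits text into units the model can process.",
]
doc_vectors = np.stack([embed(d) for d in documents])

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    scores = doc_vectors @ q  # unit-norm vectors, so dot product = cosine
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

# The retrieved context gets prepended to the user's question for the LLM.
context = "\n".join(retrieve("What does RAG do?"))
prompt = f"Answer using this context:\n{context}\n\nQuestion: What does RAG do?"
```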
The Ultimate Guide to Building a Machine Learning Portfolio That Lands Jobs
A strong portfolio is essential for machine learning job candidates to showcase their practical skills and differentiate themselves in a competitive market. Candidates should focus on a variety of projects, demonstrate their problem-solving process, and document their technical expertise clearly. Regularly updating the portfolio and sharing insights through blogs can enhance credibility and show readiness for real-world challenges.
7 Machine Learning Projects For Beginners
Seven beginner-friendly machine learning projects can help you learn essential skills and gain practical experience. Projects include Titanic survival prediction, stock price forecasting, email spam classification, and face detection, among others. Completing these projects will enhance your portfolio and boost your confidence in machine learning.
What is a Transformer?
Transformers are a powerful neural network architecture that revolutionized AI, particularly in deep learning and natural language processing. They use a self-attention mechanism to effectively capture relationships between tokens in a sequence, enabling tasks like text generation and more. Key components include token embeddings, attention layers, and a feed-forward network that refines token representations for accurate predictions.
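To make the self-attention mechanism concrete, here's a minimal single-head sketch in NumPy. Dimensions and weights are made up for illustration; real transformers add multiple heads, residual connections, and layer norm:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention.

    X: (seq_len, d_model) token embeddings; Wq/Wk/Wv: (d_model, d_head).
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])         # pairwise token affinities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ V                              # mix value vectors per token

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 16))    # 5 tokens, 16-dim embeddings
Wq, Wk, Wv = (rng.standard_normal((16, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)  # (5, 8): one refined vector per token
```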
How to Build Agentic AI [Agents]
Agentic AI focuses on creating effective AI systems by breaking down complex tasks into simpler steps and maintaining clear tool documentation. The approach emphasizes simplicity, transparency, and careful design of agent-computer interfaces to enhance user interaction and reduce errors. By balancing workflow-based and agent-based architectures, developers can build more efficient and reliable AI agents.
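As a rough illustration of the agent-loop pattern described here, below is a toy sketch. `call_llm` is a hard-coded stand-in for a real model API, and the tool registry shows the "clear tool documentation" idea in miniature:

```python
# Toy agent loop: the model picks a tool, we run it, and feed the result
# back until the model produces a final answer.
def call_llm(prompt: str) -> str:
    # Stand-in for a real LLM call; here we hard-code a two-step exchange.
    if "returned" in prompt:
        return "The answer is 5.0"
    return "TOOL:add 2 3"

TOOLS = {
    "add": lambda a, b: str(float(a) + float(b)),  # single-purpose, documented
}

def run_agent(task: str, max_steps: int = 5) -> str:
    transcript = task
    for _ in range(max_steps):
        reply = call_llm(transcript)
        if reply.startswith("TOOL:"):               # model requested a tool
            name, *args = reply.removeprefix("TOOL:").split()
            transcript += f"\nTool {name} returned: {TOOLS[name](*args)}"
        else:
            return reply                            # model gave a final answer
    return transcript

print(run_agent("What is 2 + 3?"))
```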
2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]
New models like RWKV and state space models are emerging as efficient alternatives to traditional transformers, offering constant memory usage during inference. These architectures focus on improving computational efficiency and understanding attention mechanisms for better performance on tasks requiring recall. The developments in this field aim to make AI more accessible and effective across various applications and contexts.
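The constant-memory property is easy to see in code. Here's a toy linear-attention-style recurrence (a heavy simplification of what RWKV and SSMs actually do, with a made-up scalar decay) where the state never grows with sequence length:

```python
import numpy as np

# Each step folds the new token into a fixed-size state instead of
# attending over the whole history, so inference memory is O(1) in length.
d_k, d_v = 8, 8
state = np.zeros((d_k, d_v))  # fixed-size state, independent of seq length
decay = 0.9                   # stand-in for a learned decay/forgetting factor

rng = np.random.default_rng(0)
for _ in range(1000):         # sequence can grow without using more memory
    k = rng.standard_normal(d_k)
    v = rng.standard_normal(d_v)
    q = rng.standard_normal(d_k)
    state = decay * state + np.outer(k, v)  # accumulate key-value products
    y = q @ state                           # read out: O(d_k * d_v) per step
```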
Linux Context Switching Internals: Part 1 - Process State and Memory
The article explains how the Linux kernel manages processes through two main data structures: task_struct for execution state and mm_struct for memory state. These structures are essential for context switching, which allows the CPU to switch control between different processes safely. Understanding these fundamentals is crucial for grasping how the Linux kernel efficiently handles process management.
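For intuition only, here's a conceptual Python sketch of the state a context switch must save and restore. The real kernel structures are C and far richer than this; the field names below loosely mirror a few members of task_struct and mm_struct:

```python
from dataclasses import dataclass, field

@dataclass
class MMStruct:              # memory state, analogous to the kernel's mm_struct
    page_table_root: int     # where this process's address-space mapping lives
    start_stack: int
    start_brk: int           # heap boundary

@dataclass
class TaskStruct:            # execution state, analogous to task_struct
    pid: int
    state: str               # e.g. "RUNNING", "SLEEPING"
    saved_registers: dict = field(default_factory=dict)
    mm: MMStruct | None = None

def context_switch(prev: TaskStruct, nxt: TaskStruct, cpu_regs: dict) -> dict:
    """Save prev's CPU state, then load nxt's: the essence of a switch."""
    prev.saved_registers = dict(cpu_regs)  # save outgoing execution state
    # (A real kernel would also switch page tables via nxt.mm here.)
    return dict(nxt.saved_registers)       # restore the next task's registers
```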
Nagasaki Was Bombed Against Direct Orders - Adam Brown
The bombing of Nagasaki may have occurred against direct orders, as the crew initially aimed for a different city but faced visibility issues. After multiple attempts, they decided to drop the bomb despite not clearly seeing their target, raising suspicions about their actions. This incident suggests that a significant portion of nuclear weapons used in combat might have been deployed without proper authorization.
6 AI Trends that will Define 2025
After 18 months of Generative AI advancements, key trends are emerging that will shape the future of AI in 2025. The article outlines six important research directions and engineering approaches that tech professionals, investors, and policymakers should focus on to stay competitive. Insights from previous years have shown the significance of trends like small language models and open-source innovations, particularly Meta's Llama model.
Deep: The UX of Search
The UX of in-product search is crucial for user engagement and retention, as effective search capabilities can enhance user satisfaction. This analysis reviews how over 20 leading tech companies, like LinkedIn and Amazon, are evolving their search functionalities with AI, predictive text, and advanced filtering options. It highlights various UX components that improve search experiences, making it easier for users to find what they need quickly.
AI Agents with Reasoning, Solving Complex Tasks with EASE!
AI agents can now perform complex tasks using human-like reasoning and decision-making. They can analyze errors and suggest solutions, enhancing their problem-solving capabilities. The article provides a guide to creating these agents with simple coding steps and examples.
A Comprehensive Analytical Framework for Mathematical Reasoning in Multimodal Large Language Models
Mathematical reasoning in artificial intelligence is evolving, especially with Large Language Models (LLMs) that now incorporate multimodal inputs like diagrams and graphs. Researchers have identified five major challenges that hinder progress in this field, such as limited visual reasoning and domain generalization issues. A comprehensive analysis reveals significant advancements in specialized Math-LLMs, highlighting the need for more robust models to tackle these complexities.
Top 25 AI Tools for Content Creators in 2025
AI-powered tools are transforming content creation by making it easier for creators to produce high-quality videos, graphics, and written content. The top 25 tools highlighted offer features that enhance creativity, save time, and streamline workflows across various applications. From video editing to music composition, these solutions empower users to focus on their creative strategies.
DeepSeek-AI Just Released DeepSeek-V3: A Strong Mixture-of-Experts (MoE) Language Model with 671B Total Parameters with 37B Activated for Each Token
DeepSeek-AI has launched DeepSeek-V3, a powerful open-source language model with 671 billion parameters and 37 billion activated per token. This model addresses key challenges in NLP, such as computational efficiency and data utilization, while achieving impressive benchmark scores. With a training cost of $5.576 million, DeepSeek-V3 sets a new standard for accessible high-performance language models.
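The "huge total, small active" trick is top-k expert routing: every token is sent to only a few experts, so most parameters sit idle on any given forward pass. Here's a toy NumPy sketch of an MoE layer with invented sizes, not DeepSeek's actual architecture:

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_experts, k = 16, 8, 2
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]
router = rng.standard_normal((d, n_experts))

def moe_layer(x: np.ndarray) -> np.ndarray:
    logits = x @ router
    top = np.argsort(logits)[-k:]                             # pick top-k experts
    gates = np.exp(logits[top]) / np.exp(logits[top]).sum()   # renormalize gates
    # Only the k selected experts do any computation for this token.
    return sum(g * (x @ experts[i]) for g, i in zip(gates, top))

y = moe_layer(rng.standard_normal(d))  # 2 of 8 experts active, like 37B of 671B
```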
AWS Researchers Propose LEDEX: A Machine Learning Training Framework that Significantly Improves the Self-Debugging Capability of LLMs
Researchers from Purdue University, AWS AI Labs, and the University of Virginia introduced LEDEX, a new training framework aimed at enhancing the self-debugging capabilities of Large Language Models (LLMs). LEDEX improves code explanation and refinement through automated data collection and a combination of supervised fine-tuning and reinforcement learning. Initial results show significant performance improvements in LLMs, making them more effective at identifying and correcting code errors.
Google DeepMind Introduces Differentiable Cache Augmentation: A Coprocessor-Enhanced Approach to Boost LLM Reasoning and Efficiency
Google DeepMind has developed a new method called Differentiable Cache Augmentation, which enhances large language models (LLMs) by using a trained coprocessor to improve their reasoning abilities without increasing computational costs. This approach allows the LLM to generate a key-value cache that is enriched with additional latent embeddings, resulting in significant performance improvements on complex reasoning tasks. Evaluations showed accuracy gains, such as a 10.05% increase on the GSM8K dataset, demonstrating the method's effectiveness and scalability.
Meet SemiKong: The World’s First Open-Source Semiconductor-Focused LLM
SemiKong is the first open-source large language model specifically designed for the semiconductor industry, addressing the knowledge gap caused by retiring experts. It has been fine-tuned with semiconductor-specific data to improve chip design efficiency, reducing time-to-market by up to 30%. The integration of SemiKong with domain-expert agents streamlines processes and enhances the onboarding of new engineers, ultimately promoting innovation in semiconductor manufacturing.
Unveiling Privacy Risks in Machine Unlearning: Reconstruction Attacks on Deleted Data
Machine unlearning allows individuals to request the deletion of their data's influence on machine learning models, but it introduces new privacy risks. Researchers found that adversaries can exploit changes in model parameters before and after data deletion to accurately reconstruct deleted data, even from simple models. This highlights the need for safeguards like differential privacy to protect against potential privacy breaches.
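For intuition on why the parameter delta leaks information, here's a toy demonstration on an ordinary linear model (a simplified illustration, not the paper's attack): with roughly whitened features, the weight change from deleting one point lines up with that point's feature vector.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5000, 10
X = rng.standard_normal((n, d))
y = X @ rng.standard_normal(d) + 0.1 * rng.standard_normal(n)

fit = lambda X, y: np.linalg.lstsq(X, y, rcond=None)[0]
w_before = fit(X, y)
w_after = fit(X[1:], y[1:])   # "unlearn" the first training point

delta = w_before - w_after    # what an adversary with both models can compute
cos = delta @ X[0] / (np.linalg.norm(delta) * np.linalg.norm(X[0]))
print(abs(cos))               # near 1: the delta points along the deleted X[0]
```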
Microsoft and Tsinghua University Researchers Introduce Distilled Decoding: A New Method for Accelerating Image Generation in Autoregressive Models without Quality Loss
Researchers from Microsoft and Tsinghua University have developed a method called Distilled Decoding (DD) to speed up image generation in autoregressive models without sacrificing quality. DD dramatically reduces the number of steps needed for image creation, achieving up to 217.8 times faster generation while maintaining acceptable output fidelity. This innovative approach allows for practical applications in real-time scenarios, addressing the long-standing trade-off between speed and quality in image synthesis.
Neural Networks for Scalable Temporal Logic Model Checking in Hardware Verification
A new machine learning approach uses neural networks to improve hardware verification by generating proof certificates for Linear Temporal Logic (LTL) specifications. This method enhances formal correctness and outperforms existing model checkers in speed and scalability across various hardware designs. Despite some limitations, it marks a significant advancement in integrating neural networks with symbolic reasoning for model checking.
Measure Up
Musical instruments have evolved over time, allowing for more complex and expressive music, as seen in Beethoven's work. The improvements in pianos during Beethoven's era enabled him to create pieces that were technically impossible for earlier instruments. Similarly, modern technology, like laptops and AI, enhances our ability to create and explore new ideas, paralleling the advancements in musical instruments.
Optimizing Machine Learning Models for Production: A Step-by-Step Guide
Optimizing machine learning models for production involves careful planning and execution throughout their development lifecycle, starting from understanding the business problem. Key steps include data preparation, model training, and continuous monitoring to ensure efficiency and adaptability. Incorporating security measures and a CI/CD approach further enhances the reliability and performance of deployed models.
10 Podcasts That Every Machine Learning Enthusiast Should Subscribe To
Podcasts are an engaging way to learn about the fast-evolving field of machine learning. The article lists ten recommended podcasts that cater to everyone from beginners to experts, covering diverse topics like AI applications, ethical issues, and practical advice. Listening to these podcasts helps enthusiasts stay informed about the latest trends and developments in AI.
5 Tools for Visualizing Machine Learning Models
Machine learning models can be complex, and visualizing them helps in understanding their structure and performance. Five recommended tools for this purpose include TensorBoard, SHAP, Yellowbrick, Netron, and LIME, each offering unique features for model evaluation and interpretability. These tools enhance insights into model behavior and predictions, making it easier to work with machine learning.
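As a taste of one of these tools, here's a minimal SHAP example, assuming shap and scikit-learn are installed (the model and dataset choices are arbitrary):

```python
import shap
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

X, y = load_diabetes(return_X_y=True, as_frame=True)
model = RandomForestRegressor(n_estimators=50).fit(X, y)

explainer = shap.Explainer(model)      # dispatches to a tree explainer here
shap_values = explainer(X.iloc[:100])  # per-feature contributions per row
shap.plots.beeswarm(shap_values)       # global view of feature importance
```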
6 Language Model Concepts Explained for Beginners
Large language models (LLMs) predict word sequences by learning patterns from extensive text data. Key concepts include tokenization, word embeddings, attention mechanisms, transformer architecture, and the processes of pretraining and fine-tuning. Understanding these foundational ideas helps unlock the potential of LLMs in various applications.
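Two of these concepts, tokenization and embeddings, fit in a few lines. A toy sketch (real LLMs use learned subword tokenizers like BPE and trained embedding matrices, not random ones):

```python
import numpy as np

# Tokenization: text becomes integer ids over a fixed vocabulary.
corpus = "the cat sat on the mat".split()
vocab = {word: i for i, word in enumerate(sorted(set(corpus)))}

def tokenize(text: str) -> list[int]:
    """Whitespace tokenizer; real models use subword schemes like BPE."""
    return [vocab[w] for w in text.split()]

# Embeddings: each id indexes into a matrix of vectors the model learns.
rng = np.random.default_rng(0)
embeddings = rng.standard_normal((len(vocab), 8))  # one 8-dim vector per token

ids = tokenize("the cat sat")
vectors = embeddings[ids]  # (3, 8): the input representation attention acts on
```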
This open problem taught me what topology is
An open problem in topology asks whether every closed continuous loop has an inscribed square. This question involves mapping pairs of points on the loop to three-dimensional space, leading to the discovery of self-intersection points that correspond to inscribed rectangles. The relationship between these pairs can be represented using a Möbius strip, illustrating how different geometric concepts are interconnected.
The 10 most viewed blog posts of 2024
Amazon is seeking talented Applied Scientists to develop innovative machine learning solutions that enhance customer experiences and optimize product returns. The roles involve collaborating with diverse teams to create AI-powered systems for various applications, including health initiatives and core shopping features. Candidates will have opportunities for career growth while working on high-impact projects in a fast-paced environment.
The 10 most viewed publications of 2024
The most viewed publications by Amazon scientists in 2024 cover diverse topics such as e-commerce knowledge graphs, cloud databases, and recession prediction. Notable works include a new text-to-speech model and advancements in anomaly detection using graph diffusion models. These research papers highlight innovative systems and frameworks aimed at improving performance and user experience across various applications.
Top 25 AI Tools for Increasing Sales in 2025
AI technologies are transforming sales and customer relationships, allowing businesses to improve lead generation and engagement. The article highlights 25 AI tools that help companies automate processes, gain insights, and enhance customer experiences. These tools empower businesses of all sizes to compete effectively in the marketplace.
8 Insights to Make Sense of OpenAI o3
OpenAI o3 has achieved remarkable performance on challenging benchmarks, surpassing human and specialist capabilities. The article explores the implications of this breakthrough and raises important questions about the future of AI, including accessibility and competition. It aims to provide insights to help readers navigate the evolving landscape of artificial intelligence.
Thanks for reading!
Always be (machine) learning,
Logan