LogoAIAny
  • Search
  • Collection
  • Category
  • Tag
  • Blog
LogoAIAny

Learn Anything about AI in one site

Best learning resources for AI

LogoAIAny

Learn Anything about AI in one site.

support@aiany.app
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
Company
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2025 All Rights Reserved.

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

2025
DeepSeek-AI, Daya Guo +198

This paper introduces DeepSeek-R1, a large language model that improves reasoning purely through reinforcement learning (RL), even without supervised fine-tuning. It shows that reasoning skills like chain-of-thought, self-reflection, and verification can naturally emerge from RL, achieving performance comparable to OpenAI’s top models. Its distilled smaller models outperform many open-source alternatives, democratizing advanced reasoning for smaller systems. The work impacts the field by proving RL-alone reasoning is viable and by open-sourcing both large and distilled models, opening new directions for scalable, cost-effective LLM training and future development in reasoning-focused AI systems.

NLPLLMdeepseekpaper
Icon for item

Aider

2025
Open-source community

Aider is a terminal-first AI pair-programming tool that edits your local Git repo through chat.

ai-toolsai-coding

Deep Dive into LLMs like ChatGPT

2025
Andrej Karpathy

The best introduction to how large language models (LLMs) like ChatGPT works in the world. It covers the three main stages of their training: pre-training on vast amounts of internet text, supervised fine-tuning to become helpful assistants, and reinforcement learning to improve problem-solving skills. The video also discusses LLM psychology, including why they hallucinate, how they use tools, and their limitations. Finally, it looks at future capabilities like multimodality and agent-like behavior.

LLMvideoChatGPTtutorial
Icon for item

AIBrix

2025
vLLM Project

vLLM-project’s control-plane that orchestrates cost-efficient, plug-and-play LLM inference infrastructure.

ai-developmentai-inferenceai-serving
Icon for item

Tabby

2025
TabbyML Inc.

Tabby is an open-source, self-hosted code-completion engine that runs on your GPU or CPU.

ai-toolsai-coding

How I use LLMs

2025
Andrej Karpathy

The best introduction on how to use LLMs like ChatGPT. It covers the basics of how LLMs work, including concepts like "tokens" and "context windows". The video then demonstrates practical applications, such as using LLMs for knowledge-based queries, and more advanced features like "thinking models" for complex reasoning. It also explores how LLMs can use external tools for internet searches and deep research. Finally, the video delves into the multimodal capabilities of LLMs, including their use of voice, images, and video.

LLMvideoChatGPTtutorial
Icon for item

DBHub

2025
Bytebase

Universal database gateway MCP server that lets agents explore MySQL, Postgres, SQL Server, MariaDB and more.

mcp-server
Icon for item

BlenderMCP

2025
Sid Ahuja

Model Context Protocol (MCP) bridge that lets Claude AI inspect, create and manipulate Blender scenes programmatically.

mcp-server
Icon for item

NVIDIA Dynamo

2025
NVIDIA

NVIDIA Dynamo is an open-source, high-throughput, low-latency inference framework that scales generative-AI and reasoning models across large, multi-node GPU clusters.

ai-developmentai-inferenceai-servingnvidia
Icon for item

Continue

2025
Continue Dev Inc.

Continue is an open-source IDE extension and hub for creating custom AI coding assistants.

ai-toolsai-codingplugin
Icon for item

Desktop Commander MCP

2025
wonderwhy-er

Cross-platform desktop automation MCP server that lets AI run terminal commands, manage processes and edit local files.

mcp-server
Icon for item

Mobile MCP

2025
Mobile Next

An MCP server for large-scale mobile automation and scraping on iOS & Android emulators, simulators and real devices.

mcp-server
  • Previous
  • 1
  • 2
  • More pages
  • 15
  • 16
  • 17
  • Next