Tag

Explore by tags

All

30u30

ASR

ChatGPT

GNN

IDE

RAG

agent-skills

ai

ai-agent

ai-api

ai-api-management

ai-client

ai-coding

ai-demos

ai-deploy

ai-development

ai-framework

ai-image

ai-image-demos

ai-inference

ai-leaderboard

ai-library

ai-rank

ai-serving

ai-tools

ai-train

ai-video

ai-workflow

AIGC

algorithms

alibaba

amazon

android

anthropic

audio

aws

benchmark

benchmarks

biology

blog

book

bytedance

chatbot

chatgpt

chemistry

claude

claude-code

cli

code

codex

coding

coding-agents

copilot

course

cpu

cuda

cursor

deepmind

deepseek

depth

devops

diffusers

distillation

docker

drug-discovery

electron

embeddings

engineering

evaluation

facebook

finance

flow-matching

foundation

foundation-model

gcode

gemini

gemini-cli

gemma

genomics

gitHub

github

go

google

gradient-booting

grok

groq

huggingface

image

ios

java

javascript

json

kimi

llama.cpp

LLM

llm

long-horizon

lora

mLOps

math

mcp

mcp-client

mcp-server

meta-ai

meta-pytorch

metal

microsoft

mlops

mobile

multilingual

multimodal

mysql

NLP

nlp

nodejs

numpy

nvidia

ocr

ollama

openai

opencode

pandas

paper

parquet

physics

pi

plugin

polars

postgres

privacy

programming

prompt-engineering

pwa

python

pytorch

qwen

react

reasoning

redis

retrieval

RL

rl

robotics

rust

science

security

segmentation

shodan

skillkit

software-engineering

sora

speech

sqlite

ssh

stt

swe

swift

tensorrt

terminal

transformers

translation

tts

tutorial

typescript

vibe-coding

video

vision

vllm

voice

vulkan

web-search

windsurf

xAI

xai

AI Coding Dictionary

Explains AI coding jargon in plain English, giving concrete engineering meanings for terms like context window, tool call, and attention degradation. Structured as a browsable dictionary with pragmatic examples and guidance for developers building LLM-driven systems.

ai-coding ai-tools prompt-engineering agent-skills github+2

book-to-skill

Converts technical books and document collections into an on-demand agent “skill” that Claude Code, GitHub Copilot CLI, and Amp can load to answer questions from the original content. Produces a compact SKILL.md plus per-chapter files so agents load only the needed sections, cutting token use and reducing hallucination risk.

agent-skills claude-code copilot cli book+5

χ-Bench

Evaluates LLM-driven agents on long-horizon, policy-rich U.S. healthcare workflows using 75 clinical task fixtures and a 20-app MCP simulator; includes task fixtures, shared worlds, and leaderboard integration (Managed-Care handbook is gated).

huggingface mcp agent-skills ai-agent ai-leaderboard+3

DeepSeek V4 Pro Hermes Reasoning Traces

Provides 19,331 multi-turn ChatML Hermes reasoning traces produced by DeepSeek V4 Pro for LoRA fine-tuning of agent-style models; includes VRAM-tiered variants, train/valid/test splits, and dense tool-calling annotations in Parquet format.

deepseek huggingface llm ai-agent agent-skills+3

deepseek-hermes-reasoning-traces

Provides 19,331 multi-turn ChatML Hermes reasoning traces for LoRA fine-tuning of local models to behave as Hermes agents. Includes train/valid/test splits, VRAM-tiered variants (nano→spark), ~138K tool-call annotations, and Parquet format under Apache-2.0.

deepseek ai-agent llm huggingface pandas+3

Knowledge Catalog

Google Cloud (Google LLC), GoogleCloudPlatform (GitHub organization)

Provides tools and samples to build context management, enrichment, and retrieval solutions on Google Cloud Knowledge Catalog — an AI-oriented data catalog that builds a dynamic knowledge graph for structured and unstructured data, suitable for RAG and agent workflows.

google github ai ai-development RAG+5

TokenSpeed

LightSeek Foundation

High-throughput LLM inference engine for agentic workloads, combining a local‑SPMD static compiler for parallelism, a C++ scheduler with a Python execution plane and type‑safe KV‑cache reuse, pluggable high-performance kernels (including an MLA implementation), and a low‑overhead AsyncLLM entrypoint for production GPU inference.

llm ai-inference ai-serving agent-skills tensorrt+7

AI Agent Tutorials2026

cwc-workshops

jeffcurry-ant, michael-cohen-io +4Anthropic PBC

Collection of hands-on workshop materials and sample code from Anthropic's "Code with Claude" series, covering Claude Managed Agents, memory (Dreaming Service), eval-driven agent development, and multi-agent patterns. Not maintained and not accepting contributions.

anthropic claude claude-code agent-skills mcp+6

Agents Last Exam — Task Card Metadata

agents-last-exam (RDI Berkeley)

Provides task-card metadata for 147 long-horizon professional tasks from the Agents Last Exam benchmark — titles, prompts, taxonomy, and input-file descriptors. This v1.0 release is metadata-only; companion repos host input files and gated reference outputs.

huggingface ai-agent agent-skills pandas ai-rank

Multi-Benchmark LLM Agent Traces

Provides 1,781 OpenTelemetry execution traces of LLM-powered agents across six benchmarks, including full conversations, token usage, timing, tool calls and model metadata—useful for performance analysis, agent-behavior research, and inference debugging.

llm ai-inference mlops agent-skills ai-agent+4

SkillOpt

Yang Yifan, Gong Ziyang +8Microsoft

Trains reusable natural-language 'skills' for frozen LLM agents by optimizing the skill document in text-space — using trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts. Multi-backend, zero inference-time cost at deployment, designed for iterative, validation-led skill improvement.

agent-skills ai-agent ai-train llm python+6

Native SDK

Vercel Labs, Vercel

Build native desktop apps authored with declarative .native markup and TypeScript (or Zig) compiled to native code, with no browser or JS runtime in the binary. Ships a component catalog, deterministic rendering, hot reload, and an embedded automation server for AI agent workflows.

typescript cli nodejs metal agent-skills+2