Tag

Explore by tags

All

30u30

ASR

ChatGPT

GNN

IDE

RAG

agent-skills

ai

ai-agent

ai-api

ai-api-management

ai-client

ai-coding

ai-demos

ai-deploy

ai-development

ai-framework

ai-image

ai-image-demos

ai-inference

ai-leaderboard

ai-library

ai-rank

ai-serving

ai-tools

ai-train

ai-video

ai-workflow

AIGC

algorithms

alibaba

amazon

android

anthropic

audio

aws

biology

blog

book

bytedance

chatbot

chatgpt

chemistry

claude

claude-code

cli

code

codex

copilot

course

cursor

deepmind

deepseek

depth

devops

diffusers

docker

drug-discovery

electron

embeddings

engineering

evaluation

facebook

finance

foundation

foundation-model

gemini

gemini-cli

gemma

genomics

gitHub

github

go

google

gradient-booting

grok

groq

huggingface

image

ios

java

javascript

json

LLM

llm

mLOps

math

mcp

mcp-client

mcp-server

meta-ai

meta-pytorch

microsoft

mlops

mobile

multilingual

multimodal

mysql

NLP

nlp

nodejs

nvidia

ocr

ollama

openai

opencode

pandas

paper

physics

pi

plugin

polars

postgres

privacy

prompt-engineering

pwa

python

pytorch

qwen

RL

robotics

rust

science

security

shodan

skillkit

sora

speech

sqlite

ssh

stt

swe

tensorrt

terminal

transformers

translation

tts

tutorial

typescript

vibe-coding

video

vision

vllm

voice

xAI

xai

Crawlee

2016

Apify

Unified Node.js library for web crawling and browser automation that fetches pages and files via headless browsers or raw HTTP. Provides persistent queues, proxy rotation, session management, storage, and human-like fingerprints to build scalable data pipelines (e.g., RAG/LLM datasets).

javascript nodejs typescript github mLOps+3

NVIDIA Agent Skills

2026

NVIDIA

Provides a catalog of NVIDIA-verified, portable “skills” — instruction sets that teach AI agents how to use NVIDIA libraries, models and platform tools. Each skill is published with detached signatures and evaluation artifacts for verifiable reuse in agent workflows.

nvidia agent-skills skillkit ai-agent ai-workflow+4

no-mistakes

2026

kunchenguid

Acts as a local git proxy that runs an AI-driven validation pipeline in a disposable worktree, only forwarding the branch and opening a PR after every check passes. Runs review, tests, docs, and lint in isolation, applies safe auto-fixes, supports multiple agent providers, and pauses for human approval when intent would change.

go cli agent-skills ai-workflow mLOps+2

CubeSandbox

2026

Tencent Cloud

Provides hardware-isolated, sub-60ms, ultra-low-overhead sandboxes to run untrusted LLM/agent code. Offers event-level snapshots, kernel-level egress control, credential vaulting, and drop-in E2B SDK compatibility for high-density AI agent deployment.

rust ai-agent ai-deploy security mLOps+1

FreeLLMAPI

2026

tashfeenahmed

Provides a single OpenAI-compatible /v1 API that aggregates the free tiers of 16 LLM providers into one unified endpoint. Features smart routing and automatic failover, per-key free-tier tracking, encrypted key storage, embeddings/media routing, and a Docker one-liner for local use.

ai-api llm openai docker mLOps+6

BugTraceAI-CORE-Ultra-27B-Q6

2026

BugTraceAI

Generates production-ready offensive-security artifacts from prompts—Nuclei templates, CVE PoCs, exploit scripts and pentest tooling—fine-tuned on bug-bounty reports and CVE writeups and quantized for consumer/server GPU deployment.

qwen security huggingface llm ai-tools+4

ITBench-AA

2026

Artificial Analysis, IBM

Provides 40 public Kubernetes incident scenarios (SRE subset) with ground-truth root-cause entities and offline cluster snapshots in JSONL format; designed to evaluate agentic root-cause diagnosis on alerts, events, traces and topology.

evaluation huggingface agent-skills json pandas+4

TurboServe: Serving Streaming Video Generation Efficiently and Economically

2026

1Shanghai Jiao Tong University, 2Shengshu Technology +1

Youhe Jiang, Haoxu Wang +6

Serves interactive, long-lived streaming video-generation sessions by jointly scheduling session placement and GPU autoscaling to meet tight per-chunk latency. Combines migration-aware placement, load-driven autoscaling, coalesced chunk processing, GPU–CPU offloading and NCCL GPU–GPU migration; reports ~37% reductions in worst-case per-chunk latency and GPU operating cost.

video ai-video ai-serving ai-inference mLOps+4

AFTER

2026

Julia Belikova, Rauf Parchiev +5

Benchmark for evaluating procedural skill evolution in LLM agents: isolates reusable skill bodies, role-specific work surfaces, and hidden oracle assets to measure whether skill refinements transfer across tasks, roles, and model backbones. Includes 382 workplace tasks, 22 skills, and a controlled evaluation protocol.

evaluation agent-skills huggingface llm ai-agent+2

NVIDIA GLM-5.2 NVFP4

2026

NVIDIA, Z.ai

Provides a pre-quantized NVFP4 checkpoint of GLM-5.2 for long-context reasoning and coding; reduces model footprint so GLM-5.2 can run on multi‑GPU Blackwell nodes and is ready for inference with SGLang and vLLM.

nvidia huggingface llm vllm tensorrt+5

nvidia/Qwen3.6-27B-NVFP4

2026

NVIDIA, Alibaba Group (Qwen Team)

NVFP4-quantized variant of Qwen3.6-27B that reduces parameter bits from 16 to 4, cutting disk and GPU memory requirements by ~2.5× while keeping comparable benchmark accuracy; ready for vLLM-based inference on NVIDIA hardware and supports long, multimodal contexts.

nvidia qwen vllm huggingface llm+6

Ornith-1.0-397B

2026

DeepReinforce Team

Provides an open-source Mixture-of-Experts coding LLM (397B) optimized for agentic, tool-enabled coding workflows with a 262,144-token context window, OpenAI-compatible API, serving recipes (vLLM/SGLang), and published coding-benchmark results.

ai-coding agent-skills vllm transformers qwen+6