Tag

Explore by tags

AI Model2026

SenseNova-U1-8B-MoT

End-to-end multimodal model for native text↔image understanding, interleaved image-text generation, and image editing. Uses the NEO-Unify MoT architecture to avoid separate visual encoders/VAE. Suited for multimodal prototyping, demos, and research (Apache‑2.0).

multimodal transformers huggingface vision ai-image+4

MCP Server2026

Agent Toolkit for AWS

Amazon Web Services (AWS)

Provides a single MCP endpoint that lets AI coding agents search AWS docs, run sandboxed Python scripts, and make authenticated AWS API calls with enterprise guardrails like IAM condition keys, CloudWatch metrics, and CloudTrail auditing.

aws mcp mcp-server agent-skills plugin+6

AI Model2026

google/gemma-4-26B-A4B-it-assistant

Google DeepMind

Acts as the assistant (drafter) checkpoint for Gemma 4 26B A4B on Hugging Face, used in Speculative Decoding to pre-draft tokens and speed up generation. Designed for long-context, multimodal workflows where lower latency and on-device or edge inference matter.

deepmind google transformers huggingface multimodal+5

AI Video2026

claude-video

Brad BonannoSolaris Automation

Enables Claude to “watch” videos by extracting timestamped frames plus captions/transcripts and feeding them to Claude for grounded Q&A. Key features: native captions first, Whisper fallback, frame deduplication, and multiple detail modes (transcript/efficient/balanced/token-burner). Useful for summarizing, debugging, and extracting moments.

claude claude-code agent-skills ai-video multimodal+3

AI Model2026

kai-os/Carnice-V2-27b-GGUF

kai-os

Provides multiple GGUF-quantized exports of Carnice V2 (a merged BF16 SFT of Qwen3.6-27B) optimized for llama.cpp and Hermes-style agent traces, with quant tiers targeted at 16–24GB local GPUs and agentic inference.

huggingface llm ai-inference ai-serving ai-deploy+3

AI Agent2026

Hallmark

Together AI

Generates page-scale UI designs and audits for Claude Code, Cursor, and Codex using a 57-gate “anti-AI-slop” rule set — produces distinct, non-template HTML+CSS outputs and supports audit, redesign, and study verbs with a built-in pre-emit self-critique.

agent-skills skillkit prompt-engineering claude-code cursor+3

AI Dataset2026

AgentTrove

open-thoughts (OpenThoughts-Agent team)

Provides 1.7M agent interaction traces in terminus-2 format for training and evaluating agentic LLMs and RL agents. Compiled from 219 source datasets across code repair, shell, math, competitive programming and general tasks; produced with the Harbor harness.

ai-agent agent-skills huggingface code RL+1

AI Dataset2026

TaskTrove

open-thoughts

Aggregates 750k+ Harbor-compatible agentic tasks from 100+ public sources (Parquet shards preserved). Includes tasks with and without verifiers for RL evaluation or SFT/datagen workflows, enabling reproducible trace generation.

huggingface RL ai-agent agent-skills code+2

AI Client2026

Deep Code CLI

Vegamo

Terminal-native AI coding assistant optimized for the deepseek-v4 model. Provides configurable "thinking" modes and reasoning-intensity controls, agent skills for extensibility, MCP integration, and a shared config with a VSCode plugin.

deepseek agent-skills cli terminal nodejs+4

AI Model2026

Ling-2.6-flash

inclusionAI

An instruct-focused LLM (104B total, 7.4B active) optimized for fast, token-efficient inference in agent workflows. Uses hybrid linear attention plus a sparse MoE to raise throughput and cut token use; suited for high-frequency production agents, with some trade-offs in very deep reasoning.

foundation-model huggingface llm ai-inference vllm+5

AI Model2026

Ling-2.6-1T

inclusionAI

A trillion-parameter LLM optimized for long-context, low-latency text generation and agentic coding workflows. Combines MLA+Linear Attention and a post-training 'fast thinking' token-suppression strategy to reduce token overhead and improve multi-step execution reliability for production agents.

transformers llm huggingface vllm ai-inference+5

AI Model2026

Qwen3.5-9B-DeepSeek-V4-Flash

Jackrong

Distills DeepSeek‑V4's multi-step structured reasoning into a Qwen3.5‑9B model for fast image-text-to-text reasoning and agentic tool workflows. Trades larger teacher size for inference efficiency and improved procedural reasoning — good for low-latency research, evaluation, and agent integration.

deepseek transformers llm multilingual huggingface+5

Tag

Explore by tags

Tag

Explore by tags

All

30u30

ASR

ChatGPT

GNN

IDE

RAG

agent-skills

ai

ai-agent

ai-api

ai-api-management

ai-client

ai-coding

ai-demos

ai-deploy

ai-development

ai-framework

ai-image

ai-image-demos

ai-inference

ai-leaderboard

ai-library

ai-rank

ai-serving

ai-tools

ai-train

ai-video

ai-workflow

AIGC

algorithms

alibaba

amazon

android

anthropic

audio

aws

benchmark

benchmarks

biology

blog

book

bytedance

chatbot

chatgpt

chemistry

claude

claude-code

cli

code

codex

coding

copilot

course

cuda

cursor

deepmind

deepseek

depth

devops

diffusers

distillation

docker

drug-discovery

electron

embeddings

engineering

evaluation

facebook

finance

flow-matching

foundation

foundation-model

gcode

gemini

gemini-cli