AIAny

Tag

Explore by tags

Curated AI Resources for Everyone

Copyright © 2026 All Rights Reserved.
  • All

  • 30u30

  • ASR

  • ChatGPT

  • GNN

  • IDE

  • RAG

  • agent-skills

  • ai

  • ai-agent

  • ai-api

  • ai-api-management

  • ai-client

  • ai-coding

  • ai-demos

  • ai-deploy

  • ai-development

  • ai-framework

  • ai-image

  • ai-image-demos

  • ai-inference

  • ai-leaderboard

  • ai-library

  • ai-rank

  • ai-serving

  • ai-tools

  • ai-train

  • ai-video

  • ai-workflow

  • AIGC

  • algorithms

  • alibaba

  • amazon

  • android

  • anthropic

  • audio

  • aws

  • biology

  • blog

  • book

  • bytedance

  • chatbot

  • chatgpt

  • chemistry

  • claude

  • claude-code

  • cli

  • code

  • codex

  • copilot

  • course

  • cursor

  • deepmind

  • deepseek

  • depth

  • devops

  • diffusers

  • docker

  • drug-discovery

  • electron

  • embeddings

  • engineering

  • facebook

  • finance

  • foundation

  • foundation-model

  • gemini

  • gemini-cli

  • gemma

  • genomics

  • gitHub

  • github

  • go

  • google

  • gradient-booting

  • grok

  • huggingface

  • image

  • ios

  • java

  • javascript

  • LLM

  • llm

  • math

  • mcp

  • mcp-client

  • mcp-server

  • meta-ai

  • meta-pytorch

  • microsoft

  • mlops

  • mobile

  • multilingual

  • multimodal

  • NLP

  • nlp

  • nodejs

  • nvidia

  • ocr

  • ollama

  • openai

  • opencode

  • pandas

  • paper

  • physics

  • plugin

  • postgres

  • privacy

  • prompt-engineering

  • python

  • pytorch

  • RL

  • robotics

  • rust

  • science

  • security

  • shodan

  • skillkit

  • sora

  • speech

  • ssh

  • tensorrt

  • terminal

  • transformers

  • translation

  • tts

  • tutorial

  • typescript

  • vibe-coding

  • video

  • vision

  • vllm

  • voice

  • xAI

  • xai


Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

2026
Jiajie Jin, Yuyang Hu +16

Lets an AI agent propose, run, and evaluate multi-step research experiments using a persistent Hypothesis Tree that links hypotheses, artifacts, evidence, and distilled insights. Combines a long-lived coordinator with short-lived executors to carry lessons across time; evaluated on six ML tasks.

paper, ai-agent, agent-skills, ai-workflow, ai-train +2

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

2026
Jia Deng, Yimeng Chen +10

Synthesizes shortcut-resistant search tasks to train deep search agents by controlling four shortcut risks across entity selection, evidence-graph construction, question formulation, and adversarial refinement. Produces training trajectories with longer pre-answer search and fewer shortcut patterns; code will be released on GitHub.

paper, github, ai-agent, agent-skills, deepseek +2

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

2026
Jundong Xu, Qingchuan Li +12

Benchmarks evolving environments as sequences of progressive updates and introduces EvoMem, a patch-based memory that records structured update histories so LLM agents can reason about environment evolution. Demonstrates measurable gains on EvoArena and other benchmarks.

LLM, ai-agent, agent-skills, paper, nlp

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

2026
Seokju Cho, Ryo Hachiuma +9

Provides a training-free, code-as-action framework that lets VLM-backed agents write and run stateful Python cells to compose perception and geometry primitives for open-ended 3D/4D spatial reasoning. Demonstrates consistent gains across 20 benchmarks and multiple VLM backbones.

vision, multimodal, ai-agent, agent-skills, paper

InterleaveThinker: Reinforcing Agentic Interleaved Generation

2026
Dian Zheng, Harry Lee +5

Adds interleaved text–image generation to existing image generators via a multi-agent pipeline: a planner sequences stepwise instructions, a critic detects and refines failures, and single-step RL (GRPO) reinforces per-step corrections. Suited to visual narratives and embodied guidance.

multimodal, vision, ai-image, image, RL +3