AI Dashboard

New AI developments worth knowing about

A simpler, high-signal dashboard focused on recent releases, important product updates, major industry moves, and a limited set of research that matters in practice.

2,249

Total tracked

458

Last 24h

1,319

Last 3 days

Feed Controls

Time Window

Filtered Feed

Briefing

A simplified feed view based on the current lens and filters.

AI ModelHigh ImpactOSS

Coherearxiv_cs_ai21 hours ago

Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

Abstract: The effectiveness of Retrieval-Augmented Generation (RAG) is highly dependent on how documents are chunked, that is, segmented into smaller units for ...

adaptivechunkingoptimizing+3

AI ModelHigh ImpactOSS

Coherearxiv_cs_lg21 hours ago

Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

Abstract: Latent Video Diffusion Models (LVDMs) have achieved state-of-the-art generative quality for image and video generation; however, they remain b...

corruption-awaretraininglatent+5

AI ModelHigh ImpactOSS

Coherearxiv_cs_ai21 hours ago

TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Abstract: Geometric problem solving (GPS) requires precise multimodal understanding and rigorous, step-by-step logical reasoning. However, developing capable ...

trustgeogenformal-verifieddata+5

AI ModelWorth WatchingOSS

Coherearxiv_cs_ai21 hours ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Abstract: Autoregressive video diffusion models have demonstrated remarkable progress, yet they remain bottlenecked by intractable linear KV-cache growth, tempo...

packforcingshortvideo+5

AI ModelWorth WatchingOSS

Coherearxiv_cs_cl21 hours ago

Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence

Abstract: We study narrative coherence in visually grounded stories by comparing human-written narratives with those generated by vision-language models (VLMs) on...

humansvision-languagemodels+4

AI ModelWorth Watching

Coherearxiv_cs_ai21 hours ago

Pixelis: Reasoning in Pixels, from Seeing to Acting

Abstract: Most vision-language systems are static observers: they describe pixels, do not act, and cannot safely improve under shift. This passivity limits gene...

pixelisreasoningpixels+2

New ToolWorth Watching

Coherearxiv_cs_lg21 hours ago

Time-Correlated Video Bridge Matching

Abstract: Diffusion models excel in noise-to-data generation tasks, providing a mapping from a Gaussian distribution to a more complex data distribution. Howe...

time-correlatedvideobridge+1

New ToolWorth Watching

Coherearxiv_cs_ai21 hours ago

ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking

Abstract: Accurate tropical cyclones (TCs) tracking represents a critical challenge in the context of weather and climate science. Traditional tracking ...

bytestormmulti-stepdata-driven+5

ResearchImportant ResearchNew BenchmarkWorth Watching

Coherearxiv_cs_ai21 hours ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Abstract: Video world models have shown immense potential in simulating the physical world, yet existing memory mechanisms primarily treat environments as static canvases. When dynamic subjects hide out of s...

outsightout+5

ResearchImportant ResearchNew BenchmarkWorth Watching

Coherearxiv_cs_ai21 hours ago

A cross-species neural foundation model for end-to-end speech decoding

Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by translating neural activity into text. Most systems use cascaded frameworks that decode pho...

cross-speciesneuralfoundation+4

Safety ResearchImportant ResearchProduct Relevant

Coherearxiv_cs_ai21 hours ago

Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval

Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in combining AI generation with external knowledge lookup (RAG) databases. These are often populated...

epistemicbiasinjection+5

AI Model

Coherearxiv_cs_ai21 hours ago

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

Abstract: As machine learning (ML) systems expand in both scale and functionality, the security landscape has become increasingly complex, with a proliferation ...

securityfoundationmodel+5

Model ReleaseHigh ImpactOSS

Coheretechcrunch_ai1 day ago

Cohere launches an open-source voice model specifically for transcription

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It currently supports 14 languages.

coherelaunchesopen-source+4

ResearchImportant ResearchBreakthrough

PrimaryCoherehf_papers1 day ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Generating images conditioned on multiple visual references is critical for real-world applications such as multi-subject composition, narrative illustration, and novel view synthesis, yet current models suffer from severe performance degradation...

macroadvancingmulti-reference+5

AI ModelHigh Impact

Coherearxiv_cs_ai1 day ago

LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation

Abstract: Adapting pretrained language models to low-resource, morphologically rich languages remains a significant challenge. Existing vocabulary expansion met...

lgselexicallygrounded+5

AI ModelWorth WatchingOSS

Coherearxiv_cs_ai1 day ago

GAIA: A Foundation Model for Operational Atmospheric Dynamics

Abstract: We introduce GAIA (Geospatial Artificial Intelligence for Atmospheres), a hybrid self-supervised geospatial foundation model that fuses Masked...

gaiafoundationmodel+3

AI ModelWorth Watching

Coherearxiv_cs_ai1 day ago

A Multimodal Framework for Human-Multi-Agent Interaction

Abstract: Human-robot interaction is increasingly moving toward multi-robot, socially grounded environments. Existing systems struggle to integrate multimodal p...

multimodalframeworkhuman-multi-agent+1

AI ModelWorth Watching

Coherearxiv_cs_ai1 day ago

Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search

Abstract: Long video understanding presents significant challenges for vision-language models due to extremely long context windows. Existing solutions ...

hierarchicallongvideo+5

AI ModelWorth Watching

Coherearxiv_cs_cl1 day ago

PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation

Abstract: Poetry generation in Sanskrit typically requires the verse to be semantically coherent and adhere to strict prosodic rules. In Sanskrit prosody, every l...

pingalaprosody-awaredecoding+3

ResearchImportant ResearchAgents & ToolsWorth WatchingOSS

Coherearxiv_cs_ai1 day ago

Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases

Abstract: Observational studies can yield clinically actionable evidence at scale, but executing them on real-world databases is open-ended and requires coherent decisions across cohort construction, analysis,...

llmagentsgenerate+5

AI ModelWorth Watching

Coherearxiv_cs_cl1 day ago

Navigating the Concept Space of Language Models

Abstract: Sparse autoencoders (SAEs) trained on large language model activations output thousands of features that enable mapping to human-interpretable concepts....

navigatingconceptspace+2

AI ModelWorth Watching

Coherearxiv_cs_cl1 day ago

LLMs Do Not Grade Essays Like Humans

Abstract: Large language models have recently been proposed as tools for automated essay scoring, but their agreement with human grading remains unclear. In thi...

llmsgradeessays+2

New ToolWorth Watching

Coherearxiv_cs_ai1 day ago

PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal

Abstract: Surgical smoke severely degrades intraoperative video quality, obscuring anatomical structures and limiting surgical perception. Existing learning-based...

physe-rpophysicssemantics+5

AI ModelWorth Watching

Coherearxiv_cs_cl1 day ago

Towards Reward Modeling for AI Tutors in Math Mistake Remediation

Abstract: Evaluating the pedagogical quality of AI tutors remains challenging: standard NLG metrics do not determine whether responses identify mistakes, scaffold...

towardsrewardmodeling+4

AI ModelWorth Watching

Coherearxiv_cs_ai1 day ago

Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance

Abstract: Large Vision-Language Models (LVLMs) can reason from image-text inputs and perform well in various multimodal tasks. Despite this success, the...

residualdecodingmitigating+5

AI ModelWorth Watching

Coherearxiv_cs_ai1 day ago

Image Generation from Contextually-Contradictory Prompts

Abstract: Text-to-image diffusion models excel at generating high-quality, diverse images from natural language prompts. However, they often fail to pro...

imagegenerationcontextually-contradictory+1

ResearchImportant ResearchProduct RelevantWorth Watching

Coherearxiv_cs_ai1 day ago

STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

Abstract: Large Language Models (LLMs) have demonstrated potential in code generation, yet they struggle with the multi-step, stateful reasoning required for offensive cybersecurity operations. Existing rese...

striatum-ctfprotocol-drivenagentic+4