AI News HubLIVE

Source Mix

  • Hacker News AI17
  • arXiv Computational Linguistics3
  • arXiv Computer Vision3
  • Latent Space3
  • The Decoder3
  • The Guardian AI3
  • The Verge AI3
  • ZDNet AI3

Topic Mix

  • Research25
  • Agents23
  • Models19
  • Chips14
  • Policy13
  • Tools6
  • Startups3
  • Robotics1

Timeline

  • 2026-05-2810
  • 2026-05-207
  • 2026-05-155
  • 2026-05-195
  • 2026-05-164
  • 2026-05-184
  • 2026-05-274
  • 2026-05-213

Latest Updates

Meta launches Instagram, Facebook, and WhatsApp subscriptions

Meta rolls out consumer subscription plans for Instagram, Facebook, and WhatsApp globally, with prices from $2.99 to $3.99 per month, offering extra features. The company also begins testing new subscriptions for businesses, creators, and Meta AI users.

  • Meta launches Instagram Plus ($3.99/mo), Facebook Plus ($3.99/mo), and WhatsApp Plus ($2.99/mo) globally
  • Subscribers get profile customization, super reactions, story insights, and more
In-site article

AGI timelines shift with whichever lab is dominant

A new analysis shows that top AI forecasters adjust their AGI timelines based on which lab is currently leading the field, with predictions swinging from earlier to later and back again as the dominant lab changes from ChatGPT to xAI/Meta/Gemini to Anthropic.

  • Predictions for when most cognitive labor will be automated (AGI) fluctuate significantly based on which AI lab is currently dominant.
  • From 2023-2025, most researchers moved AGI timelines earlier; from 2025-2026, they moved them later; in early 2026, under Anthropic's rapid progress, they moved earlier again.
In-site article

Tweaking Local Language Model Settings with Ollama

This article dives deep into Ollama's configuration engine, covering how to fine-tune local language model parameters using the Modelfile, optimize hardware performance with server environment variables, and format prompt flows with Go template syntax.

  • The Ollama Modelfile is a declarative configuration file that defines model behavior, including base model, system instructions, and parameters.
  • Sampling parameters (temperature, Top-K, Top-P, Min-P) control the creativity and determinism of the model's outputs.
In-site article

Meta One: Zuckerberg finally puts a price tag on all that AI spending

Meta is rolling out paid add-ons for Instagram, Facebook, and WhatsApp worldwide while building a separate paid AI offering. This marks the first time Meta has clearly monetized its AI investments.

  • Meta launches paid add-ons globally for Instagram, Facebook, and WhatsApp.
  • A separate paid AI product is also in development, monetizing AI investments.
In-site article

ElevenLabs Music v2 promises opera-to-metal transitions without losing musical coherence

ElevenLabs has released Music v2, an upgraded AI music generation model that can shift between genres like opera, heavy metal, and rap within a single song. A new inpainting feature allows users to regenerate specific sections without affecting the rest.

  • Music v2 enables seamless genre transitions within a single song.
  • New inpainting feature allows targeted regeneration of specific sections.
In-site article

Former Google and Apple Researchers Launch a Startup to Build AI's Missing Feed

A group of former researchers from Google DeepMind, Apple, OpenAI, and Meta have launched a startup called Trajectory, aiming to help companies continuously improve their AI products by training on real-world user interactions. The company has raised a $15 million seed round at a $115 million valuation, led by Conviction. Trajectory's platform enables continuous learning for AI models, updating them based on real-world failures. It currently works with AI-native companies like Clay and Harvey, and plans to expand to Fortune 500 companies.

  • Trajectory is founded by ex-Google DeepMind, Apple, OpenAI, and Meta researchers to enable continuous learning for AI.
  • The startup raised $15M seed funding at $115M valuation, with investors including Jeff Dean and Fei-Fei Li.
In-site article

From Affect to Complex Behavior: Advancing Multimodal Human-Centered AI at the 10th ABAW Workshop & Competition

The 10th ABAW Workshop and Competition at CVPR 2026 advances multimodal human-centered AI by introducing new challenges including emotional mimicry intensity estimation, ambivalence/hesitancy recognition, and fine-grained violence detection, alongside traditional affect estimation and recognition tasks. The competition leverages large-scale in-the-wild datasets, and the paper track covers a broad range of topics from pose estimation to fairness and robustness.

  • ABAW 2026 introduces novel challenges: emotional mimicry intensity, ambivalence recognition, and violence detection.
  • Workshop continues dual structure with competition and paper tracks.
In-site article

ICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignment

ICG is a novel framework that integrates MLLM-based prompting with personalized preference alignment to generate high-quality, contextually relevant cover images. It extracts semantic features via meta tokens, refines them with user embeddings, and injects personalized context into diffusion models. A multi-reward learning strategy combines public rewards with a personalized preference model, eliminating the need for labeled supervision. Experiments show improvements in image quality, semantic fidelity, and personalization, boosting user appeal and recommendation accuracy.

  • ICG integrates MLLM prompting with personalized preference alignment for end-to-end cover image generation.
  • Semantic features are extracted via meta tokens and refined with user embeddings for diffusion model injection.
In-site article

$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference

This paper presents $E^3$-Agent, an executable and evolving agent for resource management of edge AIGC. It separates a fast-path router from a slow-path LLM meta-controller, learns online from execution feedback, and adapts to unknown time-varying service-time mappings. Evaluation shows 65%-73% latency reduction over static baselines and effective stutter suppression.

  • Edge generative inference faces unknown per-device performance and non-stationarity.
  • $E^3$-Agent uses a dual-path architecture: fast router + slow LLM meta-controller.
In-site article

Show HN: The Two Pillars – A conceptual framework for post-AI software work

A paper argues that with generative AI dissolving the human capacity to write correct code as the binding constraint, software work reorganizes around two pillars: Mixer Mode (humans operating multiple judgment axes continuously like a sound engineer) and Meta-Software (software that observes, validates, and governs other software). The two pillars are inseparable, drawing a parallel to the historical transition from artisanal to mass production.

  • The production of code is ceasing to be the dominant problem in software organizations due to generative AI.
  • Mixer Mode describes a new human role where practitioners continuously operate multiple judgment axes.
In-site article

South Africa Has AI Leverage. Its Draft Policy Leaves It Unused

South Africa holds 88% of global platinum-group metals, hosts Africa's largest data center market, and sits at the center of a US-China AI infrastructure contest. Yet its draft AI policy, withdrawn after hallucinated references, fails to leverage these advantages for favorable terms. The article examines South Africa's structural leverage, three possible AI infrastructure futures (Chinese, US, local open-weight), and the need for binding governance provisions.

  • South Africa's platinum metals and renewable energy give it unique AI leverage, but the draft policy lacks minimum terms for hyperscalers, data sovereignty, or tech transfer conditions.
  • US and Chinese tech companies (Microsoft, Huawei) compete for AI infrastructure control in South Africa, while the policy does not specify what South Africa demands in return.
In-site article

The AI fight brewing inside The New York Times

Unionized staff at The New York Times' Tech Guild accuse management of refusing to disclose AI usage plans and using internal AI tools to monitor performance, leading to unfair labor practice charges. The dispute highlights broader industry tensions over AI in newsrooms.

  • Tech Guild alleges Times management withheld information on AI use and future plans affecting jobs.
  • Two AI tools, DX and Glean, used to track employee performance and activity, sparking privacy and surveillance concerns.
In-site article

ACM Conference on AI and Agentic Systems – ACM CAIS 2026

ACM CAIS 2026 registration is full; join the waitlist. The conference runs May 26–29, 2026 in San Jose, featuring keynotes, 63 research papers, and 46 system demos, with a partnership with the AI Engineer World's Fair.

  • Registration full, join waitlist
  • May 26–29, 2026 in San Jose
In-site article

Reachy Mini goes fully local

This article details how to deploy a fully local voice conversation pipeline for the Reachy Mini robot, eliminating the need for cloud servers or API keys. It uses a cascaded approach combining VAD, STT, LLM, and TTS, with recommended defaults: llama.cpp with Gemma 4, Silero VAD, Parakeet-TDT 0.6B v3 STT, and Qwen3-TTS. Various LLM options are provided, including local MLX, Transformers, vLLM, or remote Responses API.

  • Reachy Mini can now run conversations fully locally without a server.
  • The cascaded pipeline includes VAD, STT, LLM, and TTS, with swappable components.
In-site article

Why and How to Run Local Models in Zed

Local models offer privacy, cost savings, control, and availability. While not as capable as frontier models, they are improving. This post explains how to set up local models in Zed using LM Studio, Ollama, or llama.cpp, and offers tips for effective use.

  • Local models provide privacy, lower cost, control, and always-availability.
  • They are less capable and slower than frontier models, but suitable for many tasks.
In-site article

Document Classification Pattern Recognition via Information Fusion: A Systematic Review of Multimodal and Multiview Representation Approaches

This systematic review of 139 studies proposes a unified framework and meta-analysis. Results show multimodal fusion improves accuracy by 5.28% on average, multiview fusion boosts accuracy by 4.67% and F1 by 3.08%, but only a minority of studies used statistical tests, raising reproducibility concerns.

  • Meta-analysis reveals that multimodal and multiview fusion significantly improve document classification accuracy.
  • Multimodal fusion yields a +5.28% accuracy gain; multiview yields +4.67% accuracy and +3.08% F1 score.
In-site article

The evolution of AI-assisted software engineering paradigms

The software industry is undergoing an unprecedented metamorphosis. From simple statistical completion of early coding assistants, through conversational chatbots and the failure of multi-agent systems, we have arrived at the era of the Agentic Loop. This comprehensive guide analyzes the entire evolution, from the Completion paradigm to the revolutionary Ralph Loop that is redefining how we write code.

  • AI-assisted coding evolved from statistical code completion (2021-2022) to the Agentic Loop paradigm.
  • Tools like Codex and GitHub Copilot were based on statistical models, lacking task understanding and long-term reasoning.
In-site article

Curlo: Local AI Search for SFX and Music by Describing Sounds

Curlo is a privacy-first macOS app for searching, previewing, and organizing large sound libraries. It lets you find SFX or music by describing what you want to hear, with fully local offline search, metadata management, AI auto-tagging, and integration with DAWs.

  • Local offline semantic search for audio files
  • Search by natural language description, filename, tags, and more
In-site article

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers from UMD, Google, Meta, and other institutions use AutoTTS to let a coding agent independently discover control algorithms for AI reasoning. The algorithm it found cuts compute by about 70 percent compared to standard self-consistency while matching its accuracy. The whole search cost $40 and took 160 minutes.

  • AutoTTS uses an offline simulation environment to let a coding agent autonomously explore test-time scaling algorithms without human-written rules.
  • The discovered algorithm achieves higher accuracy per compute on math benchmarks than established methods like self-consistency.
In-site article

Big fears in Big Tech sector over Artificial Intelligence job losses

Meta cuts 350 jobs in Ireland amid global layoffs, sparking concerns over AI-driven displacement. Amazon, Oracle, Block also cite AI for job cuts. Experts warn of 'AI washing' as some layoffs may be unrelated. Official data shows 10.7% drop in Irish tech employment. ESRI report warns 200,000 roles could be displaced by AI.

  • Meta lays off 350 of 1,800 Irish employees; global 8,000 job cuts linked to AI investments.
  • Amazon, Oracle, Block announce AI-related job cuts; Ireland sees hundreds of roles affected.
In-site article

Meta's Claudeonomics leaderboard

Meta launched an internal AI leaderboard called 'Claudeonomics' to track employee token usage, but shut it down after data leaked. The trend of tracking AI usage is growing, with Nvidia's Jensen Huang proposing AI tokens as part of compensation.

  • Meta's internal AI leaderboard 'Claudeonomics' ranked employees based on token consumption and used gamification badges.
  • The leaderboard was shut down after internal usage data was shared publicly.
In-site article

Giving Agents Computers — Ivan Burazin, Daytona

We chat with Daytona's CEO about their insane 74% MoM Growth, 850K Daily Runs, Bare Metal Sandboxes, RL Evals, and the New Agent Cloud

  • Daytona pivoted from human dev environments to AI sandboxes, achieving 74% MoM growth
  • Provides bare-metal infrastructure with custom scheduler; single sandbox starts in ~60ms
In-site article

AI Can Seem More Human Than Real Humans in a Classic Turing Test, Study Finds

New UC San Diego research shows that with persona prompts, advanced LLMs like GPT-4.5 are judged human more often than real humans in a three-party Turing test. LLaMa-3.1 also performed at human levels. The study raises important questions about online trust and the nature of humanlikeness.

  • GPT-4.5 was judged human 73% of the time; LLaMa-3.1-405B 56%.
  • Without persona prompting, performance dropped sharply.
In-site article

AI red teaming agents change how LLMs get tested

The proliferation of LLM adversarial testing tools has made it difficult for operators to navigate the landscape. Autonomous agents that select attacks, compose transforms, and execute tests are emerging as a solution. Dreadnode's agent executed 674 attacks against Meta's Llama Scout in about three hours with an 85% success rate. However, limitations include constrained coverage, alignment issues with the orchestrating agent, and a lack of formal comparison with human experts.

  • AI agents can autonomously red team LLMs, translating natural language goals into executed attacks and compliance mappings.
  • Dreadnode's agent achieved 85% attack success rate on Llama Scout, but results may not generalize to frontier models.
In-site article

Railway: The Agent-Native Cloud — Jake Cooper

3M Users, 100K Signups/Week, Own-Metal Data Centers, $200K+ Coding Agent Spend, and the Death of PRs

  • Railway has grown from 100 hand-acquired users to 3M, adding 100K signups per week.
  • Owns bare metal data centers with 3-month payback period and 70% margins.
In-site article

Tribal Raises $10M Seed to Bring Context-Aware AI Agents to Enterprise Systems

Tribal announced a $10 million seed round led by Team8 to expand its metadata-native AI agent platform, which maps enterprise systems' metadata layers to deliver production-ready solutions with governance. The platform integrates with Salesforce, SAP, and other major systems, aiming to turn AI experiments into repeatable business impact.

  • Tribal raised $10M seed funding led by Team8, with DYDX Capital and angel investors. The founders have deep enterprise background from Salesforce, Wix, and Spot.io.
  • The platform uses a Metadata Fabric to ingest and map enterprise system metadata in minutes, providing organizational context for AI agents.
In-site article

It’s make or break time for AI labeling systems

We're about to find out if systems like SynthID and C2PA can effectively combat deepfakes. Google expands SynthID verification to Chrome and Search, also checking C2PA metadata. OpenAI adds SynthID to its images. Meta will use C2PA on Instagram. But challenges remain: metadata easily stripped, open-source models not adopting, and Google's dual role as AI creator and solution provider.

  • Google integrates SynthID verification into Chrome and Search, also checking C2PA metadata.
  • OpenAI adds SynthID watermarking to ChatGPT and API-generated images.
In-site article

I wore Google's Android XR glasses again - and my limit-testing should scare Meta and Apple

The author's latest demo of Google's Android XR reference glasses and Project Aura shows Gemini-powered capabilities like adding events to calendar, editing photos, and cross-app integration, suggesting ambient AI is the future of wearables.

  • Google launches three smart glasses: audio-only, Project Aura, and a reference model, all powered by Gemini.
  • Demo includes voice-adding FIFA matches to calendar, turning people into minions, and extracting ingredients to Keep.
In-site article

EgoTraj: Real-World Egocentric Human Trajectory Dataset for Multimodal Prediction

Researchers introduce EgoTraj, an egocentric multimodal dataset recorded with Meta Quest Pro, featuring 75 navigation sequences in real-world urban environments with synchronized RGB video, head poses, eye gaze, and scene annotations. The dataset aims to advance trajectory prediction for humanoid robotics, wearable sensing, and assistive navigation, and benchmarks multiple state-of-the-art methods.

  • EgoTraj is the first egocentric multimodal human trajectory dataset captured in real-world urban settings using Meta Quest Pro.
  • It contains 75 sequences with synchronized RGB video, 6-DoF head poses, 3D eye gaze vectors, and scene annotations.
In-site article

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

As deep research agents increasingly automate complex information-seeking tasks, reliable evaluation becomes critical. LLM-as-judge is used to assess these agents, but its reliability is poorly understood. The REFLECT benchmark introduces a fine-grained meta-evaluation, revealing that current LLM judges are unreliable, with best models achieving accuracies below 55% across reasoning, tool-use, and report-quality failures, especially poor on evidence verification. The study offers actionable guidance for building more reliable evaluation pipelines.

  • LLM judges show systematic limitations in evaluating deep research agents, with overall accuracy below 55%
  • REFLECT benchmark generates fine-grained failure instances via controlled interventions on agent traces
In-site article

Meta is rapidly reorganizing its workers’ jobs around AI: ‘Transfers aren’t optional’

Meta is forcing over 7,000 workers to move to new teams, including teams focused on AI cloud infrastructure and an internal AI agent codenamed Hatch, as it pivots to AI. Last month, 1,000 engineers were reassigned to Applied AI, with employees told transfers are mandatory.

  • Meta is mandating more than 7,000 workers to transfer to AI-focused teams.
  • New teams include AI cloud infrastructure and an internal AI agent project called Hatch.
In-site article

OpenAI's new image watermarks make it easier to spot AI fakes - here's how

OpenAI now uses C2PA metadata and SynthID watermarks to tag AI-generated images, making them harder to strip. A public verification tool is also rolling out.

  • OpenAI adopts C2PA metadata and SynthID watermarks for all AI images.
  • SynthID embeds imperceptible pixel-level signals that survive editing and screenshots.
In-site article

[AINews] How to land a job at a frontier lab (on Pretraining)

A quiet day before Google I/O lets us amplify a notable blogpost by Vlad Feinberg on job preparation for pretraining roles at frontier AI labs, alongside major model releases (Cursor Composer 2.5, Qwen3.7), local inference speedups via MTP in llama.cpp, and research on MoEs, RL, and agent evaluation.

  • Vlad Feinberg shares job tips for frontier labs, emphasizing kernel-level optimization and agent work.
  • Cursor releases Composer 2.5 and reveals training a much larger model with 10× compute.
In-site article

Layoffs and Harsh AI Reality Inside Meta

Meta begins another round of layoffs cutting 8,000 jobs while ramping up AI investments. Employee morale is low, with internal resistance to AI data tracking tools.

  • Meta starts layoffs this week, cutting 8,000 jobs and scrapping 6,000 open roles.
  • CEO Mark Zuckerberg's tone shifts from apology to efficiency, with no apology this time.
In-site article

AI Weekly Issue #493: Meta hired $145B in capex and fired 8,000 people

Meta committed $145B to AI infrastructure and began laying off 8,000 people in the same week. Standard Chartered cut over 7,000 jobs, explicitly replacing 'lower-value human capital' with AI investment. Pope Leo XIV announced his first AI encyclical co-authored with Anthropic's Christopher Olah, to be released May 25.

  • Meta committed $145B in AI capex while laying off 8,000 employees.
  • Standard Chartered described its cuts as replacing 'lower-value human capital.'
In-site article

Hybrid LLM-based Intelligent Framework for Robot Task Scheduling

This study introduces intelligent frameworks that use Large Language Models (LLMs) to improve task scheduling for construction robots. The system utilizes a Natural Language Processing interface for communication and adapts in real-time to unexpected site conditions. It concurrently uses two LLM agents: a generator (GPT-4) and a supervisor (Gemma 3/Llama 4/Mistral 7b) to provide precise task schedules. Evaluation results highlight the crucial role of LLMs in construction robotic tasks.

  • Proposes a hybrid LLM-based framework for construction robot task scheduling
  • Uses dual-agent architecture: generator (GPT-4) and supervisor (Gemma 3/Llama 4/Mistral 7b)
In-site article

Deep Pre-Alignment for VLMs

This paper proposes Deep Pre-Alignment (DPA), a novel architecture that replaces the standard ViT encoder with a small VLM as perceiver, ensuring visual features are deeply aligned with the text space of the target large language model. On the 4B parameter scale, DPA outperforms baselines by 1.9 points across 8 multimodal benchmarks, with gains widening to 3.0 points at the 32B scale. DPA also achieves a 32.9% reduction in language capability forgetting and demonstrates consistent gains across different LLM families like Qwen3 and LLaMA 3.2.

  • DPA replaces ViT encoder with a small VLM perceiver for deep visual-text alignment.
  • Outperforms baselines by 1.9 points at 4B scale and 3.0 points at 32B scale on multimodal benchmarks.
In-site article

Quantization Undoes Alignment: Bias Emergence in Compressed LLMs Across Models and Precision Levels

The paper reveals that quantizing LLMs to lower precision (e.g., 3-bit) causes 6-21% of previously unbiased items to exhibit new stereotypical behaviors, while models' willingness to answer 'unknown' drops by 17.4%. Standard metrics like perplexity barely change, indicating aggregate metrics miss fairness-critical degradation.

  • Quantization can reintroduce biases even in aligned models.
  • 3-bit quantization caused 6-21% of unbiased items to become stereotypical.
In-site article

Fair outputs, Biased Internals: Causal Potency and Asymmetry of Latent Bias in LLMs for High-Stakes Decisions

Instruction-tuned language models appear fair in high-stakes decisions but harbor biased internal representations. Using mortgage underwriting as a test case, this study shows that while outputs show no bias, internal layers encode and amplify demographic associations. Activation steering and cross-layer interventions reveal that reintroducing suppressed information at critical layers can flip decisions almost entirely. The latent bias is asymmetric, direction-dependent, and exploitable via adversarial prompts or fine-tuning. The authors argue that output-only audits are insufficient and propose a dual-layer testing framework combining output evaluation with representational analysis for AI governance.

  • LLMs exhibit no output-level bias but retain and amplify demographic representations internally
  • Activation steering shows that latent bias, when reinjected, can reverse decisions
In-site article

Nous Research Proposes Lighthouse Attention: A Training-Only Selection-Based Hierarchical Attention That Delivers 1.4–1.7× Pretraining Speedup at Long Context

Nous Research has published Lighthouse Attention, a training-only selection-based hierarchical attention mechanism that wraps around standard scaled dot-product attention during pretraining and is removed afterward. Unlike prior methods that pool only keys and values, Lighthouse pools Q, K, and V symmetrically across a multi-resolution pyramid, reducing the attention call from O(N·S·d) to O(S²·d) and running stock FlashAttention on a small dense sub-sequence. Tested on a 530M Llama-3-style model at 98K context, it achieves a 1.40–1.69× end-to-end wall-clock speedup against a cuDNN SDPA baseline with matching or lower final training loss.

  • Lighthouse Attention is a training-only hierarchical attention that symmetrically pools queries, keys, and values across a multi-resolution pyramid, reducing compute from O(N·S·d) to O(S²·d).
  • On a 530M Llama-3 model at 98K context, it achieves 1.4–1.7× speedup over cuDNN SDPA baseline with matching or lower final loss.
In-site article

Annota – A local-first, E2EE note app with BYOK AI

Annota is a local-first, end-to-end encrypted note-taking app with a powerful editor, AI assistant, and absolute privacy. It supports local Ollama or BYOK AI. All features are free for local use, with subscriptions for cloud sync and multi-device support.

  • Local-first design with full offline capability and E2E encryption
  • AI assistant supports local Ollama or Bring Your Own Key (BYOK)
In-site article

LocalVibe – Pure-Rust local AI stack with MCP, in one binary (Apple Silicon)

LocalVibe is a pure-Rust local AI coding assistant optimized for Apple Silicon. It provides chat with quantized LLMs via Metal, on-device ONNX embeddings, LanceDB vector search, and a TUI interface. It includes MCP server for Claude Code, an OpenAI-compatible HTTP server, and various tool integrations.

  • Pure-Rust binary with Candle+Metal inference, fastembed-rs embeddings, and LanceDB vector store.
  • TUI with five sections: Chat, Models, Databases, Index, Settings.
In-site article

Show HN: AI/ML benchmark for local LLM inference and XGBoost training on GPU/CPU

An open-source benchmark suite that runs a full GPU/CPU benchmark with one command, covering Ollama LLM inference and XGBoost training, and generates an interactive HTML report.

  • Supports Ollama LLM (3B-14B parameter models) and XGBoost training/inference benchmarks
  • Single command execution with automatic HTML report and Streamlit dashboard
In-site article

ArXiv will ban researchers who upload papers full of AI slop

ArXiv, a popular platform for preprint academic research, is taking a new step to attempt to reduce the volume of papers that include AI slop. If a paper has "incontrovertible evidence that the authors did not check the results of LLM generation," such as hallucinated references or "meta-comments" left by an LLM, authors will be banned from ArXiv for a year, according to Thomas Dietterich, ArXiv's section chair of its computer science section. Future ArXiv submissions will also have to be accepted at "a reputable peer-reviewed venue."

  • ArXiv will impose a one-year ban on authors whose papers show clear signs of unchecked LLM output, such as hallucinated references or meta-comments.
  • After the ban, future submissions must first be accepted by a reputable peer-reviewed venue.
In-site article

'Reading the invisible': AI framework accounts for hidden defects in metal 3D printing

A new AI framework can detect microscopic internal defects in metal 3D printed components that are invisible to the naked eye but compromise structural integrity, potentially overcoming a key barrier to widespread adoption of metal additive manufacturing.

  • AI framework identifies hidden internal defects in metal 3D printed parts
  • Defects are microscopic but significantly affect part strength
In-site article

Meta to Start Capturing Employee Mouse Movements, Keystrokes for AI Training

Meta is installing tracking software on US-based employees' computers to capture mouse movements, clicks, and keystrokes for AI training. The tool, MCI, also takes occasional screenshots. Meanwhile, Meta plans to lay off 10% of its workforce globally starting May 20.

  • Meta deploys MCI tracking software on employee computers for AI training.
  • MCI captures mouse movements, clicks, keystrokes, and screenshots.
In-site article

‘I didn’t want to be the guinea pig’: inside tech’s AI-fueled manager purge

Tech companies are increasingly targeting middle managers in AI-driven restructurings, with Coinbase, Amazon, Block, and Meta cutting layers. Workers fear loss of mentorship and career advancement.

  • AI-driven restructurings are systematically eliminating middle managers in tech
  • Coinbase, Amazon, Block, and Meta have laid off tens of thousands, focusing on management layers
In-site article

What Are AI Ethics

AI ethics is the practice of ensuring that AI systems are developed and used in ways that are fair, transparent, accountable, and safe. This article explores the core questions, challenges, regulatory landscape, and practical steps for responsible AI.

  • AI ethics addresses how to build AI systems that do more good than harm.
  • Key challenges include fairness conflicts, opacity, and the balance between privacy and performance.
In-site article

Company Directory