Meta rolls out consumer subscription plans for Instagram, Facebook, and WhatsApp globally, with prices from $2.99 to $3.99 per month, offering extra features. The company also begins testing new subscriptions for businesses, creators, and Meta AI users.
Meta launches Instagram Plus ($3.99/mo), Facebook Plus ($3.99/mo), and WhatsApp Plus ($2.99/mo) globally
Subscribers get profile customization, super reactions, story insights, and more
A new analysis shows that top AI forecasters adjust their AGI timelines based on which lab is currently leading the field, with predictions swinging from earlier to later and back again as the dominant lab changes from ChatGPT to xAI/Meta/Gemini to Anthropic.
Predictions for when most cognitive labor will be automated (AGI) fluctuate significantly based on which AI lab is currently dominant.
From 2023-2025, most researchers moved AGI timelines earlier; from 2025-2026, they moved them later; in early 2026, under Anthropic's rapid progress, they moved earlier again.
This article dives deep into Ollama's configuration engine, covering how to fine-tune local language model parameters using the Modelfile, optimize hardware performance with server environment variables, and format prompt flows with Go template syntax.
The Ollama Modelfile is a declarative configuration file that defines model behavior, including base model, system instructions, and parameters.
Sampling parameters (temperature, Top-K, Top-P, Min-P) control the creativity and determinism of the model's outputs.
Meta is rolling out paid add-ons for Instagram, Facebook, and WhatsApp worldwide while building a separate paid AI offering. This marks the first time Meta has clearly monetized its AI investments.
Meta launches paid add-ons globally for Instagram, Facebook, and WhatsApp.
A separate paid AI product is also in development, monetizing AI investments.
ElevenLabs has released Music v2, an upgraded AI music generation model that can shift between genres like opera, heavy metal, and rap within a single song. A new inpainting feature allows users to regenerate specific sections without affecting the rest.
Music v2 enables seamless genre transitions within a single song.
New inpainting feature allows targeted regeneration of specific sections.
A group of former researchers from Google DeepMind, Apple, OpenAI, and Meta have launched a startup called Trajectory, aiming to help companies continuously improve their AI products by training on real-world user interactions. The company has raised a $15 million seed round at a $115 million valuation, led by Conviction. Trajectory's platform enables continuous learning for AI models, updating them based on real-world failures. It currently works with AI-native companies like Clay and Harvey, and plans to expand to Fortune 500 companies.
Trajectory is founded by ex-Google DeepMind, Apple, OpenAI, and Meta researchers to enable continuous learning for AI.
The startup raised $15M seed funding at $115M valuation, with investors including Jeff Dean and Fei-Fei Li.
The 10th ABAW Workshop and Competition at CVPR 2026 advances multimodal human-centered AI by introducing new challenges including emotional mimicry intensity estimation, ambivalence/hesitancy recognition, and fine-grained violence detection, alongside traditional affect estimation and recognition tasks. The competition leverages large-scale in-the-wild datasets, and the paper track covers a broad range of topics from pose estimation to fairness and robustness.
ICG is a novel framework that integrates MLLM-based prompting with personalized preference alignment to generate high-quality, contextually relevant cover images. It extracts semantic features via meta tokens, refines them with user embeddings, and injects personalized context into diffusion models. A multi-reward learning strategy combines public rewards with a personalized preference model, eliminating the need for labeled supervision. Experiments show improvements in image quality, semantic fidelity, and personalization, boosting user appeal and recommendation accuracy.
ICG integrates MLLM prompting with personalized preference alignment for end-to-end cover image generation.
Semantic features are extracted via meta tokens and refined with user embeddings for diffusion model injection.
This paper presents $E^3$-Agent, an executable and evolving agent for resource management of edge AIGC. It separates a fast-path router from a slow-path LLM meta-controller, learns online from execution feedback, and adapts to unknown time-varying service-time mappings. Evaluation shows 65%-73% latency reduction over static baselines and effective stutter suppression.
Edge generative inference faces unknown per-device performance and non-stationarity.
$E^3$-Agent uses a dual-path architecture: fast router + slow LLM meta-controller.
A paper argues that with generative AI dissolving the human capacity to write correct code as the binding constraint, software work reorganizes around two pillars: Mixer Mode (humans operating multiple judgment axes continuously like a sound engineer) and Meta-Software (software that observes, validates, and governs other software). The two pillars are inseparable, drawing a parallel to the historical transition from artisanal to mass production.
The production of code is ceasing to be the dominant problem in software organizations due to generative AI.
Mixer Mode describes a new human role where practitioners continuously operate multiple judgment axes.
South Africa holds 88% of global platinum-group metals, hosts Africa's largest data center market, and sits at the center of a US-China AI infrastructure contest. Yet its draft AI policy, withdrawn after hallucinated references, fails to leverage these advantages for favorable terms. The article examines South Africa's structural leverage, three possible AI infrastructure futures (Chinese, US, local open-weight), and the need for binding governance provisions.
South Africa's platinum metals and renewable energy give it unique AI leverage, but the draft policy lacks minimum terms for hyperscalers, data sovereignty, or tech transfer conditions.
US and Chinese tech companies (Microsoft, Huawei) compete for AI infrastructure control in South Africa, while the policy does not specify what South Africa demands in return.
Unionized staff at The New York Times' Tech Guild accuse management of refusing to disclose AI usage plans and using internal AI tools to monitor performance, leading to unfair labor practice charges. The dispute highlights broader industry tensions over AI in newsrooms.
Tech Guild alleges Times management withheld information on AI use and future plans affecting jobs.
Two AI tools, DX and Glean, used to track employee performance and activity, sparking privacy and surveillance concerns.
ACM CAIS 2026 registration is full; join the waitlist. The conference runs May 26–29, 2026 in San Jose, featuring keynotes, 63 research papers, and 46 system demos, with a partnership with the AI Engineer World's Fair.
This article details how to deploy a fully local voice conversation pipeline for the Reachy Mini robot, eliminating the need for cloud servers or API keys. It uses a cascaded approach combining VAD, STT, LLM, and TTS, with recommended defaults: llama.cpp with Gemma 4, Silero VAD, Parakeet-TDT 0.6B v3 STT, and Qwen3-TTS. Various LLM options are provided, including local MLX, Transformers, vLLM, or remote Responses API.
Reachy Mini can now run conversations fully locally without a server.
The cascaded pipeline includes VAD, STT, LLM, and TTS, with swappable components.
Local models offer privacy, cost savings, control, and availability. While not as capable as frontier models, they are improving. This post explains how to set up local models in Zed using LM Studio, Ollama, or llama.cpp, and offers tips for effective use.
Local models provide privacy, lower cost, control, and always-availability.
They are less capable and slower than frontier models, but suitable for many tasks.
This systematic review of 139 studies proposes a unified framework and meta-analysis. Results show multimodal fusion improves accuracy by 5.28% on average, multiview fusion boosts accuracy by 4.67% and F1 by 3.08%, but only a minority of studies used statistical tests, raising reproducibility concerns.
Meta-analysis reveals that multimodal and multiview fusion significantly improve document classification accuracy.
Multimodal fusion yields a +5.28% accuracy gain; multiview yields +4.67% accuracy and +3.08% F1 score.
The software industry is undergoing an unprecedented metamorphosis. From simple statistical completion of early coding assistants, through conversational chatbots and the failure of multi-agent systems, we have arrived at the era of the Agentic Loop. This comprehensive guide analyzes the entire evolution, from the Completion paradigm to the revolutionary Ralph Loop that is redefining how we write code.
AI-assisted coding evolved from statistical code completion (2021-2022) to the Agentic Loop paradigm.
Tools like Codex and GitHub Copilot were based on statistical models, lacking task understanding and long-term reasoning.
Curlo is a privacy-first macOS app for searching, previewing, and organizing large sound libraries. It lets you find SFX or music by describing what you want to hear, with fully local offline search, metadata management, AI auto-tagging, and integration with DAWs.
Local offline semantic search for audio files
Search by natural language description, filename, tags, and more
Researchers from UMD, Google, Meta, and other institutions use AutoTTS to let a coding agent independently discover control algorithms for AI reasoning. The algorithm it found cuts compute by about 70 percent compared to standard self-consistency while matching its accuracy. The whole search cost $40 and took 160 minutes.
AutoTTS uses an offline simulation environment to let a coding agent autonomously explore test-time scaling algorithms without human-written rules.
The discovered algorithm achieves higher accuracy per compute on math benchmarks than established methods like self-consistency.
Meta cuts 350 jobs in Ireland amid global layoffs, sparking concerns over AI-driven displacement. Amazon, Oracle, Block also cite AI for job cuts. Experts warn of 'AI washing' as some layoffs may be unrelated. Official data shows 10.7% drop in Irish tech employment. ESRI report warns 200,000 roles could be displaced by AI.
Meta lays off 350 of 1,800 Irish employees; global 8,000 job cuts linked to AI investments.
Amazon, Oracle, Block announce AI-related job cuts; Ireland sees hundreds of roles affected.
Meta launched an internal AI leaderboard called 'Claudeonomics' to track employee token usage, but shut it down after data leaked. The trend of tracking AI usage is growing, with Nvidia's Jensen Huang proposing AI tokens as part of compensation.
Meta's internal AI leaderboard 'Claudeonomics' ranked employees based on token consumption and used gamification badges.
The leaderboard was shut down after internal usage data was shared publicly.
New UC San Diego research shows that with persona prompts, advanced LLMs like GPT-4.5 are judged human more often than real humans in a three-party Turing test. LLaMa-3.1 also performed at human levels. The study raises important questions about online trust and the nature of humanlikeness.
GPT-4.5 was judged human 73% of the time; LLaMa-3.1-405B 56%.
Without persona prompting, performance dropped sharply.
The proliferation of LLM adversarial testing tools has made it difficult for operators to navigate the landscape. Autonomous agents that select attacks, compose transforms, and execute tests are emerging as a solution. Dreadnode's agent executed 674 attacks against Meta's Llama Scout in about three hours with an 85% success rate. However, limitations include constrained coverage, alignment issues with the orchestrating agent, and a lack of formal comparison with human experts.
AI agents can autonomously red team LLMs, translating natural language goals into executed attacks and compliance mappings.
Dreadnode's agent achieved 85% attack success rate on Llama Scout, but results may not generalize to frontier models.
Tribal announced a $10 million seed round led by Team8 to expand its metadata-native AI agent platform, which maps enterprise systems' metadata layers to deliver production-ready solutions with governance. The platform integrates with Salesforce, SAP, and other major systems, aiming to turn AI experiments into repeatable business impact.
Tribal raised $10M seed funding led by Team8, with DYDX Capital and angel investors. The founders have deep enterprise background from Salesforce, Wix, and Spot.io.
The platform uses a Metadata Fabric to ingest and map enterprise system metadata in minutes, providing organizational context for AI agents.
We're about to find out if systems like SynthID and C2PA can effectively combat deepfakes. Google expands SynthID verification to Chrome and Search, also checking C2PA metadata. OpenAI adds SynthID to its images. Meta will use C2PA on Instagram. But challenges remain: metadata easily stripped, open-source models not adopting, and Google's dual role as AI creator and solution provider.
Google integrates SynthID verification into Chrome and Search, also checking C2PA metadata.
OpenAI adds SynthID watermarking to ChatGPT and API-generated images.
The author's latest demo of Google's Android XR reference glasses and Project Aura shows Gemini-powered capabilities like adding events to calendar, editing photos, and cross-app integration, suggesting ambient AI is the future of wearables.
Google launches three smart glasses: audio-only, Project Aura, and a reference model, all powered by Gemini.
Demo includes voice-adding FIFA matches to calendar, turning people into minions, and extracting ingredients to Keep.
Researchers introduce EgoTraj, an egocentric multimodal dataset recorded with Meta Quest Pro, featuring 75 navigation sequences in real-world urban environments with synchronized RGB video, head poses, eye gaze, and scene annotations. The dataset aims to advance trajectory prediction for humanoid robotics, wearable sensing, and assistive navigation, and benchmarks multiple state-of-the-art methods.
EgoTraj is the first egocentric multimodal human trajectory dataset captured in real-world urban settings using Meta Quest Pro.
It contains 75 sequences with synchronized RGB video, 6-DoF head poses, 3D eye gaze vectors, and scene annotations.
As deep research agents increasingly automate complex information-seeking tasks, reliable evaluation becomes critical. LLM-as-judge is used to assess these agents, but its reliability is poorly understood. The REFLECT benchmark introduces a fine-grained meta-evaluation, revealing that current LLM judges are unreliable, with best models achieving accuracies below 55% across reasoning, tool-use, and report-quality failures, especially poor on evidence verification. The study offers actionable guidance for building more reliable evaluation pipelines.
LLM judges show systematic limitations in evaluating deep research agents, with overall accuracy below 55%
REFLECT benchmark generates fine-grained failure instances via controlled interventions on agent traces
Meta is forcing over 7,000 workers to move to new teams, including teams focused on AI cloud infrastructure and an internal AI agent codenamed Hatch, as it pivots to AI. Last month, 1,000 engineers were reassigned to Applied AI, with employees told transfers are mandatory.
Meta is mandating more than 7,000 workers to transfer to AI-focused teams.
New teams include AI cloud infrastructure and an internal AI agent project called Hatch.
OpenAI now uses C2PA metadata and SynthID watermarks to tag AI-generated images, making them harder to strip. A public verification tool is also rolling out.
OpenAI adopts C2PA metadata and SynthID watermarks for all AI images.
SynthID embeds imperceptible pixel-level signals that survive editing and screenshots.
A quiet day before Google I/O lets us amplify a notable blogpost by Vlad Feinberg on job preparation for pretraining roles at frontier AI labs, alongside major model releases (Cursor Composer 2.5, Qwen3.7), local inference speedups via MTP in llama.cpp, and research on MoEs, RL, and agent evaluation.
Vlad Feinberg shares job tips for frontier labs, emphasizing kernel-level optimization and agent work.
Cursor releases Composer 2.5 and reveals training a much larger model with 10× compute.
Meta begins another round of layoffs cutting 8,000 jobs while ramping up AI investments. Employee morale is low, with internal resistance to AI data tracking tools.
Meta starts layoffs this week, cutting 8,000 jobs and scrapping 6,000 open roles.
CEO Mark Zuckerberg's tone shifts from apology to efficiency, with no apology this time.
Meta committed $145B to AI infrastructure and began laying off 8,000 people in the same week. Standard Chartered cut over 7,000 jobs, explicitly replacing 'lower-value human capital' with AI investment. Pope Leo XIV announced his first AI encyclical co-authored with Anthropic's Christopher Olah, to be released May 25.
Meta committed $145B in AI capex while laying off 8,000 employees.
Standard Chartered described its cuts as replacing 'lower-value human capital.'
This study introduces intelligent frameworks that use Large Language Models (LLMs) to improve task scheduling for construction robots. The system utilizes a Natural Language Processing interface for communication and adapts in real-time to unexpected site conditions. It concurrently uses two LLM agents: a generator (GPT-4) and a supervisor (Gemma 3/Llama 4/Mistral 7b) to provide precise task schedules. Evaluation results highlight the crucial role of LLMs in construction robotic tasks.
Proposes a hybrid LLM-based framework for construction robot task scheduling
This paper proposes Deep Pre-Alignment (DPA), a novel architecture that replaces the standard ViT encoder with a small VLM as perceiver, ensuring visual features are deeply aligned with the text space of the target large language model. On the 4B parameter scale, DPA outperforms baselines by 1.9 points across 8 multimodal benchmarks, with gains widening to 3.0 points at the 32B scale. DPA also achieves a 32.9% reduction in language capability forgetting and demonstrates consistent gains across different LLM families like Qwen3 and LLaMA 3.2.
DPA replaces ViT encoder with a small VLM perceiver for deep visual-text alignment.
Outperforms baselines by 1.9 points at 4B scale and 3.0 points at 32B scale on multimodal benchmarks.
The paper reveals that quantizing LLMs to lower precision (e.g., 3-bit) causes 6-21% of previously unbiased items to exhibit new stereotypical behaviors, while models' willingness to answer 'unknown' drops by 17.4%. Standard metrics like perplexity barely change, indicating aggregate metrics miss fairness-critical degradation.
Quantization can reintroduce biases even in aligned models.
3-bit quantization caused 6-21% of unbiased items to become stereotypical.
Instruction-tuned language models appear fair in high-stakes decisions but harbor biased internal representations. Using mortgage underwriting as a test case, this study shows that while outputs show no bias, internal layers encode and amplify demographic associations. Activation steering and cross-layer interventions reveal that reintroducing suppressed information at critical layers can flip decisions almost entirely. The latent bias is asymmetric, direction-dependent, and exploitable via adversarial prompts or fine-tuning. The authors argue that output-only audits are insufficient and propose a dual-layer testing framework combining output evaluation with representational analysis for AI governance.
LLMs exhibit no output-level bias but retain and amplify demographic representations internally
Activation steering shows that latent bias, when reinjected, can reverse decisions
Nous Research has published Lighthouse Attention, a training-only selection-based hierarchical attention mechanism that wraps around standard scaled dot-product attention during pretraining and is removed afterward. Unlike prior methods that pool only keys and values, Lighthouse pools Q, K, and V symmetrically across a multi-resolution pyramid, reducing the attention call from O(N·S·d) to O(S²·d) and running stock FlashAttention on a small dense sub-sequence. Tested on a 530M Llama-3-style model at 98K context, it achieves a 1.40–1.69× end-to-end wall-clock speedup against a cuDNN SDPA baseline with matching or lower final training loss.
Lighthouse Attention is a training-only hierarchical attention that symmetrically pools queries, keys, and values across a multi-resolution pyramid, reducing compute from O(N·S·d) to O(S²·d).
On a 530M Llama-3 model at 98K context, it achieves 1.4–1.7× speedup over cuDNN SDPA baseline with matching or lower final loss.
Annota is a local-first, end-to-end encrypted note-taking app with a powerful editor, AI assistant, and absolute privacy. It supports local Ollama or BYOK AI. All features are free for local use, with subscriptions for cloud sync and multi-device support.
Local-first design with full offline capability and E2E encryption
AI assistant supports local Ollama or Bring Your Own Key (BYOK)
LocalVibe is a pure-Rust local AI coding assistant optimized for Apple Silicon. It provides chat with quantized LLMs via Metal, on-device ONNX embeddings, LanceDB vector search, and a TUI interface. It includes MCP server for Claude Code, an OpenAI-compatible HTTP server, and various tool integrations.
Pure-Rust binary with Candle+Metal inference, fastembed-rs embeddings, and LanceDB vector store.
TUI with five sections: Chat, Models, Databases, Index, Settings.
An open-source benchmark suite that runs a full GPU/CPU benchmark with one command, covering Ollama LLM inference and XGBoost training, and generates an interactive HTML report.
Supports Ollama LLM (3B-14B parameter models) and XGBoost training/inference benchmarks
Single command execution with automatic HTML report and Streamlit dashboard
ArXiv, a popular platform for preprint academic research, is taking a new step to attempt to reduce the volume of papers that include AI slop. If a paper has "incontrovertible evidence that the authors did not check the results of LLM generation," such as hallucinated references or "meta-comments" left by an LLM, authors will be banned from ArXiv for a year, according to Thomas Dietterich, ArXiv's section chair of its computer science section. Future ArXiv submissions will also have to be accepted at "a reputable peer-reviewed venue."
ArXiv will impose a one-year ban on authors whose papers show clear signs of unchecked LLM output, such as hallucinated references or meta-comments.
After the ban, future submissions must first be accepted by a reputable peer-reviewed venue.
A new AI framework can detect microscopic internal defects in metal 3D printed components that are invisible to the naked eye but compromise structural integrity, potentially overcoming a key barrier to widespread adoption of metal additive manufacturing.
AI framework identifies hidden internal defects in metal 3D printed parts
Defects are microscopic but significantly affect part strength
Meta is installing tracking software on US-based employees' computers to capture mouse movements, clicks, and keystrokes for AI training. The tool, MCI, also takes occasional screenshots. Meanwhile, Meta plans to lay off 10% of its workforce globally starting May 20.
Meta deploys MCI tracking software on employee computers for AI training.
MCI captures mouse movements, clicks, keystrokes, and screenshots.
Tech companies are increasingly targeting middle managers in AI-driven restructurings, with Coinbase, Amazon, Block, and Meta cutting layers. Workers fear loss of mentorship and career advancement.
AI-driven restructurings are systematically eliminating middle managers in tech
Coinbase, Amazon, Block, and Meta have laid off tens of thousands, focusing on management layers
AI ethics is the practice of ensuring that AI systems are developed and used in ways that are fair, transparent, accountable, and safe. This article explores the core questions, challenges, regulatory landscape, and practical steps for responsible AI.
AI ethics addresses how to build AI systems that do more good than harm.
Key challenges include fairness conflicts, opacity, and the balance between privacy and performance.