πŸ€– Tech & AI Β· 07

Tech & AI

Every item from Trenri daily that fell into Tech & AI, latest first.

June 2026 Β· 110

  • OpenAI and Broadcom Unveil 'JalapeΓ±o' LLM Inference Chip

    OpenAI and Broadcom unveiled the 'JalapeΓ±o' chip dedicated to LLM inference on June 25. OpenAI said it went from design to tape-out in nine months and cuts costs by roughly 50% versus existing Nvidia GPUs.

  • OpenAI Starts Limited Preview of GPT-5.6 Series for Approved Partners Only

    OpenAI began a limited preview of GPT-5.6 Sol, Terra, and Luna on June 26. According to Axios, it was made available early to approximately 20 approved partners at the request of the US government.

  • Apple Unveils Third-Gen Foundation Model AFM 3 with Google and Nvidia

    At WWDC 2026, Apple unveiled a family of five AFM 3 third-generation foundation models developed with Google and Nvidia, consisting of on-device models and three server-based models.

  • OpenAI Releases GPT-5.6 Sol, Terra, and Luna in Limited Preview

    OpenAI announced the GPT-5.6 series on June 26, comprising Sol (flagship), Terra (low-cost high-performance), and Luna (lowest-cost high-speed). Sol achieved top performance in coding, cybersecurity, and biology. The preview is currently limited to a small number of trusted partners.

  • Intel Begins Risk Production of 18A-P Process β€” 9% Performance Gain

    Intel officially announced the start of risk production for its 18A-P process at the VLSI Symposium on June 16. Compared to 18A, it achieves a 9% performance improvement at the same power, and an 18% power reduction at the same performance, according to CNBC.

  • Apple WWDC 2026: Siri Gets Gemini Integration Β· Claude Support Added

    Apple announced at WWDC 2026 a new Siri powered by Google Gemini as its default AI, along with a multi-AI extension system. Starting with iOS 27, users can choose between ChatGPT, Gemini, and Claude as their AI model, according to TechCrunch.

  • Microsoft Unveils Its Own AI Reasoning Model MAI-Thinking-1

    Microsoft announced its in-house reasoning model MAI-Thinking-1 at Build 2026. It is the company's first reasoning model trained from scratch without distillation from third-party models. A code generation specialist model, MAI-Code-1-Flash, was also unveiled alongside it.

  • Google Gemini 2.5 Pro Achieves 82.4% on GPQA Diamond

    Google released Gemini 2.5 Pro with a Deep Think reasoning mode on June 22. It scored 82.4% on the GPQA Diamond benchmark and became the first publicly available model to achieve 89.8% on MMLU-Pro.

  • OpenAI and Broadcom Unveil AI Inference Chip 'JalapeΓ±o'

    OpenAI and Broadcom announced 'JalapeΓ±o,' a dedicated LLM inference ASIC, on June 24. Design and tape-out were completed in nine months, with a target of roughly 50% cost reduction over current GPUs.

  • SK Hynix Files for $29B Nasdaq ADR Under Ticker SKHY

    SK Hynix has filed for a Nasdaq ADR listing under the ticker SKHY, targeting approximately $29 billion in proceeds with trading set to begin July 10. The company holds a 60% share of the HBM market.

  • Qualcomm in Talks to Acquire AI Chip Startup Tenstorrent for $8–10B

    Qualcomm is in negotiations to acquire Tenstorrent, a RISC-V-based AI chip design firm founded by Jim Keller, for $8–10 billion, according to The Register.

  • SpaceX Acquires AI Coding Tool Cursor for $60 Billion

    SpaceX acquired Cursor just four days after its IPO in what is the largest acquisition ever. With Cursor β€” annual revenue of $4 billion, used by 64% of the Fortune 500 β€” now under Musk, concerns about model neutrality are spreading through the developer community.

  • ChatGPT Market Share Falls to 46.4% β€” Loses Majority for First Time

    According to a Sensor Tower report, ChatGPT's share of the AI chatbot market dropped to 46.4% for the first time ever. With Gemini and Claude together holding 38%, the AI market is rapidly reshaping.

  • Naver and NVIDIA Sign Agreement to Build Gigawatt-Scale AI Factory in Korea

    Naver and NVIDIA signed an agreement to expand sovereign AI infrastructure based on the NVIDIA DSX platform. Starting with a 55MW facility at GAK Sejong Data Center in 2027, the plan calls for expansion to 200MW by 2028.

  • NVIDIA's Vera Rubin AI Chip Platform Enters Full Production

    NVIDIA has begun full-scale manufacturing of the Vera Rubin AI superchip platform, featuring 72 GPUs and HBM4 memory. The chips are slated for AWS and Google Cloud delivery in H2 2026.

  • EU AI Act Transparency Obligations Take Effect August 2

    Article 50 of the EU AI Act kicks in on August 2, requiring companies to disclose when users interact with AI-generated content or chatbots. Non-compliant firms face substantial penalties.

  • Colorado AI Act Enforcement Delayed Again to January 2027

    Colorado's governor postponed the state's AI law enforcement from June 30 to January 1, 2027 after court injunctions and legislative amendments substantially narrowed corporate obligations.

  • Anthropic files confidential S-1, $965B valuation target October

    Confidential Form S-1 to SEC locks October Nasdaq target. $47B annual revenue run-rate. OpenAI-leading valuation. Up to $60B raise planned.

  • Nvidia, Microsoft unveil 1-petaflop RTX Spark super-chip

    Computex 2026: Nvidia and Microsoft jointly announced RTX Spark. Blackwell RTX GPU + 20-core Grace CPU enable 120B-parameter LLMs to run locally. Fall OEM rollout.

  • Google drops Gemini 3.5 Pro, 2M-token context window

    DeepMind shipped Gemini 3.5 Pro and ultra-fast Flash. 2M-token context and Deep Think reasoning mode outgun competitors on long analysis and code.

  • US bans foreign access to latest AI models; export controls bite

    June 12: National security declared reason for restricting non-US access to cutting-edge AI. AI models now classified as strategic asset. New regulatory frame begins.

  • Qualcomm eyes Tenstorrent for up to $10B; challenge Nvidia

    Early negotiations underway: $8B–$10B for RISC-V AI chip designer. Jim Keller joining signals Qualcomm's serious bid to break Nvidia-AMD lock on AI hardware.

  • Trump Signs Executive Order on AI Innovation and Security, Excluding Mandatory Pre-approval

    The White House issued the 'Executive Order on Promoting Advanced AI Innovation and Security' on June 2. It establishes voluntary cybersecurity clearing houses and frameworks for frontier model pre-evaluation while explicitly excluding mandatory government licensing, ensuring industry autonomy.

  • Samsung Electronics Begins HBM4 Mass Production, Targets Triple HBM Revenue This Year

    Samsung Electronics begins shipping HBM4 at maximum speed of 11.8Gbps, planning to triple HBM revenue this year. Amid surging AI chip demand, Korean and Taiwanese companies are reported to have near-monopoly status in cutting-edge memory markets.

  • EU Frontier AI Grand Challenge Winner: EUROPA Consortium Selected

    The European Commission selected EUROPA Consortium as winner of the Frontier AI Grand Challenge on June 19. The winning team receives up to 2.5% of NVIDIA Blackwell 6,000-chip cluster and EuroHPC supercomputing resources, reflecting EU's commitment to independent AI capabilities.

  • SK Hynix ships HBM4E 12-layer samples to Nvidia ahead of schedule

    SK Hynix delivered next-generation AI memory HBM4E 12-layer samples to Nvidia (its likely key customer) ahead of schedule. The chips feature 17% lower thermal resistance and 20% better energy efficiency, positioning them for the Nvidia Rubin Ultra platform and accelerating the AI memory cycle.

  • Anthropic files confidential S-1 at $965B valuation β€” October Nasdaq listing targeted

    Anthropic filed a confidential S-1 with the SEC on June 1, formally commencing its Nasdaq IPO process. Goldman Sachs, Morgan Stanley, and JPMorgan are co-lead underwriters. With annual revenue run-rate exceeding $47 billion, the company is targeting an October listing and competing with OpenAI for the title of biggest-ever AI IPO.

  • OpenAI GPT-5.6 launch imminent β€” chief scientist calls it 'a meaningful leap'

    GPT-5.6 is widely expected to launch between June 22-28, with prediction markets assigning an 83% probability of this week. OpenAI's chief scientist described it as 'a meaningful leap,' with a 1.5M-token context window and significantly enhanced agentic capabilities among the expected upgrades.

  • Anthropic pursues October Nasdaq IPO at $965B valuation

    After confidentially filing an S-1 with the SEC, Anthropic is pursuing a Nasdaq listing in October led by Goldman Sachs and JP Morgan. Secondary market valuation is $1.05-1.15T, making it the largest AI IPO ever.

  • SK Hynix ships HBM4E 12-layer samples ahead of schedule β€” 20% better power efficiency

    SK Hynix shipped its HBM4E 12-layer 48GB samples ahead of schedule, featuring 16Gbps/pin and 4.0TB/s bandwidth with 20% better power efficiency than the prior generation. They are competing for inclusion in Nvidia's 2027 Rubin Ultra platform.

  • Apple WWDC 2026 unveils Siri-Gemini integration and full AI overhaul

    Apple announced a revamped Siri powered by Google Gemini integration, supporting visual context awareness and complex cross-app task automation. iOS 27 and macOS Golden Gate were also unveiled alongside the updates.

  • GPT-5.4 exceeds human level 72.4% for first time in OSWorld benchmark...dawn of AI automation age

    OpenAI's GPT-5.4 achieved 75.0% on the OSWorld OS manipulation automation benchmark, surpassing the human expert baseline of 72.4% for the first time. Claude Opus 4.6 is also closely pursuing at 72.5%, with the age of universal desktop automation by AI agents now officially open.

  • Anthropic files confidential S-1 with valuation $965B...targets October Nasdaq listing

    Anthropic filed a confidential S-1 with the SEC on June 1 and officially confirmed October Nasdaq listing. With enterprise valuation of $965B and annualized revenue around $44B (roughly 5x its ~$9B at end-2025), the largest AI company IPO in history is anticipated. Goldman Sachs, JPMorgan, and Morgan Stanley are lead underwriters.

  • At G7 Evian, 3 major AI CEOs issue joint self-regulation declaration...new AI governance order formed

    At the G7 Evian summit, OpenAI's Altman, Anthropic's Amodei, and Google DeepMind's Hassabis jointly attended and signed a joint declaration calling for international cooperation on AI safety. Amodei emphasized that democratic nations must strengthen AI regulation cooperation.

  • GPT-5.4 surpasses human benchmark on desktop automation for first time

    OpenAI's GPT-5.4 achieved 75.0% on the OSWorld benchmark, exceeding the human expert score of 72.4% for the first time. Claude Opus 4.6 closely trails at 72.5%, marking a watershed moment where AI has officially crossed human-level performance in real-world PC task automation.

  • Anthropic files confidential IPO S-1, targets October Nasdaq listing at $965B

    Anthropic has filed a confidential S-1 with the SEC, targeting an October Nasdaq IPO at a $965 billion valuation β€” the largest AI IPO in history. Goldman Sachs, JP Morgan and Morgan Stanley are underwriters, seeking to raise more than $60 billion.

  • AI Big Three CEOs reach first joint voluntary regulation pledge at G7 Evian

    OpenAI's Altman, Anthropic's Amodei and Google DeepMind's Hassabis agreed to a joint AI voluntary regulation package at the G7 Evian summit β€” the first time all three CEOs have appeared at a G7 simultaneously. They also co-signed a letter calling for stricter synthetic DNA governance.

  • G7 Day 2: Three Rival AI CEOs Hold First-Ever Joint Session on Regulation

    OpenAI's Sam Altman, Anthropic's Dario Amodei, and Google DeepMind's Demis Hassabis gathered for the first time at the G7 Evian summit to discuss AI regulation and bio/cyber security threats. The three co-signed a letter to Congress urging tougher controls on synthetic DNA, staking out AI safety as a shared priority.

  • Anthropic IPO Roadshow Imminent, $965B Valuation Eyes Record-Breaking Listing

    After confidentially filing its S-1 with the SEC on June 1, Anthropic is on track for the largest IPO in history at a potential valuation of up to $965B. Its annualized revenue surpassed $30B by April 2026, with Goldman Sachs, JP Morgan, and Morgan Stanley named as co-lead underwriters.

  • OpenAI Launches GPT-5.4, Surpasses Human Baseline on Desktop Automation

    OpenAI released GPT-5.4 with a 1M-token context window and multi-step workflow automation. It scored 75% on the OSWorld-V desktop productivity benchmark, the first AI to exceed the human baseline of 72.4%. Google DeepMind unveiled Gemini Omni on the same day.

  • OpenAI, Google, Anthropic CEOs Attend G7 Summit Together for First Time

    Sam Altman, Dario Amodei, and Demis Hassabis will all attend the June 15–17 G7 summit in Γ‰vian. It is the first time CEOs of the three competing AI giants have simultaneously joined a G7 gathering, with AI regulation as a core agenda item.

  • Anthropic Files Confidential S-1 with SEC, Targets Record IPO

    Anthropic filed a confidential S-1 draft with the SEC on June 1, shortly after closing a $65 billion Series H at a $965 billion valuation. The public target valuation is up to $1.8 trillion with a $75 billion raise β€” potentially the largest IPO ever.

  • Claude Fable 5 Beats GPT-5.5 by 13 Points on FrontierMath's Hardest Problems

    Anthropic's Claude Fable 5 scored 88% on FrontierMath Level 4 β€” the hardest AI math benchmark β€” beating GPT-5.5's ~75% by 13 percentage points. The leap over the past year sets a new bar for mathematical reasoning.

  • ChatGPT Surpasses 1 Billion Monthly Users Just 3 Years After Launch

    OpenAI's ChatGPT recorded 1 billion monthly active users as of May 2026 β€” the fastest platform in history to hit that milestone, beating TikTok (5 years) and Instagram (8 years). Paid subscribers have also exceeded 50 million.

  • Anthropic Surpasses $30B Revenue Run Rate; Claude Code Hits $1B Annualized in 6 Months

    Anthropic achieved a $30 billion revenue run rate in Q1 2026 β€” 80x growth compared to January 2024. The coding agent 'Claude Code' surpassed $1 billion in annualized revenue just 6 months after launch.

  • Trump Orders AI Companies to Submit Frontier Models to Government 30 Days Before Release

    President Trump signed the 'AI Innovation and Security Promotion' executive order on June 2. It requires AI companies to voluntarily submit frontier models to the government 30 days before public release, and establishes a Treasury-led AI cybersecurity information-sharing hub.

  • Anthropic confidentially files IPO S-1 at $965B valuation

    Anthropic filed a confidential S-1 with the SEC on June 1, shortly after closing its $65 billion Series H round at a $965 billion valuation. An IPO at or above $1 trillion is widely projected if market conditions hold.

  • OpenAI files S-1 on June 8, targets September listing, Goldman Sachs leads

    OpenAI filed a confidential S-1 with the SEC on June 8, appointing Goldman Sachs and Morgan Stanley as lead underwriters for a September 2026 target listing. The company also launched six enterprise Codex plug-ins alongside the filing.

  • Apple WWDC 2026: Google Gemini in iOS 27 Siri; Claude and ChatGPT optional

    Apple announced at WWDC on June 8 that it will integrate Google's Gemini 1.2-trillion-parameter model into Siri under a deal worth roughly $10 billion annually. iOS 27's new 'Extension' feature lets users swap in Gemini, ChatGPT, or Claude as their AI engine.

  • 2026 tech layoffs top 183,966 across 247 events; AI cited in 55% of cases

    As of June 12, tech sector layoffs in 2026 total 183,966 across 247 events, including Oracle's 30,000 cut. Companies in 55% of events cited AI/automation as the primary driver β€” a record proportion β€” as investment surges while headcount contracts.

  • Apple WWDC 2026: a Google Gemini-powered 'Siri AI' debuts

    Tim Cook's final WWDC keynote (June 8) unveiled a rebuilt 'Siri AI' backed by a $1B-a-year Gemini deal. It ships this fall with iOS 27 and macOS 27 Golden Gate, and outside chatbots plug directly into Siri, reshaping the assistant landscape.

  • Microsoft Build: seven in-house AI models, led by MAI-Thinking-1

    Microsoft AI unveiled seven models at Build (June 2-3), including the MAI-Thinking-1 reasoner (35B, 256K context) plus image, coding and voice models. MAI-Code-1-Flash hit 51% on SWE Bench Pro at just 5B β€” its first full in-house stack.

  • Nvidia completes its HBM4 supply chain, certifying all three makers

    Jensen Huang certified Samsung, SK Hynix and Micron for Vera Rubin HBM4 on June 5. Vera Rubin promises 10x agent throughput over Grace Blackwell with second-half cloud deployment set, sharply expanding Korean memory orders.

  • Google's Gemini 3.5 Pro nears June launch with 2M tokens and Deep Think

    After Gemini 3.5 Flash in May, the Pro version nears GA in Vertex preview. Its 2M-token context and Deep Think reasoning are the largest in a production model, absorbing Ultra-tier work and squaring off with OpenAI and Anthropic's top models.

  • Nvidia's Vera Rubin enters full production for top clouds

    After its GTC Taipei debut (June 1), the Vera Rubin NVL72 entered full production with AWS, Google Cloud, Azure and OCI as priority second-half customers. Lower inference-token costs versus Grace Blackwell reignite the datacenter GPU cycle.

  • Apple WWDC: Gemini-based new Siri, end of Intel Mac support

    At Tim Cook's final keynote on June 8, Apple unveiled a new Siri built on Google Gemini. iOS 27 supports back to the iPhone 11, while macOS Golden Gate fully ends official support for Intel Macs.

  • Nvidia Vera Rubin enters production, Samsung/SK/Micron HBM4 certified

    On June 5 Jensen Huang confirmed Samsung, SK Hynix and Micron all passed HBM4 certification for Vera Rubin. SK Hynix takes 60-70% of volume and Samsung 25-30%, with shipments starting in Q3.

  • MS Build unveils 7 in-house AI models, coding model lands in Copilot

    At Build on June 2 Microsoft unveiled seven MAI models, including reasoning model MAI-Thinking-1 built without OpenAI data. Coding-specialist MAI-Code-1-Flash cuts token use 60% and shipped instantly to Copilot.

  • Google Gemini 3.5 Pro nears June launch with 2M tokens, Deep Think

    Gemini 3.5 Pro, previewed at Google's May I/O, is nearing its formal June launch. It features a 2-million-token context window and a Deep Think reasoning mode, currently in an enterprise-only limited preview.

  • Nvidia enters PC chips with RTX Spark; AMD, Intel, Qualcomm fall

    At Computex on June 1 Jensen Huang declared Nvidia's entry into PC chips with RTX Spark, pairing Blackwell GPUs and an Arm-based CPU. Shares of AMD, Intel and Qualcomm all fell, reflecting intensifying competition.

  • Apple WWDC: Gemini-based new Siri, Tim Cook's farewell keynote

    At WWDC 2026 on June 8, Apple unveiled a new Siri rebuilt on Google's 1.2-trillion-parameter Gemini model. Tim Cook formalized his Sept. 1 CEO departure, with hardware SVP John Ternus to succeed him.

  • iOS 27 and macOS Golden Gate unveiled, Intel Mac support ends

    Apple released iOS 27 and macOS Golden Gate betas. Gemini Siri features require iPhone 15 Pro or later, and macOS shifts to Apple Silicon only, ending official support for Intel Macs.

  • Apple unveils homeOS developer preview, HomePad due this fall

    Apple unveiled a developer preview of homeOS for a HomePad with a 7-inch display and A18 chip. It runs FaceTime without an iPhone, with the related home hardware lineup due this fall.

  • Nvidia Vera Rubin in production, HBM4 from 3 suppliers locked

    Nvidia locked Samsung, SK Hynix and Micron as HBM4 suppliers for Vera Rubin, with commercial shipments due in Q3. SK Hynix is said to handle a large share of the initial volume.

  • Google Gemini 3.5 Pro nears June launch, 2M tokens and Deep Think

    Google's Gemini 3.5 Pro, unveiled at I/O 2026, nears its June release. It offers a 2-million-token context and a Deep Think reasoning mode, with a limited Vertex AI preview underway.

  • Apple WWDC 2026: Gemini-based new Siri, Tim Cook's farewell keynote

    On June 8, Apple unveiled a Siri rebuilt on a Google Gemini 1.2-trillion-parameter model. It supports multi-step tasks, conversation memory, and a standalone Siri app, using a three-tier routing of on-device, Private Cloud, and Gemini.

  • iOS 27, macOS Golden Gate unveiled, Liquid Glass adds transparency slider

    Apple unveiled iOS 27 and macOS 'Golden Gate' betas. Liquid Glass adds a slider to adjust from ultra-transparent to fully tinted, plus 70% faster photo loading, 80% faster AirDrop, and AI photo-editing features.

  • Apple unveils homeOS developer preview, teasing a smart-home hub era

    At WWDC 2026, Apple unveiled a developer preview of homeOS, a new OS for a HomePad with a 7-inch display and A18 chip. It can run FaceTime without an iPhone and teased a home hardware lineup launching this fall.

  • Nvidia Vera Rubin enters production, Samsung/SK/Micron HBM4 confirmed

    On June 5 in Seoul, Jensen Huang confirmed HBM4 certification for Samsung, SK Hynix, and Micron. Vera Rubin, whose mass production was announced June 1 at GTC Taipei, ships in Q3 and offers 10x the throughput of Grace Blackwell.

  • Google Gemini 3.5 Pro nears June launch, 2M-token context, Deep Think

    Google's Gemini 3.5 Pro, teased at I/O 2026, is nearing its June launch. It features a 2-million-token context and Deep Think reasoning but remained unreleased in early June, with pricing expected around 15 dollars per million input tokens.

  • Apple WWDC 2026 reveals Gemini-powered Siri and iOS 27

    Apple unveiled iOS 27 and a complete Siri overhaul at the WWDC 2026 keynote on June 8. The new Siri runs on a Google Gemini-based model of approximately 1.2 trillion parameters, costing 1 billion dollars annually, featuring a chatbot-style interface.

  • Nvidia confirms Vera Rubin HBM4 supply chain; Jensen Huang visits Seoul

    Nvidia's Jensen Huang visited Seoul on June 5 and officially certified Samsung, SK Hynix, and Micron as suppliers for the next-generation Vera Rubin accelerator's HBM4 memory. SK Hynix will handle 60-70% of supply, with first shipments in Q3.

  • Microsoft Build 2026: Unveils 7 proprietary AI models, declares Windows as AI OS

    Microsoft revealed seven proprietary MAI models at Build 2026 on June 2-3, signaling reduced reliance on OpenAI. The company repositioned Windows as an AI agent execution platform.

  • Google's Gemini 3.5 Pro launching in June with 2M-token context and Deep Think

    Following May's I/O where it introduced Gemini 3.5 Flash, Google previewed Gemini 3.5 Pro for June launch. The model features a 2-million-token context window and Deep Think reasoning, currently in limited preview on Vertex AI.

  • ChatGPT doubles memory capacity, auto-saves personalization

    ChatGPT rolled out a major memory system upgrade on June 4. Auto-updates and enhanced personalization are now live, with Plus and Pro subscribers gaining double memory capacity for better long-conversation retention.

  • Nvidia unveils RTX Spark superchip, targets Intel/AMD-dominated PC market

    On June 1 Jensen Huang unveiled the RTX Spark superchip, an integrated CPU+GPU co-developed with MediaTek. It will ship this fall in Dell, HP, Lenovo and MS Surface devices; AMD, Intel and Qualcomm shares fell on the day.

  • Nvidia's Vera Rubin enters production; Samsung, SK hynix, Micron certified for HBM4

    At GTC Taipei on June 1, Huang announced full production of the next-gen Vera Rubin platform. All three of Samsung, SK hynix and Micron were certified as 6th-gen HBM4 suppliers, which Huang reaffirmed on a June 5 Korea visit.

  • Huang asks for more HBM; SK hynix seen taking 60-70% of HBM4

    On June 2 Huang publicly asked SK hynix to expand output, citing a chip supply crunch. The industry estimates SK hynix will take 60-70% of HBM4 volume, Samsung 25-30% and Micron the rest.

  • Microsoft unveils in-house coding AI 'MAI-Code-1-Flash,' cuts OpenAI reliance

    At Build on June 2, Microsoft unveiled its first in-house coding model, MAI-Code-1-Flash, which turns natural-language descriptions into app and web source code. The move to reduce OpenAI reliance intensifies coding-model competition with Google and Anthropic.

  • Apple WWDC opens June 8, teases Gemini-powered Siri overhaul

    Apple is expected to unveil a new Siri powered by Google's Gemini model alongside iOS 27 and macOS 27 at its June 8 keynote. Apple Intelligence expansions across Wallet, Safari and Shortcuts, plus a 7-inch HomePad, are also rumored.

  • Nvidia certifies Samsung, SK Hynix and Micron as Vera Rubin HBM4 suppliers

    Nvidia said in Seoul on June 5 that CEO Jensen Huang officially certified Samsung, SK Hynix and Micron as HBM4 suppliers for its Vera Rubin platform. Vera Rubin offers 10x the agentic-AI throughput of the prior Grace Blackwell generation.

  • Computex follow-up: RTX Spark superchip specs and 30-plus partners detailed

    The RTX Spark unveiled at Computex 2026 is a superchip combining a 20-core Grace ARM CPU and Blackwell GPU, offering up to 128GB of unified memory and 1 petaflop of AI performance. More than 30 laptops from ASUS, HP and Microsoft Surface ship this fall.

  • Microsoft unveils seven in-house AI models at Build 2026; MAI-Thinking-1 tops benchmarks

    Microsoft unveiled seven in-house AI models at Build 2026 on June 2. Flagship MAI-Thinking-1, a 35-billion-active-parameter MoE reasoning model, scored 97% on AIME 25 and 53% on SWE Bench Pro and was trained independently without OpenAI data.

  • WWDC D-2 iOS 27 leaks: standalone Siri app, waitlist, notification UI changes

    Ahead of Apple's June 8 WWDC 2026 keynote, last-minute iOS 27 leaks surfaced via Bloomberg on June 5. The new Siri is redesigned as a standalone chatbot app, may use a feature-access waitlist, and brings UI changes such as notifications sliding in from the left.

  • Anthropic files confidential IPO S-1 with the SEC; $965B value, $47B revenue

    Anthropic filed a confidential draft S-1 with the US SEC on June 1. After a recent $65 billion Series H, it is valued at about $965 billion, with May annualized revenue of $47 billion, roughly 4x year-on-year growth.

  • Nvidia unveils RTX Spark, opening the Windows-PC AI superchip era

    Unveiled at Computex 2026, RTX Spark pairs a Blackwell GPU's 6,144 CUDA cores with a 20-core Grace CPU over NVLink-C2C. It ships this fall in Asus, Dell, HP, Lenovo and MS Surface laptops, with a 3-gen roadmap shown.

  • Nvidia enters full Vera Rubin production; HBM4 from Samsung, SK, Micron

    Jensen Huang confirmed full production of the next-gen rack-scale Vera Rubin platform at GTC Taipei on June 1. Some 350 plants across 30 countries take part, with first AWS, Google, Azure and Oracle shipments this summer.

  • Microsoft Build 2026 unveils 7 in-house AI models, easing OpenAI reliance

    Microsoft announced seven in-house MAI-brand models at Build on June 2. The headliner is the 35B-parameter, 256K-context reasoning model MAI-Thinking-1, alongside code, image and voice models that cut its OpenAI dependence.

  • Apple WWDC opens June 8, teasing a fully redesigned LLM-based Siri

    Apple unveils a chatbot-style new Siri at its June 8 keynote (10am PT). It adds on-screen awareness, personal context and app control, moves the Siri icon to the Dynamic Island and debuts iOS 27 and the '27' OS lineup.

  • AMD reveals Ryzen AI Max PRO 400 and Halo dev platform at Computex

    AMD announced the Ryzen AI Max PRO 400 series, a Zen 5 Strix Halo successor supporting up to 192GB to run larger LLMs. The Halo dev platform with the AI Max+ PRO 495 opens for pre-order in June from $3,999.

  • Nvidia unveils RTX Spark superchip, targeting Windows PCs

    At Computex 2026 Jensen Huang unveiled RTX Spark, pairing a 20-core Arm CPU with a Blackwell GPU and 128GB unified memory. Recasting Windows as an agentic AI OS, some 30 notebooks from Dell, HP and MS ship this fall.

  • Nvidia's Vera Rubin enters full production as AI-factory engine

    Nvidia announced full production of its Vera Rubin platform, pairing an 88-core Vera CPU with Rubin GPUs. Built on TSMC 3nm with Samsung, SK Hynix and Micron HBM4, it lifts agentic throughput 10x.

  • Microsoft unveils 7 in-house AI models at Build to cut OpenAI reliance

    At its Build conference Microsoft announced seven in-house AI models, including MAI-Code-1-Flash and MAI-Thinking-1. Mustafa Suleyman said they improve cost efficiency 10x over GPT-5.5 to reduce OpenAI reliance.

  • AMD unveils Ryzen AI Max Pro 400, runs 300B models locally

    At Computex 2026 CEO Lisa Su unveiled the Ryzen AI Max Pro 400 series, the first to run a 300-billion-parameter model locally on a PC, an on-device push aimed at cloud-cost burdens and data-privacy concerns.

  • Apple WWDC opens June 8, teasing a redesigned Siri debut

    Apple unveils its OS 27 updates, including iOS 27, at the June 8 WWDC 2026 keynote. A new Apple Intelligence-based Siri, delayed twice, is the centerpiece, alongside AI features in Wallet, Safari and Shortcuts.

  • Nvidia RTX Spark, an Arm-based Windows superchip, unveiled

    At Computex, Nvidia unveiled the N1X chip with a 20-core Grace CPU, 6,144-CUDA Blackwell GPU and 128GB unified memory. It ships in fall 2026, challenging Apple's M5.

  • Intel Nova Lake and Arc G3 revealed at Computex keynote

    CEO Lip-Bu Tan previewed the up-to-52-core Nova Lake desktop CPU and a handheld-focused Arc G3 chip in his June 2 keynote, with a second-half launch flagged.

  • TSMC 2nm GAA ramp accelerates, Apple monopolizes volume

    TSMC's N2 GAA output is seen topping 100,000 wafers a month by late 2026. Apple has secured over half of total volume, intensifying supply constraints for rivals.

  • Claude Opus 4.8 tops AI index, beating GPT-5.5

    Anthropic's Opus 4.8, launched May 28, took the top spot on the Artificial Analysis index at 61.4, beating GPT-5.5. Dynamic workflows and 4x-improved honesty drew attention.

  • Apple WWDC26 opens June 8, teasing iOS 27 and AI Siri

    WWDC26, running June 8-12, is expected to bring iOS 27, a much-stronger Siri and Google Gemini integration. Whether Apple Intelligence finally ships in earnest is in focus.

  • Nvidia's next-gen AI platform Vera Rubin enters production, ships in fall

    Jensen Huang announced it in his June 1 Computex keynote. The 88-core Vera CPU and Rubin GPU use TSMC 3nm, cutting inference token cost up to 10x versus Blackwell. AWS, Google, MS and Oracle will adopt it this year.

  • Nvidia enters PCs with Arm superchip RTX Spark

    Nvidia unveiled the RTX Spark, also called N1X, with a 20-core Arm CPU, 128GB unified memory and a 6,144-CUDA-core Blackwell GPU. It launches this fall in MS, Dell, HP, Asus, Lenovo and MSI Windows laptops, challenging Apple's M5.

  • TSMC begins 2nm mass production, targeting 100k wafers/month this year

    TSMC began high-volume 2nm (N2) production at its Kaohsiung Fab 22, the first large-scale use of GAA nanosheet transistors replacing FinFET. With 2026 capacity fully booked, Apple takes over half, with Qualcomm, MediaTek, AMD and Nvidia also secured.

  • Intel to unveil Nova Lake at June 2 Computex keynote

    CEO Lip-Bu Tan keynotes June 2 at 1:30 PM, showing the 52-core Nova Lake desktop, the handheld Arc G3, and the 288-core Clearwater Forest server chip. Over 90% of Nova Lake tiles, however, are made by TSMC on N2.

  • Anthropic ships Claude Opus 4.8, topping GPT-5.5 and Gemini

    Unveiled May 28, the latest flagship ranks first on SWE-Bench Pro coding (69.2%) and OSWorld computer use, among others. Pricing held at $5 per million input tokens. It debuted as the IPO race with OpenAI heats up.

  • Nvidia, MS Computex debut N1X Arm chipΒ·new PC era

    Jensen Huang unveils N1 and N1X on June 1 at 11 AM Taipei time. The N1X packs 20 Arm cores and 6,144 CUDA Blackwell integrated graphics. Dell, Lenovo, Asus, and MSI are preparing Windows on Arm devices.

  • Computex 2026 opensΒ·Vera Rubin NVL72 AI factory focus

    Computex June 1-5 centers on Vera Rubin NVL72 supercomputer. Training is 3.5x and inference 5x faster than Blackwell, with inference cost cut to 1/7th. Huang previews unrevealed surprise products, accelerating AI infrastructure competition.

  • Anthropic Claude Opus 4.8 launchΒ·agent coding 69.2% autonomous

    Opus 4.8 released May 28 lifted agent coding scores from 64.3% to 69.2% and cut code defects 4x. Same price with 2.5x faster Fast Mode and Claude Code dynamic workflows debut.

  • Apple WWDC26 6/8 opensΒ·AI, Siri overhaul focus

    Apple's developer conference opens June 8 at 10 AM PT. Delayed Apple Intelligence and chatbot-style new Siri are core, with iOS 27 updates and M5 Mac Studio and Mac mini hardware refreshes expected.

  • TSMC 3nm H2 max 15% hikeΒ·2027 +5-10% forecast

    TSMC is raising H2 3nm wafer prices up to 15% due to AI and ASIC demand, at roughly $20K per wafer. Fab 18 monthly capacity jumps from 130K to 160-175K units. H2 gross margin guided 68-70% with additional 2027 increases forecast.

May 2026 Β· 136

April 2026 Β· 150

  • Anthropic Targets $900B Valuation, Eclipsing OpenAI's $850B

    Negotiating funding at $900B. ARR reached $30B; Google's $40B commitment (cash and compute) fuels growth trajectory.

  • Claude Opus 4.7 Launches; Mythos Preview Restricted to Cybersecurity

    Anthropic released Claude Opus 4.7 as flagship general-purpose model. Cybersecurity-specialized Claude Mythos Preview available exclusively to select enterprise customers.

  • Anthropic-Google-Broadcom Partner on 5GW Compute Capacity

    Joint partnership secures 5GW computing capacity, ramping gradually from next year. Infrastructure advantage reflected in valuation premium.

  • Pentagon Adopts Google Gemini for Classified Networks

    Pentagon and Google agreed to deploy Gemini AI system on classified networks. Google follows OpenAI in winning classified government contracts; Anthropic pursues litigation over safeguard mandates.

  • Google Launches New AI Agent Series to Challenge OpenAI and Anthropic

    Google released April 22 new AI agent lineup competing head-to-head with OpenAI and Anthropic. Targets search, productivity, and enterprise workflow domains simultaneously.

  • Big Four Tech Capex Approaches $1 Trillion Combined

    Meta projects capex of $125-145 billion, Alphabet $175-185 billion, and Amazon approximately $200 billion. AI infrastructure investment cycles are accelerating.

  • OpenAI Reportedly Missed Internal User and Revenue Targets

    A WSJ report suggesting OpenAI fell short of internal goals has triggered investor concerns about the ROI on large data center expenditures, triggering broader AI stock declines on April 28.

  • Musk Testifies to $38 Million Initial Investment in OpenAI Litigation

    In ongoing litigation, Musk testified that Altman and Brockman transformed OpenAI from a nonprofit to Microsoft's de facto subsidiary, leaving him feeling 'foolish' for his approximately $38 million seed contribution.

  • 'World Models' Gain Academic Attention as Next-Generation AI

    Nature's April 27-30 feature identifies world models trained on simulated environments as a cornerstone of next-generation AI development.

  • MIT Unveils Novel Techniques for Efficient AI in Resource-Constrained Settings

    MIT has published new methods that simultaneously improve accuracy and efficiency for AI models operating under resource constraints. IBM also announced a new quantum-AI research collaboration.

  • OpenAI Weekly Users and Revenue Miss Internal Targets β€” WSJ 4/28

    S&P down 0.49%; semiconductor joint decline stems directly from this report. Questions raised on massive capex sustainability for AI data centers.

  • Anthropic ARR $30B β€” Announced 4/7 Overtake of OpenAI ($25B)

    Claude Opus 4.7 launches 4/16: first in agentic coding, multidisciplinary reasoning, and tool use benchmarks. Ten financial AI agents launch, competing with Goldman and MS.

  • TSMC Q1 Revenue $35.9B (+41%), EPS $3.49 (+65%)

    Announcement 4/16. Nvidia reserves bulk of CoWoS packaging capacity; some outsourced. A13/N2U 2029 production roadmap released.

  • MS Azure +40%, Copilot Seats 20 Million β€” Q3 Earnings

    MSFT adjusted EPS $4.27, beats consensus by $0.21. Copilot seats grew by 5 million quarterly to 20 million cumulative; Azure Q4 guidance 39–40%.

  • Google Launches AI Agents β€” OpenAI and Anthropic Challenge

    Bloomberg 4/22: Google unveils new AI agent product targeting enterprise markets. OpenAI and Anthropic pursuing consulting joint venture.

  • Google Commits Up to $40 Billion and 5GW to Anthropic

    Alphabet will invest up to $40 billion in Anthropic across phases, with exclusive 5-gigawatt compute allocation. The initial tranche of $10 billion values Anthropic at $350 billion.

  • Anthropic ARR Surges to $30 Billion

    Anthropic's annualized revenue (ARR) accelerated from $10 billion at end-2024 to $90 billion at end-2025, and reached $30 billion in early April 2026.

  • Anthropic-Google Chip and Cloud Agreement Worth $200 Billion

    Anthropic reportedly agreed to pay Google $200 billion for access to TPU chips and cloud infrastructure.

  • Trump Administration Expands Direct AI Model Testing

    The White House announced expanded oversight procedures for directly testing Google, Microsoft, and xAI models.

  • Claude Mythos Limited Private Release

    Anthropic is operating Claude Mythos in private release with major tech and financial infrastructure companies ahead of general availability.

  • Google-Anthropic $40 Billion Deal; 5GW Compute

    Alphabet committed $10 billion immediately (valuing Anthropic at $350 billion) plus an additional $30 billion contingent on performance, along with 5GW of compute resources.

  • Anthropic Mythos; Cybersecurity Focus

    Anthropic unveiled its Mythos model in April as the most powerful to date, released in limited form to select partners. Cybersecurity applications are emphasized.

  • OpenAI Surpasses $25 Billion ARR; Exploring H2 2026 IPO

    OpenAI has exceeded $25 billion in annualized revenue and has entered preliminary stages for a potential H2 2026 IPO. GPT-5.4, revealed March 5, set new records on computer use benchmarks.

  • Google Launches AI Agent Builder Toolkit

    Google unveiled an AI agent building tool on the 22nd aimed at enterprise workflow automation, directly challenging OpenAI and Anthropic.

  • Anthropic Explores Funding at $90 Billion Valuation

    Anthropic is considering additional funding at a $90 billion valuation, which would establish it as the highest-valued independent AI company, surpassing OpenAI.

  • Anthropic ARR Reaches $30B (as of Early April)

    Surge from $1B end-2024 to $9B end-2025 to $30B April 2026.

  • Google Cloud Revenue +63% β€” Backlog Reaches $460B

    Q1 revenue $20 billion; backlog doubled in a single quarter. Company acknowledges near-term compute constraints.

  • Alphabet Q1 Revenue $109.9B (+22%) β€” Net Income $62.58B

    Eleven consecutive quarters of double-digit growth; EPS $5.11, up 81%.

  • OpenAI ARR Surpasses $25B β€” Potential IPO by Year-End

    AI capital duopoly with Anthropic; reports of 2026 second-half public offering review.

  • Anthropic Releases 'Mythos' Model β€” Limited Partner Release

    Emphasis on cybersecurity applications; positioned as company's most capable model.

  • Google Commits $10B Immediate, Up to $30B Additional in Anthropic

    $350B valuation; computing resources bundled. Strategic pivot to diversify AI investments.

  • OpenAI Shifts ChatGPT Ads to Cost-Per-Click Model

    Migration from impression-based CPM to click-based CPC aligns with marketing industry standards.

  • OpenAI Launches Workspace Agents

    AI agents execute automated workflows across Slack, Gmail, and other enterprise tools; targets business and education verticals.

  • Meta Secures 1GW Space-Based Solar Reservation

    Partnership with Noon Energy includes 100GWh ultra-long-duration energy storage to hedge data center power demand.

  • Big Tech AI Infrastructure Spending Forecast: $700B Annually

    Fortune reports 2026 big tech capex for data centers and chips will reach $700Bβ€”no clear end in sight.

  • Google Announces Virtual Bot Inbox and Workspace Agent Tools

    At Google Cloud Next 2026, Google unveiled enterprise AI agent builders, a dedicated inbox for virtual bots to post progress updates, and Workspace-integrated agents.

  • Anthropic Claude Opus 4.7 Leads Coding Benchmark at 87.6% on SWE-bench

    Claude Opus 4.7, released April 16, achieved 87.6% on SWE-bench Verified, ranking first in coding benchmarks while maintaining pricing identical to version 4.6.

  • TSMC Advanced Process Capacity Fully Booked Through 2028

    Nvidia, Apple, AMD, and Qualcomm have reserved all TSMC 2nm capacity through 2028, indicating advanced process nodes are at full utilization.

  • Samsung Achieves 50% Yield on 2nm Process Ahead of Mass Production

    Samsung's 2nm node achieved 50% yield ahead of second-half mass production, positioning Texas facilities to attract TSMC overflow customers.

  • Nvidia Revenue Reaches $215.9 Billion, Global #2 Semiconductor Player

    Global semiconductor revenue rankings for 2026: Samsung $235.7 billion (1st), Nvidia $215.9 billion (2nd), TSMC $121.9 billion (3rd), SK Hynix $68.7 billion.

  • Merck-Google Cloud $1B Agentic AI R&D Partnership

    Gemini Enterprise deployment spans R&D, manufacturing, commercial, and enterprise functions.

  • Thinking Machines Lab Secures Multi-Billion GB300 Contract from Google

    Mira Murati's startup locks infrastructure; AI compute competition intensifies.

  • Cognizant Launches 'Agentic Retail CX'

    AI agents autonomously handle abandoned carts, returns, and personalized promotions.

  • Toyota Woven City Unveils AI Vision Engine

    Vision Language Model deployment strategy 'Kakezan' announced for broad industrial application.

  • AMD Ryzen 9000 Dual 3D V-Cache

    Desktop local generative AI workload optimization; same-day April 22 launch.

  • Google Cloud Next '26 draws 32k attendees; 260 announcements

    Unveiled Gemini Enterprise Agent Platform, 8th-generation chip, and Gemma 4 open model. Google declared the 'Agentic Era'.

  • Gemma 4: 'most powerful open model per byte'

    Launched alongside Deep Research Max for data analysis and Colab Learn Mode coding tutor.

  • OpenAI annualized revenue surpasses $25 billion; IPO under review

    GPT-5.5 Instant system card updated with improved intelligence, clarity, and personalization. IPO possible in late 2026.

  • Alibaba Cloud/Qwen announces new model release April 21

    New release on April 21 accelerates open-source LLM competition.

  • Bloomberg Tech: Google's new AI chip challenges NVIDIA

    Following Google's 8th-generation TPU announcement, Bloomberg reported on competitive pressures in the training and inference acceleration chip market.

  • Anthropic Unveils Opus 4.7 With Enhanced Coding, Agent, and Vision Capabilities

    Public launch on April 16. Mythos, the security-specialized variant, available in limited preview to 11 partners.

  • Google Cloud Next 2026: Gemini Enterprise, 8th-Gen TPU, Gemma 4

    The event centered on agentic AI announcements, including open-model Gemma 4, Deep Research Max, and Colab Learn Mode.

  • Nvidia Saturates TSMC CoWoS Advanced Packaging; ASE and Amkor Pick Up Overflow

    Demand surge forces TSMC to outsource certain stages to competing assemblers. CoWoS-L receives its first Blackwell GPU deployment.

  • OpenAI Annualized Revenue Exceeds $25B; IPO Process Underway

    Reports indicate OpenAI has surpassed $25B in annualized recurring revenue, entering early-stage public market preparation.

  • JPMorgan Elevates AI to Core Infrastructure; $19.8B Tech Budget

    JPMorgan reclassified AI from R&D to critical infrastructure, allocating ~$19.8B of its 2026 technology budget and 2,000 AI personnel.

  • Anthropic Launches Claude Design (April 17)β€”First Visual Anthropic Labs Product

    Anthropic revealed Claude Design on April 17 as a dedicated visual-work offering from Anthropic Labs. A Claude Mythos Preview that auto-discovered thousands of zero-day vulnerabilities across OS and browser environments led the team to defer general release.

  • OpenAI Announces Workspace Agents and Releases GPT-5.5 (April 22-23)

    OpenAI revealed Workspace Agentsβ€”a custom-GPT successor integrating Slack, Salesforce and beyondβ€”on April 22 as a research preview, then deployed GPT-5.5 and GPT-5.5 Pro across ChatGPT and Codex tiers on April 23. Workspace Agents remain free through May 6.

  • TSMC's HPC Mix (AI/5G) Reaches 61% of Sales; Q1 Revenue $35.9B

    TSMC's Q1 revenue hit $35.9B (+40.6% year-over-year), with HPCβ€”AI and 5G combinedβ€”comprising 61%. CapEx guidance increased to $52-56B. TSM stock dipped after-hours on April 17, but the AI infrastructure cycle is viewed as accelerating.

  • ASML Raises 2026 Guidance; Korean Customers Place $8B EUV Orders Each

    ASML boosted 2026 guidance to €36-40B on April 16. Samsung and SK Hynix each placed roughly $8 billion EUV orders, with Korean revenue reaching half of quarterly sales. The AI memory cycle drove simultaneous equipment, design and foundry expansion.

  • Alibaba Open-Sources Qwen3.6-35B-A3B Sparse MoE Model

    Alibaba's Tongyi Lab released the sparse mixture-of-experts model Qwen3.6-35B-A3B on April 15, reaching 1,097 Hacker News points and leading that week's open-source LLM discourse. The model signals narrowing gaps in coding and reasoning performance versus closed-source systems.

  • Anthropic Claude Opus 4.7 (April 16)β€”'Safer Than Mythos'

    Anthropic launched Claude Opus 4.7 with enhanced coding, instruction-following, and enterprise task performance. Partner-only Mythos variant clears cybersecurity use cases while remaining unpublished.

  • OpenAI Announces GPT-5.5β€”Coding, Computer Use, Deep Research

    OpenAI unveiled GPT-5.5 on April 23, emphasizing coding prowess, computer-use ability, and deeper research capacity. Frontier model competition accelerates.

  • Google TurboQuant at ICLR 2026β€”6x KV Cache Memory Reduction

    Google Research presented TurboQuant at ICLR 2026, cutting inference-time KV cache memory burden by 6x. Critical lever for LLM operational costs.

  • Northwestern: Artificial Neurons Communicate with Real Brain Cells (April 18)

    Northwestern engineers announced breakthrough in printing artificial neurons capable of communicating with living neuronsβ€”paradigm shift for neurotech and brain-computer interfaces.

  • DeepSeek Valued at $20B+β€”Alibaba, Tencent In Talks

    Alibaba and Tencent discuss investment in DeepSeek at $20B+ valuation. New flagship model reveal expected April 24, one year after breakthrough.

  • Anthropic discusses $900 billion valuation funding round

    Discussions are underway for a capital raise at a $900 billion valuation, surpassing OpenAI. ARR reached $30 billion in early April.

  • Google commits up to $40 billion to Anthropic investment

    Starting with $10 billion (at a $350 billion valuation), Google's total commitment can reach $40 billion upon milestone achievement. Google and Amazon are harvesting half of AI boom profits through Anthropic stakes.

  • Frontier Model Forum coordinates against Chinese model cloning

    OpenAI, Anthropic, and Google share information to block adversarial distillation-based model theft by Chinese AI firms.

  • TSMC Q1 earnings validate super-cycle thesis

    Revenue reached NT$1.134 trillion with net income of NT$572.48 billion. Q2 guidance sits at $39–40.2 billion USD (up 10% QoQ). Full-year 2026 USD revenue expected to grow 30%+ as forecast.

  • Claude Opus 4.7 goes live with price held steady

    Opus 4.7 exceeds Opus 4.6 across coding, multi-domain reasoning, tool use, and agentic computer interaction. Automated cybersecurity-risk blocking is included. Available on AWS, Azure, and GCP simultaneously.

  • Anthropic Unveils Claude Opus 4.7 and Claude Design Tool

    Anthropic revealed Claude Opus 4.7 alongside Claude Design, a visual generation tool in research preview for Pro, Max, Team, and Enterprise users.

  • TSMC Q1 Profit Jumps 58% on Relentless AI Chip Demand; 2026 Guidance Raised

    TSMC's Q1 net profit surged 58% to NT$572.48 billion, marking four straight record quarters. AI demand proved resilient despite geopolitical shocks, prompting the chipmaker to lift 2026 USD revenue guidance above 30%.

  • AI Chip Startups Attract Record Funding as OpenAI and Nvidia Each Deploy $20B

    With OpenAI and Nvidia each backing AI chip startups to the tune of $20 billion, 2026 AI chip startup funding has hit an all-time high of $8.3 billion.

  • Meta Debuts Major AI Model After Recruiting Alexandr Wang for $14B Deal

    Following its $14 billion acquisition of Alexandr Wang, Meta unveiled its first major AI model, signaling renewed competition with Google and OpenAI.

  • OpenAI Closes $122B Fundraising Round to Fuel Next-Phase AI Acceleration

    OpenAI completed a $122 billion capital raise aimed at accelerating the next phase of AI infrastructure development, signaling the broadening capital supercycle.

  • OpenAI Debuts GPT-5.4-Cyber; TAC Program Expands

    OpenAI launched GPT-5.4-Cyber on April 14 for verified cybersecurity professionals, expanding the Trusted Access program to thousands of individuals and hundreds of teams. Additional capabilities including binary reverse-engineering granted.

  • Microsoft Acquires Norway Stargate Data Center from OpenAI

    Bloomberg reported April 14 that Microsoft, through neocloud subsidiary Nscale, acquired a 30,000-GPU Nvidia facility in Narvik, Norwayβ€”the second Stargate siteβ€”signaling a shift in OpenAI's strategy and Microsoft's increasing infrastructure control.

  • Nvidia and OpenAI Pledge 10GW Vera Rubin Infrastructure

    Nvidia and OpenAI signed a letter of intent for a minimum 10GW system deployment using the Vera Rubin platform. The first 1GW is scheduled for H2 2026. Anthropic, Meta, and Mistral also exploring Vera Rubin adoption.

  • Anthropic Explores Custom AI Chip Development

    Anthropic is exploring custom AI chip design to manage Claude operational costs and capacity. Separately from NVIDIA criticism at Davos, Vera Rubin utilization continues. The company is negotiating a Series D at $900 billion valuation as of late April.

  • ASML Guidance Raised Despite China Headwind

    ASML raised 2026 revenue guidance to €36-40 billion, yet U.S. export controls on China reduced China mix exposure and share fell 6%. TSMC posted Q1 net income up 58% with fourth consecutive record quarter; 2026 capex guidance raised to $52-56 billion.

  • OpenAI Launches GPT-5.4-Cyber With Restricted Rollout

    On April 14, OpenAI released its defensive cybersecurity model, GPT-5.4-Cyber, to select Trusted Access for Cyber participants. The move pivots policy toward verified access instead of blanket restrictions.

  • Anthropic Mythos and Project Glasswing Reshape Cyber Defense

    Anthropic's April 7 Mythos Preview claims autonomous discovery and exploitation of 99% unpatched zero-days. Project Glasswingβ€”a 40-member alliance including AWS, Apple, Microsoft, Google, CrowdStrike, and Palo Alto Networksβ€”controls access.

  • Google Announces Gemini Enterprise Platform as Agentic Era Cornerstone

    In April, Google unveiled the Gemini Enterprise Agent Platform, eighth-generation chips, and open-model Gemma 4, positioning them as the agentic era's bedrock. Accenture launched the Gemini Enterprise Acceleration Program in parallel.

  • InnoEX 2026: 100+ Humanoids on Display

    Hong Kong's InnoEX (April 14–15) showcased over 100 robots from four of the world's five leading humanoid makers, featuring boxing, performance, and rescue demonstrations.

  • Adobe GenStudio Gets Brand Intelligence and Supply-Chain Automation

    Adobe expanded GenStudio with Brand Intelligence and AI agents for content supply chains, automating the full planning-to-measurement workflow.

  • Kimi Code K2.6 preview launches 4/13

    Moonshot AI's dedicated coding model enters preview. Full API and benchmarks due late April/early May.

  • Stanford AI Index 2026: insider-to-public perception chasm

    GPT-5.4 hits 50% on clock reading; complex-reasoning benchmarks diverge sharply from everyday tasks.

  • Anthropic's "Claude Mythos" goes dark after 7-day ASL-4 trigger; thousands of zero-days auto-discovered

    Replaced by Claude Opus 4.7 launch decision on 4/16.

  • Data center capex surge reports mount

    AI News Today (4/13): mega-scale infrastructure expansion straining regional power grids.

  • Bluefish closes $43M Series B: AI search visibility metrics

    Threshold Ventures, NEA co-lead. Fortune 500 firms now KPI-track ChatGPT/Claude/Perplexity placement.

  • Rubin: Six Chips, One AI Supercomputer

    Vera CPU, Rubin GPU, NVLink 6, ConnectX-9, BlueField-4, Spectrum-6. Claims 10x token cost reduction, 4x MoE training efficiency. Vera Rubin full system production targeted H2 2026.

  • Rubin GPU Specs Locked

    336B transistors, 50 PFLOPS per GPU, 288 GB HBM4. Blackwell systems selling at ~$40K/GPU and clearing out through mid-year.

  • Apple, Gemini Partnership for Next-Gen Siri

    Google Gemini-powered, context-aware Siri launches H2 2026 with iOS 27. iPhone 17 captured 6% global single-model share in 1Q, rank #1 among handsets.

  • Claude Mythos Identifies Thousands of Zero-Days

    Anthropic disclosed that Claude Mythos discovered thousands of zero-day vulnerabilities in major systems, while flagging broad deployment risks.

  • Palo Alto CVE-2026-0300 Under Active Exploit

    Palo Alto Networks disclosed threat actor attempts targeting the critical vulnerability since April 9.

  • Anthropic Reaches $30B Run Rate; 1,000+ Million-Dollar Customers

    Anthropic's 2026 run rate has grown to $30 billionβ€”three times its end-2025 level. The company now counts over 1,000 enterprise customers with annual spend exceeding $1 million, doubling from 500 in February.

  • Frontier Model Forum Coordinates Against Model Copying

    On April 6, OpenAI, Anthropic, and Google announced a coordinated response framework through the Frontier Model Forum to counter Chinese competitors' attempts to extract advanced U.S. model outputs.

  • Nvidia Locks in TSMC CoWoS Capacity; TSMC Expands Fabs

    Nvidia has effectively cornered most of TSMC's state-of-the-art CoWoS capacity, with excess routed to ASE and Amkor. TSMC responds with concurrent fab expansions in Taiwan and Arizona to resolve the 'AI chip new bottleneck.'

  • Palo Alto Networks Reports CVE-2026-0300 Exploit Attempts

    Palo Alto Networks disclosed April 9 detection of exploitation attempts targeting CVE-2026-0300 (User-ID Authentication Portal buffer overflow vulnerability allowing unauthenticated RCE).

  • Vercel Security Incident Dominates Hacker News

    A Vercel security incident held sustained Hacker News frontpage visibility during April, refocusing attention on cloud infrastructure dependencies and supply-chain trust.

  • Google ADK, OpenAI Agents SDK compete for standard (April 9)

    Anthropic vs. OpenAI vs. Google agent standards war begins. Tool use, memory, MCP compatibility set forks.

  • Reddit sues Anthropic for unauthorized scraping

    Reddit accuses Anthropic of terms-of-service violations and commercial training without permission. Data licensing market reset looms.

  • CHATBOT Act introduced April 28 in U.S. Senate

    AI chatbot bans family accounts for minors, mandates baseline safety, prohibits targeted ads. Idaho, Oregon, Washington legislate preemptively.

  • TSMC FY26 capex $52-56 billion

    Leading-edge nodes and advanced packaging in focus. Underpins NVIDIA's fiscal 2026 revenue guidance of $215.9 billion (+65%).

  • Celltrion CT-P71 earns FDA Fast Track designation (April 9)

    Nectin-4-targeted antibody-drug conjugate for advanced urothelial cancer. Korean biosimilar company's ADC challenge lands accelerated review.

  • Anthropic ARR surpasses OpenAI at $30B vs $25B

    As of April 7, Anthropic eclipsed OpenAI for the first time. B2B enterprise revenue proved decisive.

  • Nvidia secures most of TSMC's advanced CoWoS capacity

    TSMC added two packaging plants to Arizona. Outsourcing to ASE and Amkor increasing. Blackwell is the first CoWoS-L production part.

  • Microsoft Fairwater AI data center operational in Wisconsin

    The $3.3B campus came online ahead of schedule, enabling hundreds of trillions of parameters in training with Blackwell GB200s.

  • Anthropic unveils 'Project Glasswing'

    Announced April 8; full details coming later.

  • ROMA: Recursive Open Meta-Agent framework released

    Breaks large tasks into subtask trees for parallel multi-agent execution. Posted to arXiv with long-horizon capabilities.

  • Nvidia Reserves Bulk of TSMC's CoWoS Advanced Packaging Capacity

    Nvidia locked up most of TSMC's CoWoS capacity, with CNBC flagging packaging as AI's next bottleneck on April 8.

  • Nvidia Explores Intel Packaging Partnership as Part of $5B Intel Investment

    Nvidia is evaluating Intel's packaging lines as a fallback option, tied to its $5 billion Intel investment pledge announced April 8.

  • Google Unveils TPU 8t and TPU 8i at Cloud Next, Taking on Nvidia and AMD

    Google unveiled two TPU modelsβ€”8t and 8iβ€”at Cloud Next 2026 in Las Vegas, throwing down a gauntlet against Nvidia and AMD.

  • OpenAI Signs $20B+ Chip Purchase Agreement with Cerebras

    OpenAI agreed to buy over $20 billion worth of chips from AI chip startup Cerebras, diversifying its supply chain away from sole dependence, Digitimes reported.

  • Big Tech 2026 Capex to Hit Record $725B as Memory and Chip Costs Mount

    Tom's Hardware projected Big Tech 2026 capital expenditures at a record $725 billion, driven by memory and chip cost inflation.

  • Zhipu AI GLM-5.1 released April 7 under MIT license

    40 billion active, 744 billion total MoE parameters; 200K context. Tuned for 8-hour autonomous engineering benchmarks.

  • Anthropic eyes $90 billion valuation with fresh capital

    Google poised to deploy additional $40 billion; Anthropic's annualized revenue trajectory $30 billion. May overtake OpenAI ($85.2B).

  • Bezos Project Prometheus: $10B in new funds at $38B valuation

    JPMorgan, BlackRock backing. Stealth AI manufacturing lab closes $10B raise in April. Cumulative funding crosses $16B.

  • OpenAI: $122 billion new capital; $85.2B valuation

    Round closed end of March; largest capital raise on record. Infrastructure and agent investment accelerate.

  • Meta debuts first major AI model post-$14B Alexander Wang deal

    April 8 model launch after $14B acquisition. Corporate agent platform competition heats up.

  • Frontier Model Forum blocks 24,000 spoofed accounts in first operation

    Anthropic documented that DeepSeek, Moonshot AI, and MiniMax attempted 16 million Claude calls through 24,000 fraudulent accounts. This marks the Frontier Model Forum's first external threat-intelligence sharing campaign.

  • TSMC 2nm and CoWoS fully booked through 2028

    TSMC's advanced 2nm and CoWoS packaging slots are fully reserved through 2028. Apple, Nvidia, AMD, and Qualcomm hold the premium slots; ASE and Amkor are taking overflow subcontracting.

  • Samsung 2nm reaches 50 percent yield ahead of mass production

    Digitimes: Samsung's 2nm nears H2 production with 50 percent yield and secured Tesla and Nvidia orders. Samsung aims to capture TSMC overflow from its Texas fab.

  • Anthropic confirms Claude Mythos remains private to 50 firms

    Anthropic verified that its most powerful model, Claude Mythos, stays walled within 50-company firewall and never reaches public availability.

  • Tesla releases V14.3 in Q1 update; Cybercab production ramping

    Tesla's Q1 update delivered V14.3 in April with reinforced learning and vision encoders, cutting inference latency by up to 20 percent. Driverless robotaxi service launched officially in Dallas and Houston in April.

  • Anthropic ships Claude Mythos Preview, April 7

    Mythos Preview identified thousands of zero-days across major OSes and browsers and independently discovered and exploited a 17-year-old FreeBSD RCE, Anthropic disclosed. CyberGym: 83.1%.

  • Project Glasswing launches with twelve partners

    Anthropic rolled out Project Glasswing to distribute Mythos Preview to fifty critical-infrastructure organizations. AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks signed as launch partners.

  • WhatLLM: April model roundup

    WhatLLM framed April's AI trend as 'Anthropic Won't Ship Its Best. Open Source Will,' capturing the acceleration in the open-source camp alongside Mythos's gating strategy.

  • Nextgov: OpenAI gates frontier model to cyber defenders

    Nextgov reported OpenAI using the same gating playbookβ€”frontier-model access to core cyber-defense staff. Both Anthropic and OpenAI shipped restricted-cyber-AI in seven days.

  • Exabeam: what Mythos and Glasswing signal for infosec

    Exabeam flagged Mythos and Glasswing as possible operating standards for 'Secure AI'β€”a shift toward requiring autonomous vulnerability discovery as a default for infrastructure vendors.

  • Anthropic posts $30B ARR, inks 3.5GW compute with Google/Broadcom

    Anthropic hit $30B ARR and sealed a 3.5GW compute supply partnership with Google and Broadcom. The milestone marks the first time the firm surpassed OpenAI (around $25B) in annual revenue.

  • Anthropic unveils security-focused model on April 8

    Anthropic on April 8 released a dedicated model for spotting and fixing software vulnerabilities. OpenAI followed suit within a week, unveiling its own cyber-specialized model.

  • Apple approves Nvidia GPU drivers for Mac

    Apple on April 8 approved third-party drivers letting Nvidia GPUs run on Apple Silicon Macs, enabling direct eGPU support for AI workloads without security workarounds.

  • BlueHammer PoC hits GitHub after Defender flaw exposed

    A proof-of-concept for a Microsoft Defender privilege-escalation bug went public on GitHub April 3. A standard-privilege local attacker could escalate to SYSTEM before Microsoft's patch cycle.

  • Flagship model lineup: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Pro

    As of early April, public flagship models included OpenAI's GPT-5.4 Thinking, Anthropic's Claude Sonnet 4.6, Google's Gemini 3.1 Pro, and xAI's Grok 4.20 Beta 2. No major release blitz occurred early April.

  • Claude Code source spillage: 513k lines of TypeScript exposed

    March 31: npm v2.1.88 .map file leaks 1,906 files, ~513k lines. April 1 Anthropic mass GitHub takedown attempt; later narrowed to single repo, 96 forks.

  • Cursor 3 agent-first IDE lands as Claude Code rival

    Cursor 3 April 2 debut marks Agents Window shift; codename Glass; direct Claude Code and Codex competitor.

  • OpenAI acquires tech talk show TBPN in narrative-shaping move

    April 2: OpenAI buys TBPN (Technology Business Programming Network) for hundreds of millions, placing it under AI policy chief Chris Lehane. Editorial independence pledged.

  • Microsoft plants $10B AI flag in Japan through 2029

    April 3: Microsoft pledges 1.6 trillion yen ($10B) for Japan AI infra, cybersecurity, talentβ€”3.4x its April 2024 commitment. Partners SoftBank, Sakura Internet; aims 1M+ trained workers by 2030.

  • Nvidia tackles PC gaming shader compile bottleneck with new app

    April 3: Nvidia unveils shader pre-compile app tackling idle-time wait loops in PC games.

  • Flagship models collide: GPT-5.4 Thinking, Claude Opus 4.7, Gemini 3.1 Pro all live

    As of April 1–2, OpenAI's GPT-5.4 Thinking, Anthropic's Claude Opus 4.7, and Google's Gemini 3.1 Pro stand as the latest public flagship models. Anthropic also launched a cybersecurity-focused "Claude Mythos Preview" for select customers.

  • Google commits up to $40B to Anthropic; valuation jumps to $35B

    Google will invest up to $40 billion in Anthropic: $10 billion immediate at $35 billion post-money, plus $30 billion in performance-based tranches. Amazon adds $5 billion; Anthropic commits $100 billion toward 5GW of compute.

  • Anthropic, Broadcom, Google ink multi-GW TPU partnership starting 2027

    Anthropic, Google, and Broadcom announced a multi-gigawatt TPU compute partnership launching 2027.

  • Nvidia locks down most TSMC CoWoS-L packaging; farm work to ASE, Amkor

    Nvidia's Blackwell GPUs apply TSMC's latest CoWoS-L packaging, with most capacity booked. TSMC outsources overflow to ASE and Amkor as secondary providers.

  • Apple Business launches April 14β€”free MDM, email, zero-touch deploy vs. Microsoft, Google

    Apple Business (announced March 24) ships April 14 as a unified portal with free MDM, biz email, and zero-touch rolloutβ€”tackling Microsoft 365 and Google Workspace.

  • OpenAI $12.2B, Anthropic $3B, xAI $2B, Waymo $1.6B β€” record mega-rounds

    Crunchbase data: four mega-deals captured $18.8 billion (65%) of Q1's $300 billion global VC, with OpenAI, Anthropic, xAI, and Waymo leading the charge.

  • TSMC 2nm and 3nm full reservation through 2028; Samsung 2nm hits 50% yield

    TSMC's cutting-edge nodes sold out through 2028, while Samsung's 2nm process achieves 50% yield pre-production, capturing Tesla and Nvidia overflow demand.

  • Anthropic MCP crosses 97 million installs β€” agent standard emerging

    Anthropic's Model Context Protocol clears 97 million cumulative installs as of March 2026, cementing it as foundational to agent infrastructure.

  • AI seed startup valuations hit all-time highs

    TechCrunch reports AI seed round valuations have moved beyond perception to statistical fact, with across-the-board valuation increases.

  • Oracle's 30,000-person cut frees $8-10 billion for AI data center bet

    Bloomberg and CNBC report Oracle's March 31 layoff purges roughly 30,000 headcount and unlocks $8-10 billion in annual cash flow redirected to OpenAI infrastructure partnerships and other AI data center buildout.

What you're reading now arrives in your inbox daily at 21:00 UTC.

Get today's brief every day at 21:00 UTC

7 must-reads Β· 17 fields Β· tracked storylines delivered to your inbox daily. Pick only the fields you want; unsubscribe anytime.

Past issues β†’