Lookonchain APP

App Store

Citi: AI Inference Demand Remains Tight, Bottleneck Shifting From Chips to Power and Data Centers

2 hours ago

June 16, Citigroup noted that demand for AI inference remains robust, with a shortage of computing power spreading from the latest generation of chips to prior GPUs, forcing model vendors to accelerate monetization through tactics like dynamic pricing, usage quotas, and routing mechanisms. In a June 14 report, analysts including Heath Terry found that rental rates for A100 GPUs rose 0.6% over the past week, and 11% cumulatively over six weeks—signaling AI computing demand isn’t limited to only cutting-edge hardware. The firm added that some state-of-the-art models have lifted prices sharply after improving their intelligence benchmarks, with Citigroup observing that “scarcity is being monetized faster than it’s being solved.” The report also highlighted that no model vendor currently boasts all three key advantages: intelligence, speed, and affordability. Top-tier models have seen their intelligence scores climb roughly 4 points, yet their overall prices have nearly doubled. Meanwhile, mid-range models have made gains in speed, with the median output rate of the top 20 models jumping from 64 tokens per second to 105 tokens/s over six weeks. Additionally, the gap in capabilities between closed-source and open-source models is widening: proprietary models now lead open-source ones by around 10 points in intelligence scores, up from approximately 6 points previously. This has led top model providers to prioritize locking in the high-end market with superior performance, rather than competing on price against open-source alternatives. Beyond computing power, electricity costs and data center location are emerging as new constraints on AI expansion. The firm cited a private neocloud that has signed contracts for 4.9GW of demand, though its planned pipeline exceeds 40GW—revealing a massive chasm between surging demand and available supply. Data centers tend to be sited in regions with electricity prices of 9–12 cents per kilowatt-hour, while the share of renewable energy and long-term power purchase agreements also influence location choices. Looking ahead, Citigroup projects AI infrastructure costs will continue to climb. Rising component prices, power access fees, and upfront infrastructure investments are driving higher capital expenditure (capex) for equivalent compute power—such as H100 GPUs—with electricity costs shifting from operational expenses to upfront capital outlays. The next wave of value may flow to the “inference routing layer”: a platform that automates choosing the right model, quantization method, and hardware for individual tasks to lower inference costs and boost efficiency. However, implementation faces hurdles like enterprise data security, intellectual property concerns, and privacy protection. Across the broader AI ecosystem, the report notes growth isn’t limited to GPUs alone—it extends to data centers, power, optical communications, cloud infrastructure, and model applications. Citigroup named relevant coverage targets in its appendix: Ciena, Lumentum, and MiniMax, demonstrating that the AI inference cycle is expanding from chips to a wider range of infrastructure and application segments.
Relevant content

The largest ETH short seller, known as “pension-usdt.eth,” has initiated a liquidation event, giving back $5.9 million in unrealized gains due to a price rebound.

June 16 – Per monitoring from HyperInsight (https://t.me/HyperInsight), the largest Ethereum (ETH) short position on Hyperliquid, held by wallet "pension-usdt.eth," has taken a $5.9 million hit over the past week amid ETH’s recent sustained rally. The address just initiated profit-taking for the first time in 30 minutes. The wallet earlier shorted 60,000 ETH using 3x leverage, with an open position valued at roughly $107 million and an average entry price of $1,810. As of press time, it has closed around 2,135 of its ETH short positions, leaving a remaining stake that currently holds $1.24 million in profit. Wallet address: 0x0ddf9bae2af4b874b96d287a5ad42eb47138a902

4 minutes ago

HIP-3 US Stock Gainers: SPCX Leads Gains, Storage Semiconductor Sector Shows Strength

**Date: June 16** Per Hyperinsight Monitoring (https://t.me/HyperInsight), as of 18:00 on June 16, in Hyperliquid’s HIP-3 U.S. Stock Market, SPCX (SpaceX) paced all gainers with a 7.25% 12-hour jump, trading at $211.39 at press time. Its 24-hour trading volume hit $1.129 billion. Among the top five gainers, four came from the storage and chip sector: Western Digital (WDC) climbed 5.20% over 12 hours to take second place; Micron Technology (MU) saw a 3.34% 12-hour gain and a 3.62% pre-market advance, leading pre-market trading; the DRAM spot index rose 1.98%; SK Hynix (SKHX) added 1.77%, showing broad momentum across the storage industry chain.

4 minutes ago

U.S. Pre-market: Cryptocurrency Stocks Mixed, PURR Rises Over 11%

June 16: Pre-market trading in U.S. crypto-related stocks was mixed, according to Bitget market data. Here are the key movements for major tickers: - MSTR: +0.66% - COIN: +0.48% - CRCL: +1.20% - SBET: -0.17% - BMNR: -0.41% - HOOD: +1.70% - PURR (HYPE Treasury Company): +11.24%

4 minutes ago

Tether Signs Memorandum of Understanding with Dubai Multi Commodities Centre to Advance Blockchain Education and Asset Tokenization

June 16 — Stablecoin issuer Tether has entered into a partnership memorandum of understanding (MoU) with the Dubai Multi Commodities Centre (DMCC) to explore collaboration on three key areas: tokenization, digital asset education, and blockchain applications. DMCC, one of the world’s leading commercial hubs, hosts more than 26,000 businesses that make up 15% of Dubai’s total foreign direct investment (FDI). Under the MoU, Tether will deliver specialized workshops, blockchain advisory services, and digital asset pilot projects to companies in DMCC’s network. The pair will also look to enhance the DMCC community experience via a peer-to-peer digital payment system, and support knowledge sharing and innovation at the DMCC Crypto Center — including co-hosting events and hackathons. Tether CEO Paolo Ardoino noted that the UAE is actively integrating digital asset infrastructure into the global market and real economy. He added that this partnership aims to accelerate the practical applica

4 minutes ago

Bitunix Analyst: Market Starts Trading Peace Dividend, but the Truly Revalued are Global Funding Cost and Liquidity Order

June 16. The market’s focus has gradually shifted from the Middle East conflict itself to fund reallocations following the peace agreement. As Trump arrives in Europe for the G7 summit, both the U.S. and Iran have confirmed they will sign a memorandum of understanding on the 19th, with progress ongoing to reopen the Strait of Hormuz. But per Mitsui O.S.K. Lines— the world’s largest oil tanker operator— the market no longer cares if the deal gets signed; its attention is now on whether shipping, insurance, and the energy supply chain can return to normal operations. This explains why oil prices have fallen recently, stocks have climbed, but shipping companies remain cautious: risk events are receding, yet the risk premium hasn’t fully disappeared. From a macro perspective, global markets are grappling with three intersecting capital trends. First, inflation expectations are adjusting amid lower energy risk. If the U.S.-Iran agreement is implemented smoothly, it will help cut energy an

4 minutes ago

Binance Extends Zero-Fee Promotion for Bank Three-Party Custody Service Until End of 2026, Introduces New Tiered Pricing Starting in 2027

**Binance Extends Zero-Fee Custody Promotion for Institutional/VIP Clients Through 2026** On June 16, Binance announced it is extending its zero-fee promotion for bank-backed third-party custody services for institutional and VIP clients, pushing the effective deadline roughly six months beyond its original schedule to December 31, 2026. Starting January 1, 2027, the platform will roll out a new tiered fee structure, with all fee tiers receiving a 2.5% reduction from prior rates. The annual percentage rates (APR) will be tied to clients’ pledged collateral levels as follows: 1.000% for collateral up to $50 million; 0.500% for $50 million to $100 million; 0.250% for $100 million to $500 million; and 0.100% for collateral exceeding $500 million. Binance’s bank third-party custody service launched in November 2023 as the world’s first crypto exchange-led third-party custody offering. The service lets institutional clients deposit collateral (including fiat, U.S. Treasury bonds, Ba

4 minutes ago