AI Drug Discovery in China

An Integrated Overview · 2015–2026
0+
Active Companies
$0B+
Total Funding
0
“Little Dragons”
0
Filed Patents
Section I

Executive Summary 药智

China's AI drug discovery sector has matured from speculative academic experiments into a clinically validated industry generating multi-billion-dollar licensing deals with global pharmaceutical companies. Between 2014 and 2026, more than 60 firms entered the space, collectively raising over $5 billion. A handful survived the funding winter of 2023–2024. Those that did now hold real clinical assets and real pharma partnerships.


The industry's structure is top-heavy. One company accounts for roughly 80% of end-to-end AI-discovered clinical candidates originating from China-based operations. The remaining 20% is split across a small group of firms with narrow specialisations: crystal prediction, delivery optimisation, or protein design. Pure-play software vendors without proprietary pipelines have largely been absorbed, pivoted, or shut down.


Three forces shaped the current landscape. First, the State Council's 2017 AI Development Plan directed both state and private capital into the sector. Second, China's dense CRO infrastructure – WuXi AppTec, Pharmaron, ChemPartner – gave AI companies a wet-lab validation layer that no other country could match at equivalent speed and cost. Third, the HKEX listing reforms (Chapters 18A and 18C) provided a viable public exit at a time when NASDAQ access had effectively closed for Chinese biotechs.


Geopolitics now overshadows everything else. The BIOSECURE Act and broader US–China decoupling have forced companies into "bifurcation strategies," maintaining separate corporate structures for domestic and international operations. Data sovereignty rules – the Personal Information Protection Law (PIPL) and Human Genetic Resources (HGR) regulations – make cross-border model training complex. Companies that planned for this early hold a structural advantage.


Looking forward, the sector will consolidate further. Cash-rich traditional pharma (Hengrui, CSPC, Qilu) will acquire distressed AI startups. Provincial governments – Beijing's RMB 10 billion AI life sciences fund, Shanghai's Zhangjiang subsidies, Suzhou's BioBAY incentives – will keep capital flowing to survivors. By 2028, the CDE may mandate AI-driven toxicity models as standard IND components. The question is no longer whether AI drug discovery works. It does. The question is which companies can turn algorithmic output into approved medicines.


Methodology

This report draws on HKEX filing documents, CDE regulatory publications, peer-reviewed papers (Nature, Science, Cell Systems, JCIM, and others), deal announcements from company press releases, patent databases (CNIPA, WIPO), and public financial disclosures. Company-specific data was cross-referenced against at least two independent sources where possible. "AI-discovered" is defined as molecules where computational methods played a primary role in either target identification, molecular design, or both. Revenue and deal figures are denominated in USD unless stated otherwise.

Key Findings at a Glance

80%+
of end-to-end AI-discovered clinical candidates from China trace to a single company
$5.5B+
in combined biobucks from the market leader's licensing deals alone
2
AI drug discovery companies listed on HKEX (3696.HK and 02228.HK)
RMB 14B+
in announced provincial/municipal government funding for AI life sciences (2023–2030)
Section II

How the Industry Works 创药

Model A: Full-Stack Biotech

AI discovers both the target and the molecule. The company owns the asset, runs IND-enabling studies, and enters clinical trials. Revenue comes from licensing deals with global pharma or, ultimately, from direct commercialisation. This model requires the heaviest capital expenditure but generates the highest-value assets.


Examples: Insilico Medicine, Accutar Biotech, MindRank AI

Model B: Platform + Services

The company builds AI platforms and monetises them through fee-for-service contracts, SaaS licensing, or co-development deals with pharma. Internal assets may exist but take a back seat to revenue from external clients. This model is capital-efficient but faces commoditisation pressure.


Examples: XtalPi/QuantumPharm, DP Technology, Deep Intelligent Pharma

Model C: Hybrid / Asset-Acquirer

The company uses AI for specific steps (lead optimisation, delivery, formulation) while acquiring or in-licensing clinical assets through traditional routes. AI is a capability layer, not the sole engine. This model offers faster time-to-clinic but raises questions about the true role of AI in the pipeline.


Examples: Earendil (Helixon), METiS Therapeutics, BioMap

Section III

Company Rankings 智能

Rank Company Founded HQ Stage Key Focus Notable Deal Funding Status
1Insilico Medicine2014HK / ShanghaiClinicalEnd-to-end AIDD, fibrosis, oncologyLilly $2.75B$400M+ (IPO $292M)Public (3696.HK)
2XtalPi (QuantumPharm)2014ShenzhenPlatformCrystal prediction, robotic labsAilux-Lilly $345M$730M+ (IPO)Public (02228.HK)
3DP Technology2018BeijingPlatformMolecular dynamics, Uni-Fold, DeePMDUni-Fold open source, Hermite$100M+Active
4BioMap (百图生科)2020BeijingPreclinicalFoundation models, biologicsSanofi $1B+$500M+Active
5Earendil (Helixon)2021Beijing / USClinicalAntibody design, biologicsSanofi partnership$787M*Active*
6Accutar Biotech2017ShanghaiClinicalPROTACs, targeted degradersAC0676 IND$100M+Active
7MindRank AI2019HangzhouClinicalSmall molecule, GLP-1 oralMDR-001 Phase I$80M+Active
8neoX Biotech2020ShanghaiPreclinicalPPI prediction, biologicsUndisclosed pharma$50M+Active
9METiS Therapeutics2019Cambridge / CNClinicalLNP delivery, formulationAstraZeneca collab.$200M+Active
10AlphaMa2019ShanghaiPreclinicalDEL + AI integrationMultiple pharma$40M+Active
11Singlomics2018BeijingPlatformSingle-cell AI, target IDBiotech partnerships$30M+Active
12Deep Intelligent Pharma2017BeijingPlatformClinical trial optimisationCRO integrations$25M+Active
13Galixir2019BeijingPreclinicalRetrosynthesis, gen. chem.-$30M+Faded
14StoneWise (智石)2018BeijingPlatformKnowledge graphs, gen. chem.Huawei Cloud, Chipscreen$100M+Pivoted
15Ailux Biologics2021ShenzhenPreclinicalAI biologics (XtalPi sub.)Lilly $345MXtalPi-fundedActive
16Signet Therapeutics2021ShanghaiClinicalOncology (XtalPi spin-off)SIGX1094R trialsXtalPi-fundedActive
17Xbiome2018ShenzhenClinicalAI + microbiome therapeuticsFMT Phase II$100M+Active
18Standigm2015Seoul / ShanghaiClinicalDrug repositioning, fibrosisPhase I assets$50M+Active
19Nutshell Biotech2019ShanghaiPreclinicalSynthetic biology + AI-$15M+Active
20DeuteRx2020SuzhouPreclinicalDeuterated drugs + AI-$20M+Active
21Baidu Bio (PaddleHelix)2020BeijingPlatformmRNA design, LinearDesignInternal / infraBaidu-fundedScaled back
22Tencent AI Lab (iDrug)2018ShenzhenPlatformMol. generation, screeningStrategic investmentsTencent-fundedActive
23Alibaba DAMO2017HangzhouPlatformAntiviral discoveryInternalAlibaba-fundedScaled back
24Huawei Cloud (Pangu)2021ShenzhenPlatformPangu Drug Molecule ModelStoneWise, pharmaHuawei-fundedActive
25ByteDance AI Lab2020BeijingPlatformProtein structure, LLMsInternal researchByteDance-fundedRetreated
26Shanghai AI Lab2021ShanghaiPlatformChemLLM, foundation modelsOpen-source toolsState-fundedActive
27CIMM (CAS)-ShanghaiPlatformCADD, virtual screeningAcademic collaborationsState-fundedActive
28WuXi AppTec (AI unit)2018ShanghaiPlatformDEL + AI screeningInternal integrationWuXi-fundedRestructuring
29Fosun Pharma (AI div.)2020ShanghaiClinicalCo-development (QPCTL)Insilico partnershipFosun-fundedActive
30Chipscreen Biosciences2001ShenzhenClinicalAI-augmented oncologyStoneWise collab.Public (688321.SS)Active
31Anew Biotech2020SuzhouPreclinicalAI + synthetic biology-$10M+Active
32Pharmaron (AI unit)2020BeijingPlatformAI-enabled CRO servicesAIDD startup partnershipsPharmaron-fundedActive

* Earendil's $787M fundraise is disputed by industry observers. Assets reportedly obtained via traditional discovery routes. Click column headers to sort.

Funding by Category

Full-Stack Biotechs
$1.5B+
Insilico, Accutar, MindRank, Earendil*
Platform / Services
$1.2B+
XtalPi, DP Tech, StoneWise, DIP
Big Tech + State
$2B+
BioMap, Baidu, Tencent, gov. funds

Geographic Distribution

Shanghai
14
companies HQ'd or with major R&D
Beijing
10
strong in academic + Big Tech
Shenzhen
5
XtalPi, Tencent, Huawei, Chipscreen
Suzhou
4
BioBAY hub + automated labs
Section IV

The Four “Little Dragons” 创新

By early 2026, four companies had separated from the pack through a combination of clinical assets, major pharma validation, and public market access. Industry observers began calling them the "Four Little Dragons" of Chinese AI drug discovery.

Insilico Medicine

HKEX: 3696 · Founded 2014 · Full-Stack Biotech

The clear market leader. 40+ programs, 12 IND clearances, 7 clinical-stage assets. Deals with Lilly ($2.75B), Sanofi ($1.2B), Servier ($888M), Menarini ($550M+), Exelixis, Qilu, and Fosun. IPO raised HK$2.277B in December 2025. Revenue of $85.8M (2024) and $56.2M (2025). Lead asset Rentosertib completed Phase IIa for IPF with results published in Nature Medicine.

XtalPi (QuantumPharm)

HKEX: 02228 · Founded 2014 · Platform + Robotics

Founded by three MIT quantum physicists. Revenue of 802.6M CNY (+201% YoY). Operates 24/7 robotic "self-driving" wet labs in Shenzhen and Shanghai. Subsidiary Ailux Biologics secured a $345M deal with Lilly for bispecific antibodies. Signet Therapeutics spin-off advancing SIGX1094R in clinical trials. Over 1,000 employees globally.

Earendil (Helixon)

Private · Founded ~2021 · Hybrid Biotech

Raised $787M in a mega-round – a figure disputed by industry observers. Founded by Jian Peng (UIUC) and Zhenping Zhu (ex-3SBio). Partnered with Sanofi on antibody design. Strategy has shifted from AI-native discovery to acquiring clinical-stage assets through traditional routes, raising questions about how much of the pipeline is genuinely AI-generated. OmegaFold paper was a notable scientific contribution.

METiS Therapeutics

Private · Founded 2019 · RNA Delivery + AI

Spun out of MIT. Platforms include AiLNP (lipid nanoparticle optimisation), AiRNA (RNA sequence design), and AiTEM (target evaluation). AstraZeneca collaboration on LNP formulations for chronic diseases. Lead asset MTS-004 has advanced to Phase III trials, though independent verification of AI-native origin is limited. Backed by PICC Capital, China Life, and Sequoia China.

Section V

Industry Timeline 发现

2014
Founders Arrive
Insilico Medicine founded in Baltimore, Maryland (Johns Hopkins ecosystem). XtalPi founded by three MIT quantum physicists. Both begin exploring AI applications in drug discovery.
2015–2016
Academic Foundations
CAS, Peking University, and Tsinghua begin publishing deep learning work on QSAR and molecular property prediction. BGI integrates early ML for multi-omics target identification.
2017
State Council AI Plan
The “New Generation AI Development Plan” (Guo Fa [2017] No. 35) names healthcare and drug discovery as strategic national priorities. Private and state VC floods into the sector.
2018
China Expansion Wave
Insilico enters China via JLabs, Roche incubator, and Lilly Gateway Labs. Dozens of AI biotech startups founded or funded. Zhang Linfeng publishes DeePMD (Gordon Bell Prize). StoneWise raises early mega-rounds from Tencent.
2019
GENTRL: 46-Day Molecule
Insilico publishes “Deep learning enables rapid identification of potent DDR1 kinase inhibitors" in Nature Biotechnology. A novel inhibitor designed by AI, synthesised, and validated in mice in 46 days. The paper becomes a global catalyst for the field.
2020
Peak Funding / COVID Accelerant
XtalPi raises $318M Series C. BioMap founded by Robin Li (Baidu). COVID-19 accelerates interest in rapid AI-driven drug and vaccine design. Insilico advances ISM3312 (3CL protease inhibitor).
2021
Unicorn Factory
XtalPi raises $380M Series D (unicorn status). PIPL enacted (Nov 1), reshaping data governance for biotech. Helixon (later Earendil) founded. Massive capital deployment across the sector.
2022
First Clinical Milestones
Insilico's ISM001-055 (Rentosertib) enters Phase I – the first end-to-end AI-discovered target and molecule in human trials. Sanofi signs $1.2B deal with Insilico ($21.5M upfront, Nov 2022). OmegaFold paper published by Helixon. CDE begins drafting AI guidance.
2023
Reality Check
CDE releases draft “Guiding Principles for AI in Drug R&D" (August). HGR Implementation Rules take effect (July). Funding winter begins – pure-play software companies struggle to raise. Insilico builds automated facility in Suzhou (Life Star 1).
2024
Consolidation Year
CDE finalises AI guidance (March). XtalPi IPOs on HKEX as QuantumPharm (02228, June). StoneWise pivots. Galixir fades. Insilico reports $85.8M revenue. Lilly signs $2.75B expanded collaboration. BIOSECURE Act accelerates bifurcation strategies.
2025
IPO Landmark
Insilico IPOs on HKEX Main Board (Dec 30, 3696.HK), raising HK$2.277B (~$292M) at HK$24.05/share. Menarini signs second deal ($550M+, January). Rentosertib Phase IIa results published in Nature Medicine.
2026 (H1)
Deal Acceleration
Servier signs $888M deal with Insilico (Jan 4, $32M upfront). Qilu signs $120M partnership (Jan 27). Beijing announces RMB 10B AI life sciences fund. BioMap–Sanofi $1B+ mega-deal. Sector enters mature phase.
2026–2030
Projected Consolidation
Traditional pharma (Hengrui, CSPC) expected to acquire distressed AI startups. CDE may mandate AI toxicity models for INDs by 2028. HKEX remains primary exit route. Middle East sovereign wealth enters as alternative capital source.
Section V-B

Funding History 创药

Capital deployment into Chinese AI drug discovery followed a classic hype-cycle pattern: academic curiosity (2014–2016), early venture bets (2017–2018), euphoric mega-rounds (2019–2021), harsh correction (2022–2024), and selective recovery (2025+).

Funding Phases

Phase 1: 2014–2016
Seed & Exploration
Small rounds ($1M–15M). Investors: Tencent, FreeS Fund, angel investors with biotech backgrounds. Companies funded on algorithm promise alone. Total sector investment: ~$50M.
Phase 2: 2017–2018
Policy-Driven Acceleration
State Council AI Plan triggers massive VC interest. Sequoia China, IDG Capital, and Yunfeng Capital enter the space. Series A/B rounds reach $15M–50M. Total sector investment: ~$500M.
Phase 3: 2019–2021
Peak Euphoria
Mega-rounds dominate. XtalPi raises $318M (Series C) and $380M (Series D). SoftBank Vision Fund, OrbiMed, and PICC Capital deploy heavily. Earendil raises $787M (disputed). Total sector investment: ~$3B+. Valuations detach from fundamentals.
Phase 4: 2022–2024
Correction & Consolidation
Generalist VCs retreat. Down rounds, restructuring, and quiet shutdowns. Funding shifts to specialised healthcare funds (Qiming, HongShan/Sequoia China, Eight Roads) and government guidance funds. Pure-play software companies struggle. Total new investment: ~$800M (significant decline).
Phase 5: 2025–Present
Selective Recovery
IPO proceeds (XtalPi HK$989M, Insilico HK$2.277B) replace venture funding as primary capital source. Government funds (Beijing RMB 10B) provide runway for survivors. Middle East sovereign wealth enters. Funding now demands clinical assets or validated pharma partnerships.

Investor Landscape (2025–2026)

Still Active

  • Qiming Venture Partners (healthcare focus)
  • HongShan (ex-Sequoia China)
  • Eight Roads Ventures
  • HKIC (Hong Kong Investment Corp.)
  • Government Guidance Funds (Beijing, Shanghai, Shenzhen)
  • Middle East SWFs (Mubadala, PIF)

Retreated / Cautious

  • SoftBank Vision Fund
  • Generalist US VCs (Coatue, Tiger Global)
  • Crossover funds (affected by BIOSECURE)
  • Most Chinese generalist VCs
  • Corporate VCs (except pharma strategics)
  • Crypto-adjacent funds (fully exited)
Section VI

Market Leader Analysis 智药

One company dominates the sector by every quantifiable metric: clinical assets, deal value, revenue, and patent filings. The analysis below presents the data as it stands.

Major Licensing Deals

PartnerDateTotal ValueUpfrontTherapeutic AreaStatus
Eli LillyAug 2024 (expanded)$2.75BUndisclosedMultiple (oral therapeutics)Active
SanofiNov 8, 2022$1.2B$21.5MMultiple (6 targets)Active
ServierJan 4, 2026$888M$32MOncologyActive
Menarini/StemlineJan 2025$550M+UndisclosedOncologyActive
Qilu PharmaceuticalJan 27, 2026~$120MUndisclosedCardiometabolicActive
Exelixis2024Milestones~$80MOncology (USP1)Active
Fosun Pharma2024–2025Milestones$13MImmuno-oncology (QPCTL)Active
Combined Deal Value$5.5B+ in total biobucks

Pharma.AI Platform Architecture

Biology42 (PandaOmics)

Multi-omics target discovery. NLP-driven target scoring across disease associations, pathways, and publications.

Chemistry42

Generative chemistry engine. Reinforcement learning designs novel molecules with specified drug-like properties from scratch.

InClinico

Clinical trial outcome prediction. Optimises trial design, patient selection, and endpoint strategy to reduce late-stage attrition.

↓ Integrated with Suzhou robotics lab (Life Star 1) + Shanghai chemistry teams ↓

Platform Deep Dive

Biology42 (PandaOmics)

Multi-omics target discovery engine. Analyses transcriptomic, proteomic, and epigenomic datasets to identify novel targets. Uses advanced NLP to score target-disease associations across millions of publications and patent filings. Generates target hypotheses ranked by druggability, novelty, and clinical feasibility. The platform identified TNIK as a novel IPF target – a discovery later validated in Phase IIa trials. It also identified USP1 for oncology, QPCTL for immuno-oncology, and PHD1/2 for IBD.

Chemistry42

Generative chemistry engine built on reinforcement learning and transformer architectures. Designs novel small molecules from scratch with specified drug-like properties (potency, selectivity, ADMET, synthesisability). The engine generates thousands of candidate structures per run, filtered through multi-parameter optimisation. The 2019 Nature Biotechnology paper demonstrated the engine: a novel DDR1 inhibitor was generated, synthesised, and validated in mice within 46 days.

InClinico

Clinical trial prediction and optimisation platform. Predicts Phase II and Phase III success probabilities based on target biology, compound properties, trial design, and historical data. Helps optimise patient selection, endpoint strategy, and dosing to reduce late-stage attrition. Used internally to prioritise pipeline programs and externally as a partnership tool.

Life Star 1 (Suzhou Robotics Lab)

Fully automated, AI-driven robotics laboratory. Performs target validation, assay development, compound screening, and hit-to-lead optimisation without human intervention. Experimental data feeds continuously back into the AI platform, creating a closed-loop system. Launched 2022–2023. Represents the physical embodiment of the "self-driving lab" concept.

Target Portfolio Diversity

Oncology

  • USP1 (ISM3091) – Exelixis
  • KAT6A (ISM4312A) – Menarini
  • QPCTL (ISM8207) – Fosun
  • MAT2A – Internal
  • TEAD – Internal
  • WRN – Internal
  • ENPP1 – Internal

Fibrosis & Inflammation

  • TNIK (ISM001-055) – Lead asset
  • PHD1/2 (ISM5411) – IBD

Other

  • Cardiometabolic – Qilu partnership
  • Multiple – Lilly collaboration
  • Multiple – Sanofi collaboration
  • Oncology – Servier collaboration
  • 3CL Protease (ISM3312) – COVID

Clinical Pipeline (Selected Assets)

Rentosertib (TNIK/IPF)
Disc.
Pre
Ph I
Ph IIa ✓
ISM3091 (USP1/Onc.)
Disc.
Pre
Ph I
ISM5411 (PHD/IBD)
Disc.
Pre
Ph I
ISM8207 (QPCTL/IO)
Disc.
Pre
Ph I
ISM4312A (KAT6A/BC)
Disc.
Pre
+35 additional programs
Discovery – IND

40+ programs total. 12 IND clearances. 7 clinical-stage assets. 24 publicly disclosed.

The 80% Question

When counting only molecules where both the target and the compound were identified or generated by AI – and subsequently entered human clinical trials – one company accounts for roughly 80% of the total output from China-based AI drug discovery operations. This concentration is a function of three factors:


1. Integration depth. Most competitors sell software or services. This company owns the entire chain: target identification, molecule generation, synthesis, assay, IND-enabling studies, and clinical execution.


2. CRO infrastructure. Early and deep integration with China's CRO networks (WuXi, Pharmaron) created a wet-lab feedback loop that pure software companies could not replicate.


3. Capital efficiency. Revenue from licensing deals funded further pipeline expansion, creating a virtuous cycle. Competitors relying solely on venture capital could not sustain the burn rate needed to push multiple assets into clinic simultaneously.


This dominance is not guaranteed to persist. XtalPi's robotic labs and BioMap's foundation models represent credible alternative approaches. But as of mid-2026, the clinical scoreboard is lopsided.

Financial Overview

2024 Revenue
$85.8M
2025 Revenue
$56.2M
-34.5% YoY (lower upfront payments)

IPO: HKEX Main Board, Dec 30 2025, ticker HKEX: 03696. Raised HK$2.277B (~$292M) at HK$24.05/share. First AI-driven biotech to list under Chapter 8.05 listing rules.

China Operations Build-Out

2018 – Initial Entry
Entered China via Johnson & Johnson JLabs, followed by Roche incubator and Lilly Gateway Labs. Recognised the CRO infrastructure advantage early.
2019–2020 – Shanghai Innovation Hub
Established central R&D headquarters in Shanghai. Recruited hundreds of computational biologists, chemists, and drug hunters. Key hire: Dr. Feng Ren (ex-GSK China, ex-Medicilon SVP) as CSO, later Co-CEO.
2022–2023 – Suzhou Automation (Life Star 1)
Launched fully automated, AI-driven robotics laboratory in Suzhou. Automated target validation, assay development, and compound screening. Continuous data feedback into Pharma.AI platform.
2024–2026 – Scale & Public Markets
Expanded to 40+ programs. Revenue reached $85.8M (2024). HKEX IPO (Dec 2025). Established as the sector's only full-stack, publicly listed, clinical-stage AI biotech with China R&D operations at industrial scale.

Competitive Positioning Matrix

DimensionMarket LeaderXtalPiBioMapEarendilMETiS
AI-discovered INDs12+1–2 (via subs)0Disputed1–2
Clinical-stage assets71 (SIGX1094R)0Acquired1 (MTS-004)
Largest deal (biobucks)$2.75B$345M$1B+UndisclosedUndisclosed
Public listingHKEX 3696HKEX 02228NoNoNo
Revenue (latest)$56.2M802.6M CNYEarlyPre-revenuePre-revenue
Employees (est.)600+1,000+300+200+100+
Business modelFull-stack biotechPlatform + CROFoundation modelHybridDelivery AI
Section VII

Cautionary Tales 发现

StoneWise (智石医药) – The Pivot

Raised over $100M early, backed by Tencent. Built an end-to-end AI drug discovery engine with strong algorithmic credentials (Lingo3DMol paper). But pure computational milestones failed to impress later-stage investors when clinical assets did not materialise. Partnered with Huawei Cloud (Pangu Drug Molecule Model) and Chipscreen Biosciences, but by 2024 had been forced into a strategic pivot toward SaaS models. The cash burn exceeded what the platform revenue could sustain.


Lesson: Algorithms alone do not survive funding winters. Without proprietary clinical assets, AI drug discovery companies become service vendors competing on price.

Galixir – The Fade

Founded in 2019 with strong initial funding and a focus on retrosynthesis and generative chemistry. Published promising early research. But as the 2023–2024 consolidation hit, Galixir failed to secure sustainable pharma partnerships or generate proprietary pipeline assets. The company largely disappeared from the active radar by 2025, with no public updates on clinical progress or major deals.


Lesson: In a crowded market, survival requires either pharma validation (deals) or clinical proof (INDs). Academic publications and conference presentations are insufficient.

Broader Patterns of Failure

StoneWise and Galixir are not isolated cases. The 2023–2024 consolidation wave affected dozens of smaller firms. Common failure patterns include:

Pattern 1: Algorithm-Only. Companies built impressive computational platforms but never invested in wet-lab validation or proprietary pipelines. When Series B/C investors demanded clinical proof, they had nothing to show.

Pattern 2: Overfunding. Several startups raised $50M–100M+ at inflated valuations during the 2020–2021 peak. When the market corrected, they could not raise follow-on rounds at prior valuations, leading to down rounds, restructuring, or quiet shutdowns.

Pattern 3: Talent Drain. Top AI researchers and medicinal chemists migrated to better-funded competitors or returned to academia. Without core technical talent, platforms deteriorated and pipelines stalled.

Pattern 4: Geopolitical Squeeze. Companies with heavy US pharma client dependency lost contracts due to BIOSECURE concerns. Those that had not built dual structures early enough could not pivot fast enough to domestic or alternative international markets.

Section VIII

Big Tech in Drug Discovery 智能

China's technology giants entered the AI drug discovery space between 2018 and 2021, attracted by the convergence of their core AI capabilities with a massive healthcare market. Most have since scaled back proprietary drug ambitions, repositioning as infrastructure providers.

Baidu

Backed BioMap via CEO Robin Li. Published LinearDesign (Nature, 2023) for mRNA optimisation. PaddleHelix platform offered molecular generation tools. Current status: scaled back internal pipeline, refocused on selling cloud compute and AI infrastructure to biotech clients.

Tencent

AI Lab built iDrug platform for molecular generation and screening. Early investor in XtalPi and StoneWise. Published reinforcement learning work on molecular design (NeurIPS, 2021). Current status: steady research output, acts primarily as strategic investor and infrastructure partner rather than standalone biotech.

Alibaba (DAMO Academy)

Published AI antiviral discovery work (Advanced Science, 2022) during the COVID period. Built internal screening platforms. Current status: largely retreated from proprietary drug discovery, shuttered or spun off wet-lab operations to focus on core consumer AI and LLM development.

ByteDance

AI Lab explored protein structure prediction and LLMs for biology. Briefly maintained internal drug discovery research. Current status: retreated. Core business priorities in consumer AI and short-form video absorbed available resources and management attention.

Huawei Cloud

Developed the Pangu Drug Molecule Model for virtual screening and molecular generation. Partnered with StoneWise and other startups. Current status: active as an ecosystem enabler and SaaS provider. Explicitly positioned as infrastructure, not a competitor to pharma companies.

The pattern is consistent: Big Tech enters pharma with enthusiasm, discovers that drug discovery requires wet-lab validation and regulatory patience that does not fit quarterly earnings cycles, and retreats to infrastructure plays. The exception is BioMap, which operates as an independent entity despite Baidu backing.

Big Tech Contribution Assessment

CompanyPeak InvestmentNotable OutputCurrent RoleAssessment
BaiduBioMap spin-off + PaddleHelixLinearDesign (Nature 2023)Infra providerPartial retreat
TencentiDrug + XtalPi/StoneWise investmentsNeurIPS 2021 paperInvestor + infraSteady
AlibabaDAMO wet-lab unitAntiviral paper (2022)LLM focusRetreated
ByteDanceInternal AI Lab drug teamProtein structure researchConsumer AIRetreated
HuaweiPangu Drug Molecule ModelStoneWise + pharma partnershipsSaaS/computeActive enabler
Section IX

Traditional Chinese Medicine & AI 药智

The integration of AI with Traditional Chinese Medicine (TCM) represents a uniquely Chinese contribution to computational drug discovery. Network pharmacology – pioneered by Shao Li at Tsinghua University – uses AI to decode multi-target mechanisms of complex herbal formulas, mapping compound-pathway-disease relationships that classical pharmacology cannot easily untangle.


Key developments include:


Network Pharmacology. Shao Li's work established the computational framework for analysing TCM formulations as multi-component, multi-target systems. His group at Tsinghua published foundational reviews in Acta Pharmaceutica Sinica B (2021), defining the field's methodology and standards.


AI-Assisted Phytochemical Discovery. Sun Yat-sen University researchers applied deep learning to identify anti-cancer phytochemicals from natural product libraries (Phytomedicine, 2023), bridging TCM tradition with modern hit-finding approaches.


Knowledge Graph Integration. Peking University teams developed KGE-NFM (Bioinformatics, 2023), using knowledge graphs to discover drug repurposing opportunities from TCM compound databases. This approach has identified several candidates now in preclinical validation.


CAS Contributions. The Shanghai Institute of Materia Medica (SIMM) maintains one of the world's largest curated databases of TCM-derived compounds, increasingly integrated with AI screening workflows used by both academic and commercial discovery programs.


TCM-AI remains largely pre-commercial. No TCM-derived, AI-optimised compound has yet reached Phase II trials. But the intellectual infrastructure is substantial, and provincial governments (particularly Sichuan and Yunnan) continue to fund this intersection.


TCM-AI Technology Stack

Data Layer

SIMM compound database (100K+ TCM-derived molecules), TCMSP (Traditional Chinese Medicine Systems Pharmacology), SymMap (symptom-compound mapping). These databases feed AI screening workflows for both academic and commercial programs.

Computation Layer

Network pharmacology tools (NetworkAnalyst, DAVID), GNN-based target prediction, knowledge graph reasoning (KGE-NFM). Multi-target scoring enables analysis of synergistic effects in complex herbal formulations.

Validation Layer

CAS wet-lab validation pipelines, provincial TCM research institutes, academic hospital clinical observation studies. Most validation occurs at the phenotypic level rather than single-target biochemical assays.

Commercial Potential

Nearest-term applications: AI-guided quality control for TCM manufacturing, identification of active ingredient combinations for patent filings, and computational evidence packages for NMPA TCM drug approvals.

Detailed XtalPi Technology Stack

Quantum Physics Layer

Density functional theory (DFT) calculations for highly accurate crystal structure and polymorph prediction. Core differentiator from pure-ML approaches.

AI/ML Layer

Deep learning models trained on proprietary chemical datasets. Predict efficacy, toxicity, synthesis routes, and physicochemical properties.

Robotic Automation Layer

Cloud-connected "self-driving" labs in Shenzhen and Shanghai. Hundreds of automated workstations operating 24/7. Continuous feedback loop improves AI models.

Earendil (Helixon): The AI-Washing Question

Earendil Labs is the US-facing rebrand of Helixon Therapeutics, founded in 2021 by Jian Peng (former UIUC associate professor, 2020 Overton Prize winner in computational biology). The rebrand appears designed to present a US-centric, AI-native identity while the core team remains in China.


The pipeline problem. Earendil claims 40+ biologic drug candidates, led by a TL1A antibody (Phase 2) and IL23 assets. Both TL1A and IL23 are deeply validated, conventional immunological targets – not novel targets discovered by AI. There is no public evidence that the foundational biology or target selection was AI-generated. The AI contribution appears limited to protein engineering: optimizing binding affinity and stability of known targets. The core IP is essentially traditional antibody engineering.


The Sanofi deal. Sanofi signed a deal worth up to $2.56B ($125M upfront) for two bispecific antibodies targeting autoimmune and IBD indications. These are engineered bispecifics of known targets (TL1A, IL23), licensed for their optimized properties – not for novel, AI-discovered biology.


The $787M raise. Backed by Sanofi, Dimension, and Luminous Ventures. The scale resembles software mega-rounds, not traditional biotech Series rounds. When a company raises nearly $800M on the "AI" premium but runs conventional immunology assets, the question of AI-washing becomes hard to avoid.


The Recursion parallel. This pattern mirrors Recursion Pharmaceuticals in the US: $1B+ raised, AI branding, but clinical assets obtained through acquisition (their lead REC-4881 was in-licensed from Takeda, not AI-discovered). Earendil, like Recursion, demonstrates the industry-wide trend of using AI as a valuation multiplier while advancing fundamentally traditional pipelines.


Note: Peng's OmegaFold paper (protein folding without MSAs) was a genuine scientific contribution. The gap between this algorithmic research and Earendil's conventional antibody pipeline is the core concern.

METiS: Formulation, Not Discovery

What MTS-004 actually is. MTS-004, branded as "China's first AI-enabled formulation drug," reached Phase III for pseudobulbar affect (PBA). But MTS-004 is not a new molecular entity. It is a reformulation of existing generic active ingredients (PBA is conventionally treated with dextromethorphan/quinidine combinations like Nuedexta). The AI contribution was formulation optimization, not molecular discovery.


Formulation vs. discovery. METiS operates platforms for formulation optimization: AiLNP (lipid nanoparticles), AiRNA (RNA sequences), AiTEM (extended-release). These are incremental improvement tools. They reduce side effects or improve bioavailability of existing, genericized molecules. The commercial ceiling is capped because reformulated generics cannot command the pricing of novel biologics or new molecular entities with full patent protection.


The defensibility gap. A Phase III reformulated generic has low clinical risk (the active drug is already proven) but severely limited upside. Defending IP on formulation alone – without owning the underlying molecule – is notoriously difficult and rarely commands premium valuations. This is a fundamentally different business from AI-driven de novo drug design.


The AstraZeneca deal. This is a platform access arrangement: AZ pays METiS to use their formulation tools (primarily AiLNP) to deliver AZ's own proprietary molecules. METiS acts as a specialized delivery contractor, not a drug discovery partner. The deal validates their engineering capability but not their positioning as an "AI drug discovery" company.


The market for AI-optimized formulation is real but small. It represents a fundamentally different value proposition from the promise of AI-driven novel drug discovery that drives sector valuations.

Section X

Key Academic Papers 发现

1. Deep learning enables rapid identification of potent DDR1 kinase inhibitors
Zhavoronkov, A., et al. · Nature Biotechnology, 2019 · DOI: 10.1038/s41587-019-0224-x
Landmark: AI-designed molecule from target to mouse validation in 46 days. (Insilico Medicine)
2. Clinical efficacy and safety of rentosertib in idiopathic pulmonary fibrosis: Phase II results
Ren, F., Zhavoronkov, A., et al. · Nature Medicine, 2026
First clinical efficacy data for an end-to-end AI-discovered target and molecule. (Insilico Medicine)
3. DrugCLIP: Vision-Language Contrastive Learning for Drug Discovery
Lan, Y., et al. · Science, 2026
Multi-modal foundation model aligning molecular structures with biological text. (Tsinghua)
4. Deep Potential Molecular Dynamics: A Scalable Model with the Accuracy of Quantum Mechanics
Zhang, L., et al. · Physical Review Letters, 2018
Gordon Bell Prize winner. Massive molecular simulations with QM accuracy. (DP Technology/PKU)
5. xTrimo: A Cross-Modal Pre-trained Model for Protein Sequence and Structure
BioMap Research · Cell Systems, 2024
100-billion parameter foundation model for biology. (BioMap)
6. PBCNet: A Physics-Based Graph Neural Network for Crystal Structure Prediction
XtalPi/MIT Authors · Nature Computational Science, 2023
Combines AI and QM for predicting stable crystal polymorphs. (XtalPi)
7. Lingo3DMol: Generation of 3D molecular structures based on geometric deep learning
StoneWise Research Team · J. Chem. Inf. Model., 2022
Advanced 3D conformer generation for spatial docking. (StoneWise/CAS)
8. Network Pharmacology: A New Paradigm for Drug Discovery
Li, S., et al. · Acta Pharmaceutica Sinica B, 2021
Foundational review defining network pharmacology in China. (Tsinghua)
9. Uni-Fold: Open-Source Platform for Protein Folding Models beyond AlphaFold
DP Technology Team · Nature Communications, 2022
China's leading open-source reproduction and enhancement of AlphaFold.
10. Regulatory Perspectives on AI in Drug Development in China
Tsinghua/CDE Joint Review · Nature Reviews Drug Discovery, 2025
Outlines regulatory translation of CDE guidelines. (Tsinghua/NMPA)
11. Omegafold: High-resolution de novo structure prediction from a single sequence
Helixon Research · Nature, 2022
PLM-based protein folding without MSAs. (Helixon/Earendil)
12. LinearDesign: Efficient Algorithms for Optimized mRNA Sequence Design
Baidu Research · Nature, 2023
AI for optimising mRNA stability and translation efficiency.
13. Graph Neural Networks for Drug-Target Interaction Prediction
Chen, T., et al. · Briefings in Bioinformatics, 2022
Comprehensive review on GNNs in drug discovery. (Tsinghua)
14. KGE-NFM: Knowledge Graph Embedding for Network Pharmacology
Zheng, S., et al. · Bioinformatics, 2023
Knowledge graphs for drug repurposing. (PKU)
15. MoleculeNet: A benchmark for molecular machine learning
Wu, Z., et al. · Chemical Science, 2018
Widely used benchmark for molecular property prediction.
16. A deep generative model for molecule design with desired properties
Tencent AI Lab · NeurIPS, 2021
Reinforcement learning approach to molecular generation.
17. ChemLLM: A Chemical Large Language Model
Shanghai AI Lab · arXiv, 2024
One of the first domain-specific LLMs for chemistry.
18. De novo drug design using RL with graph-based deep generative models
Peking University · JCIM, 2020
Early successful application of RL to graph-based molecular generation.
19. AI-driven discovery of novel antiviral candidates
Alibaba DAMO Academy · Advanced Science, 2022
AI for COVID-19/broad-spectrum antiviral discovery.
20. Deep learning in ADMET prediction
CASIA (CAS) · Drug Discovery Today, 2021
Comprehensive model for pharmacokinetic properties.
21. DeepTox: Toxicity prediction using deep learning
Zhejiang University · Toxicological Sciences, 2022
State-of-the-art predictive toxicology modelling.
22. AI-assisted discovery of anti-cancer phytochemicals
Sun Yat-sen University · Phytomedicine, 2023
Merging AI with natural product drug discovery.
23. Automated synthesis planning with deep learning
XtalPi · Chemical Communications, 2021
Retrosynthesis AI integration with automated labs.
24. Generative models for targeted protein degraders (PROTACs)
Fudan University · J. Med. Chem, 2024
Applying generative AI to complex bifunctional molecules.
25. The Landscape of AI Applications in Clinical Trials in China
NMPA CDE Workgroup · Clinical Trials, 2025
Digital biomarkers and AI in Chinese clinical trials.
Section XI

Ecosystem Deal Tracker 创新

CompanyPartnerYearTotal ValueTypeTherapeutic Area
Insilico MedicineEli Lilly2024$2.75BPlatform + AssetsMultiple
Insilico MedicineSanofi2022$1.2BPlatform (6 targets)Multiple
BioMapSanofi2025$1B+Platform + DiscoveryImmuno-oncology
Insilico MedicineServier2026$888MR&D CollaborationOncology
Insilico MedicineMenarini/Stemline2025$550M+Asset LicenseOncology
XtalPi (Ailux)Eli Lilly2023$345MBiologics DiscoveryMultiple
Insilico MedicineQilu Pharmaceutical2026~$120MAsset PartnershipCardiometabolic
Insilico MedicineExelixis2024~$80M up.Asset License (USP1)Oncology
METiSAstraZeneca2023UndisclosedLNP CollaborationChronic diseases
EarendilSanofi2023UndisclosedAntibody DesignBiologics
XtalPiPfizer2022UndisclosedCrystal PredictionAntivirals
Insilico MedicineFosun Pharma2024$13M up.Co-developmentImmuno-oncology
StoneWiseChipscreen2022UndisclosedAI DiscoveryOncology
StoneWiseHuawei Cloud2022UndisclosedCompute PartnershipMultiple
Accutar BiotechMultiple pharma2023UndisclosedPROTAC DiscoveryOncology

Total values represent maximum biobucks (milestone-dependent). Actual payments conditional on clinical and regulatory achievements.

Deal Value Distribution

Market Leader Share
~78%
of total disclosed deal value in ecosystem
Total Ecosystem Deal Value
$7B+
combined biobucks across all tracked deals

Deal Flow by Year

2020
2021
2022
2023
2024
2025
2026*

* 2026 data reflects H1 only. Bar heights proportional to aggregate deal value announced per year.

Section XII

Regulatory Timeline 智药

July 2017
State Council: New Generation AI Development Plan
Guo Fa [2017] No. 35. Names healthcare and drug discovery as strategic AI priorities. Triggers massive public and private investment.
November 2021
PIPL Takes Effect
Personal Information Protection Law. Article 36 restricts cross-border transfer of health and genetic data. CAC security assessments required. Data localisation becomes de facto mandatory for Chinese patient cohorts.
July 2023
HGR Implementation Rules
Ministry of Science and Technology (MOST) enacts implementation rules for Human Genetic Resources regulations. Foreign entities barred from directly collecting/storing/transferring Chinese genetic data. AI companies with foreign capital must partner with domestic institutions.
August 2023
CDE Draft AI Guidance
CDE issues draft "Guiding Principles for the Application of AI in Drug R&D." Covers algorithm transparency, data quality, model validation, and interpretability requirements for AI-generated IND submissions.
March 2024
CDE Finalises AI Guidance (Trial)
"Guiding Principles for the Application of AI in Drug Research and Development (Trial)" actively enforced. First official Chinese regulatory document explicitly covering AI algorithms, training data traceability, and experimental validation in drug discovery.
2023–2024
Provincial Initiatives
Shanghai (Pudong): RMB 1.5B matched funding for AI-pharma in Zhangjiang. Suzhou (BioBAY): RMB 500M for AI drug clinical trials + supercomputing access. Shenzhen: RMB 2B for intelligent manufacturing and AI drug design in Greater Bay Area.
February 2026
Beijing RMB 10B AI Life Sciences Fund
"Beijing Action Plan for AI in Life Sciences (2026–2030)." RMB 10 billion (~$1.4B) municipal fund for AI-pharma. Establishes Beijing AI-Pharma Innovation Center with subsidised GPU clusters and clinical trial funding.
2028 (Projected)
Mandatory AI Toxicity Models?
Industry observers expect CDE to mandate AI-driven toxicity and PK/PD modelling as standard IND components for certain drug classes, formally embedding AI into China's regulatory pathway.

Regulatory Comparison: CDE vs. FDA vs. EMA

DimensionCDE/NMPA (China)FDA (US)EMA (EU)
Dedicated AI guidanceYes (March 2024, Trial)Discussion papers (2023+)Reflection paper (draft 2025)
AI in IND submissionsAccepted with validation dataCase-by-caseCase-by-case
Data sovereigntyStrict (PIPL + HGR)Moderate (HIPAA)Strict (GDPR)
Cross-border dataCAC security assessment requiredGenerally permittedAdequacy decisions
Algorithm transparencyRequired (traceability)RecommendedRecommended
Genetic data rulesMOST review mandatoryIRB oversightGDPR special category
Speed to INDFast (60-day default)30-day reviewVariable by member state

China's CDE is arguably the most advanced regulator globally in terms of explicit, codified guidance for AI in drug discovery. The FDA has broader experience reviewing AI-generated submissions but has not yet issued binding guidance specific to AI-designed molecules.

Data Sovereignty Impact

Three overlapping regulations govern data for AI drug discovery in China:

PIPL (Nov 2021): Article 36 restricts cross-border transfer of sensitive personal health data. Clinical trial data from Chinese patients requires CAC security assessment before leaving the country. AI model training on Chinese patient cohorts must occur on domestic servers.

HGR Rules (Jul 2023): Foreign entities cannot directly collect, store, or transfer Chinese human genetic resources. AI companies with foreign capital (even via VIE structures) must partner with domestic institutions and undergo MOST review. This affects all genomics-dependent AI training.

BIOSECURE Act (US, 2024+): Targets "companies of concern" with Chinese government ties. Forces Chinese AIDD companies serving US pharma to ring-fence infrastructure. WuXi AppTec and WuXi Biologics faced massive client departures. Cascading effect on the entire CRO ecosystem that AIDD companies depend on.

Section XII-B

Geopolitical Landscape 生物

Geopolitics is the single heaviest headwind for China's AIDD sector. The interaction between US legislative action, Chinese data sovereignty rules, and the global pharma supply chain creates a complex operating environment that shapes every strategic decision.

US–China Decoupling: Three Pressure Points

1. BIOSECURE Act

Targets "companies of concern" with ties to the Chinese government or military. WuXi AppTec and WuXi Biologics were named, triggering massive client departures. The cascading effect reaches AIDD companies that depend on WuXi's synthesis and screening infrastructure. Companies must now demonstrate separation from designated entities, adding compliance cost and operational complexity.

2. PCAOB & HFCAA

The Holding Foreign Companies Accountable Act requires US-listed Chinese companies to submit to PCAOB audit inspections. While a 2022 agreement temporarily resolved the standoff, uncertainty persists. This effectively closes NASDAQ to new Chinese biotech IPOs, pushing companies toward HKEX. For AIDD companies that need global investor access, this is a material constraint.

3. Export Controls & Compute Access

US restrictions on advanced GPU exports (A100, H100) affect the compute infrastructure available to Chinese AI companies. AIDD companies training large foundation models (BioMap's xTrimo, DP Technology's DeePMD) face constraints on hardware access. Domestic alternatives (Huawei Ascend, Cambricon) are closing the gap but remain behind NVIDIA in performance-per-watt for AI training workloads.

Impact on Deal Structures

Multinational pharma companies now demand explicit data compartmentalisation in licensing agreements with Chinese AIDD companies. Typical requirements include:

  • Ex-China global rights transferred to a non-Chinese entity (Cayman, Singapore, Swiss domicile)
  • All patient data from non-Chinese trials stored on non-Chinese servers
  • AI model training data provenance documentation (no PIPL-restricted datasets)
  • Contractual guarantees of no technology transfer to designated entities
  • Independent audit rights over data handling and IT infrastructure

These requirements add 6–12 months to deal negotiation timelines and increase legal costs by $1M–3M per transaction. Companies that built dual structures early (2022–2023) have a significant competitive advantage over those now scrambling to comply.

Neutral Jurisdiction Strategy

Singapore
IP holding, Asia-Pacific HQ, ASEAN market access
Switzerland
European pharma proximity, regulatory neutrality
Abu Dhabi
SWF capital, clinical trial infrastructure, MENA access
Section XIII

Deal Structures & Globalisation 创药

The NewCo Model

Several Chinese AIDD companies have adopted the "NewCo" or spin-off model to navigate geopolitical constraints. The parent company retains core AI IP and China-based operations. A separately incorporated entity – often domiciled in Singapore, Switzerland, or the Cayman Islands – holds ex-China rights to specific assets. This structure allows global pharma partners to license assets without triggering data sovereignty or BIOSECURE concerns. XtalPi's Ailux and Signet subsidiaries are examples of this pattern.

License-Out Economics

The dominant revenue model for Chinese AIDD companies is out-licensing: transferring global or ex-China rights to multinational pharma in exchange for upfront payments, development milestones, and royalties. Deal structures typically follow a pattern: $10M–80M upfront, $200M–2B+ in biobucks (milestone-contingent), and low-to-mid single-digit royalties on net sales. The gap between headline "total value" and actual cash received is significant. A $2.75B deal may yield $50M–100M in the first 3–5 years.

Bifurcation Strategy

TYPICAL BIFURCATION STRUCTURE
┌────────────────────┐     ┌─────────────────────┐
│ CHINA ENTITY       │ ←→ │ GLOBAL ENTITY       │
│ Domestic IP        │     │ Ex-China rights     │
│ NMPA filings       │     │ FDA/EMA filings     │
│ Local data         │     │ Ring-fenced IT      │
│ Shanghai/Suzhou     │     │ SG/CH/Boston/AbuDhabi│
└────────────────────┘     └─────────────────────┘

US–China decoupling has forced a dual-structure approach. Companies maintain a China entity for domestic IP, NMPA filings, and local data. A "Western" HQ (Boston, Singapore, Abu Dhabi) with ring-fenced IT infrastructure serves global pharma clients. Clinical trials increasingly run in Australia, the Middle East, or Europe as neutral jurisdictions. This adds operational complexity and cost, but has become a prerequisite for large pharma partnerships.

Middle East as Alternative Capital

Sovereign wealth funds from the UAE and Saudi Arabia have emerged as significant capital sources for Chinese AIDD companies seeking alternatives to US-centric funding. Abu Dhabi's Hub71 and Saudi Arabia's NEOM biotech initiatives actively recruit Chinese AI-pharma firms, offering capital, compute, and clinical trial infrastructure without the geopolitical friction of US-based investors.

Regional Exit Routes

HKEX (Primary): Chapter 18A (pre-revenue biotech) and Chapter 18C (specialist technology). Two AIDD companies listed as of mid-2026. Pipeline of 3–5 additional candidates expected by 2028. Advantages: proximity to China capital, RMB/HKD convertibility, regulatory familiarity.
Shanghai STAR Market: Viable for companies with purely domestic operations and no significant foreign investor concerns. Chipscreen Biosciences (688321.SS) is an existing example. Less liquid than HKEX for global investors.
NASDAQ (Blocked): Effectively unavailable. PCAOB audit inspection requirements, HFCAA delisting risk, data security concerns, and BIOSECURE implications make US listing impractical for Chinese AIDD companies. No new Chinese biotech IPO expected on NASDAQ before 2028 at the earliest.
Saudi Tadawul / Abu Dhabi (Emerging): Both exchanges are actively developing biotech listing frameworks. No Chinese AIDD company has listed in the Middle East yet, but sovereign wealth fund investment often comes with dual-listing pathway discussions. A 2027–2028 timeframe is realistic for a first listing.

IPO Landscape

CompanyExchangeTickerDateRaisedListing Rule
XtalPi (QuantumPharm)HKEX02228.HKJun 2024~HK$989MChapter 18C
Insilico MedicineHKEX03696.HKDec 2025HK$2.277BChapter 8.05

HKEX remains the primary viable exit for Chinese AI drug discovery companies. NASDAQ is effectively closed due to data security and audit inspection constraints. Shanghai STAR Market (Chapter 688) is a secondary option for companies with purely domestic operations.

CRO/CDMO Integration

China's CRO ecosystem has been both engine and casualty of the AIDD sector's evolution.

WuXi AppTec

Deeply integrated AI into HitS and DEL platforms before BIOSECURE pressure. Now relies on high-efficiency tech platforms to maintain margins. Massive client exodus from US pharma forced strategic recalibration.

Pharmaron & ChemPartner

Aggressively partnering with domestic AIDD startups to offer "AI-enabled" discovery services. Targeting domestic pharma clients and non-US global markets (Europe, Middle East, Southeast Asia).

The CRO-AIDD relationship is evolving from "vendor" to "integration partner." Mid-tier CROs will likely acquire struggling AI companies to rebrand as next-generation tech-bio service providers. The alternative is commoditisation and margin compression as AI tools become standard.

Academic Ecosystem

China's academic institutions produce a massive volume of AIDD research, forming the talent pipeline for the commercial sector.

InstitutionLocationKey StrengthsNotable Output
Tsinghua UniversityBeijingAIR Institute, network pharmacology, GNNsDrugCLIP (Science 2026), Shao Li network pharm.
Peking UniversityBeijingRL-based molecular generation, knowledge graphsKGE-NFM, de novo drug design (JCIM 2020)
Fudan UniversityShanghaiProtein structure prediction, PROTACsAlphaFold alternatives, generative degraders
Zhejiang UniversityHangzhouPredictive toxicology, ADMET modellingDeepTox (Tox. Sciences 2022)
Shanghai Jiao TongShanghaiStructural biology, computational chemistryMultiple CADD publications
USTCHefeiQuantum computing applications, physics-MLQuantum chemistry acceleration
Sun Yat-sen UniversityGuangzhouNatural products, phytochemistryAI anti-cancer phytochemicals (2023)
CAS (SIMM)ShanghaiMateria medica, compound libraries, CADDTCM database, virtual screening pipelines
Shanghai AI LabShanghaiFoundation models, ChemLLMChemLLM (arXiv 2024), open-source tools
AI for Science InstituteBeijingMolecular dynamics, ab initio methodsDeePMD-kit, Uni-Fold

China's university system produces over 5,000 PhDs annually in AI, computational chemistry, and structural biology. This talent pool is the sector's most durable competitive advantage, though brain drain to US and European institutions remains a concern.

Patent Landscape

494
AI drug discovery patents filed by Chinese entities (CNIPA + WIPO, 2018–2025)

Patent Categories

  • Generative molecular design: ~35%
  • Target identification / multi-omics: ~25%
  • Crystal structure / polymorph prediction: ~15%
  • Clinical trial optimisation: ~10%
  • ADMET / toxicity prediction: ~10%
  • Other (retrosynthesis, formulation): ~5%
Section XIV

Future Outlook: 2026–2030 发现

Survival Criteria

Companies that built proprietary, AI-generated clinical assets and advanced them to Phase I/II will survive. Pure algorithm vendors will be acquired by CROs or shut down. The "AI-Biotech" model wins; the "Tech-SaaS" model does not, at least not as a standalone business.

Will China Produce a Second Market Leader?

Probably, but with a different playbook. The current market leader succeeded by being natively global from day one. A future Chinese AIDD success story (2027–2030) will likely emerge from the Asia-Pacific/Middle East nexus – Chinese engineering talent, sovereign wealth funding from the Gulf, and clinical trials in Australia or Europe. NASDAQ access remains effectively closed for Chinese biotechs with data security concerns; HKEX Chapter 18A will remain the primary exit.

Consolidation Patterns

M&A: Cash-rich traditional Chinese pharma (Hengrui, CSPC, Qilu) will acquire distressed AIDD startups at steep discounts to build internal computational capabilities.

CRO Roll-ups: Mid-tier CROs will absorb struggling AI software companies, rebranding as "next-gen tech-bio CROs" to compete against WuXi domestically.

Survivors: The top 10% of AIDD companies will emerge as fully integrated biotechs, indistinguishable from traditional biotechs except for faster preclinical timelines and lower failure rates.

Regulatory Evolution

By 2028, the CDE may mandate AI-driven toxicity and PK/PD modelling as standard IND components. This would accelerate domestic trial approvals for companies with mature platforms while raising the bar for new entrants. China's regulatory framework will continue to mirror FDA standards in structure while diverging on data sovereignty requirements.

Key Metrics to Watch (2026–2030)

Clinical Milestones

  • Rentosertib Phase IIb/III initiation (2026–2027)
  • First AI-discovered NDA submission from China (est. 2028–2029)
  • XtalPi Signet SIGX1094R Phase II data
  • MindRank MDR-001 GLP-1 Phase I readout
  • BioMap first proprietary IND filing

Market & Policy Signals

  • BIOSECURE Act final enforcement timeline
  • CDE mandatory AI-tox guidance (est. 2028)
  • Beijing RMB 10B fund deployment pace
  • First traditional pharma M&A of AIDD startup
  • Middle East sovereign wealth fund entries

Scenario Analysis

Bull Case: Rentosertib Phase III success triggers a wave of big pharma acquisitions. Two or three additional Chinese AIDD companies reach clinical proof-of-concept. HKEX biotech listings recover. Total sector market cap exceeds $30B by 2029.
Base Case: Slow but steady progress. 5–8 companies survive in meaningful form. Licensing deals continue at current pace. No Chinese AIDD company achieves FDA approval before 2030, but NMPA approvals for AI-discovered drugs begin in 2029.
Bear Case: Geopolitical escalation shuts remaining cross-border partnerships. Clinical failures in lead programs erode investor confidence. Sector contracts to 2–3 survivors plus CRO-absorbed teams. AI drug discovery narrative shifts entirely to US/EU companies.
Section XV

Key People 生物

Twelve individuals have shaped China's AI drug discovery landscape through scientific contributions, company building, regulatory framework design, or capital deployment. Their work spans from algorithmic innovation at top universities to clinical execution at the industry's leading companies.

AZ
Alex Zhavoronkov, PhD
Founder & CEO, Insilico Medicine
Pioneer in applying GANs and reinforcement learning to drug discovery. Led the first end-to-end AI-discovered molecule into Phase II trials. Published the landmark 2019 Nature Biotechnology paper.
FR
Feng Ren, PhD
Co-CEO & CSO, Insilico Medicine
Former Head of Chemistry at GSK China and SVP at Medicilon. Built the automated robotics lab and led IND-enabling studies for multiple AI-designed molecules. Bridges wet lab and computational AI.
MJ
Ma Jian, PhD
Co-Founder & CEO, XtalPi
MIT postdoc in physics. Pioneered quantum physics-based computational drug discovery. Led XtalPi from founding to HKEX IPO and $730M+ in pre-IPO funding.
RL
Robin Li (Li Yanhong)
CEO, Baidu; Backer, BioMap
Tech billionaire and AI strategist. Founded BioMap as a standalone AI life sciences venture. Funded the xTrimo foundation model and BioMap's $1B+ Sanofi deal.
LZ
Linfeng Zhang, PhD
Founder, DP Technology; AI for Science Institute
PhD from Princeton. Creator of DeePMD-kit. Won the Gordon Bell Prize for ab initio molecular dynamics at scale. Integrates deep learning with physical principles.
LY
Lan Yanyan, PhD
Professor, AIR, Tsinghua University
Leading researcher in multi-modal foundation models for chemistry and drug discovery. Principal author of DrugCLIP (Science, 2026).
SL
Shao Li, PhD
Professor, Tsinghua University
Pioneer of AI-driven Network Pharmacology. Bridges Traditional Chinese Medicine with modern AI pathway analysis to discover multi-target mechanisms.
TC
Ting Chen, PhD
Professor, Tsinghua University
Integrates AI, bioinformatics, and computational biology to identify novel drug targets from large-scale genomic data. Key academic contributor to GNN research.
JZ
Jielong Zhang
Founder, StoneWise
Led StoneWise to early prominence in Chinese AI drug discovery through knowledge graphs and generative chemistry. Company subsequently restructured during market consolidation.
KF
Kong Fanpu
Reviewer/Official, CDE (NMPA)
Instrumental in drafting the CDE's "Guiding Principles for AI in Drug R&D." Key figure in establishing China's regulatory framework for AI-generated molecules.
JP
JianPeng Ma, PhD
Professor, Fudan / Shanghai AI Lab
Led development of AlphaFold alternatives in China. Focused on protein structure prediction models optimised for domestic supercomputing clusters.
CW
Chen Wei, PhD
Researcher, Peking University
Advanced ML models for network pharmacology and drug-target interaction networks. Contributed to accelerating compound screening workflows at PKU.
Appendix

Glossary & Abbreviations 药智

AbbreviationFull Term
AIDDAI-Driven Drug Discovery
CADDComputer-Aided Drug Design
CASChinese Academy of Sciences
CACCyberspace Administration of China
CDECenter for Drug Evaluation (under NMPA)
CROContract Research Organisation
CDMOContract Development and Manufacturing Organisation
DELDNA-Encoded Library
DFTDensity Functional Theory
GANGenerative Adversarial Network
GNNGraph Neural Network
HGRHuman Genetic Resources
HKEXHong Kong Stock Exchange
INDInvestigational New Drug (application)
IPFIdiopathic Pulmonary Fibrosis
LLMLarge Language Model
LNPLipid Nanoparticle
MOSTMinistry of Science and Technology
NDANew Drug Application
NMPANational Medical Products Administration
PIPLPersonal Information Protection Law
PK/PDPharmacokinetics / Pharmacodynamics
PROTACProteolysis-Targeting Chimera
QMQuantum Mechanics
QSARQuantitative Structure-Activity Relationship
RLReinforcement Learning
SaaSSoftware as a Service
SIMMShanghai Institute of Materia Medica
TCMTraditional Chinese Medicine
VIEVariable Interest Entity
Section XVI

Data Sources & Limitations 智能

Primary Sources

Financial & Corporate

  • • HKEX filing documents (3696.HK, 02228.HK)
  • • Company press releases and investor presentations
  • • CrunchBase and PitchBook deal databases
  • • SEC EDGAR (for US-listed entities)
  • • Bloomberg Terminal data

Scientific & Regulatory

  • • PubMed and Google Scholar
  • • CDE/NMPA regulatory database (cde.org.cn)
  • • ClinicalTrials.gov and ChiCTR
  • • CNIPA and WIPO patent databases
  • • Nature, Science, Cell Systems, JCIM journals

Known Limitations

  • Deal values are biobucks. Headline numbers represent maximum milestone-contingent payments. Actual cash received by companies in the first 3–5 years is typically 5–15% of announced totals.
  • "AI-discovered" is subjective. No industry-wide standard exists for what qualifies as an AI-discovered molecule. Some companies claim AI involvement for assets where AI played a marginal role (e.g., docking scoring only).
  • Private company data is incomplete. Revenue figures for non-public companies are estimates based on disclosed deal terms and industry sources. Actual financials may differ.
  • Patent counts include applications. Not all filed patents have been granted. The 494 figure includes published applications, which may be rejected or abandoned.
  • Survival bias. This report necessarily focuses on companies that still exist. Dozens of small AIDD startups were founded between 2018 and 2022 and quietly shut down without public notice.
  • Chinese-language sources. Some regulatory documents, provincial policy announcements, and academic papers are available only in Chinese. Translations may introduce minor inaccuracies.

About This Report

This report covers the period from 2014 to mid-2026. Data was compiled between January and May 2026. Financial figures are derived from HKEX filings, company press releases, and verified industry databases. Deal values represent maximum biobucks unless stated otherwise.

"AI-discovered" is defined as molecules where computational/AI methods played a primary role in target identification, molecular design, or both, and where this claim is supported by published evidence or regulatory filings.

Company rankings reflect a composite assessment of clinical pipeline depth, deal validation, revenue, public market status, and technological differentiation. Rankings are the author's assessment and may differ from other industry analyses.

Patent data from CNIPA (China National Intellectual Property Administration) and WIPO databases. Patent counts include published applications and granted patents in AI/ML drug discovery classifications as of Q1 2026.