What a $150m AI startup taught the USA’s bloated US$146bn AI industry

By Matthew Oostveen, Chief Technology Officer, APJ, Pure Storage | Wednesday, April 23, 2025, 10:43 AM Asia/Singapore

Despite being pinned down by US tech bans, China’s cost-effective AI model has outpaced global industry giants. What can we learn?

In late 2024, DeepSeek’s cost-effective generative AI (GenAI) model effectively demonstrated to the world that smaller, specialized models, paired with refined data management, can outperform large, resource-heavy foundational models, other factors notwithstanding.

This approach lowers costs, enhances efficiency, and shifts focus from building massive networks to optimizing data and infrastructure for AI innovation.

While the AI industry has long been fixated on foundational models (massive, all-knowing networks trained on everything and anything), the mixture-of-experts (MoE) approach has shown that smaller, more specialized models are viable and, in many ways, superior.

Lessons from a surprise AI player

The meteoric rise of this approach offers practical lessons for enterprises building their own AI capabilities.

To implement this, use a mixture-of-experts model, in which smaller, highly trained models work in tandem. A routing mechanism selects the most appropriate expert model for each request, optimizing for both performance and efficiency. Specifically:

  • Instead of one giant model doing everything, enterprises can deploy a system of interconnected models, each specialized in a specific domain. Smaller models require significantly less compute power, but the true benefit goes beyond cost savings.
  • Focused expertise makes it easier to test and verify performance in real-world applications. This approach enables the addition of more specialized model capabilities, without the complexity of building a foundation model. Small models also stand to gain reasoning capabilities more quickly, leading to better AI oversight and transparency in the long run.
  • Building foundational models is cost-prohibitive for most organizations, but this new paradigm lowers the barrier to creating highly capable, domain-specific models using proprietary data. Looking ahead, industries can also expect the development of tools and base models that will streamline data distillation, making it easier to create smaller and more capable models.
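
The routing idea described above can be sketched in a few lines. This is a minimal, system-level illustration only: the expert names, the keyword lists, and the `route` and `answer` helpers are all hypothetical, and the keyword-counting gate stands in for what would normally be a learned gating network. It is not a description of how DeepSeek or any production MoE model is implemented.

```python
import math
from typing import Callable, Dict, List, Tuple

# Hypothetical stand-ins for small, domain-specific models.
EXPERTS: Dict[str, Callable[[str], str]] = {
    "finance": lambda q: f"[finance expert] {q}",
    "legal": lambda q: f"[legal expert] {q}",
    "support": lambda q: f"[support expert] {q}",
}

# Hypothetical keyword lists; a real gate would be a trained model.
KEYWORDS: Dict[str, List[str]] = {
    "finance": ["invoice", "tax", "revenue"],
    "legal": ["contract", "clause", "liability"],
    "support": ["password", "login", "reset"],
}


def softmax(scores: List[float]) -> List[float]:
    """Turn raw scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route(query: str, top_k: int = 1) -> List[Tuple[str, float]]:
    """Score every expert for this query and return the top_k with weights."""
    words = query.lower().split()
    names = list(KEYWORDS)
    # Toy gate: count how many query words match each expert's keywords.
    scores = [float(sum(w in KEYWORDS[n] for w in words)) for n in names]
    weights = softmax(scores)
    ranked = sorted(zip(names, weights), key=lambda pair: pair[1], reverse=True)
    return ranked[:top_k]


def answer(query: str) -> str:
    """Send the query to the single highest-weighted expert."""
    name, _weight = route(query, top_k=1)[0]
    return EXPERTS[name](query)


if __name__ == "__main__":
    print(route("please check the invoice tax total", top_k=2))
    print(answer("how do I reset my password"))
```

Only the selected expert runs for a given request, which is where the compute savings come from. Raising `top_k` lets several experts contribute, at the cost of more compute per request.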

Optimizing data and infrastructure for GenAI

For years, the AI industry has focused on hoarding data, maximizing token counts, and relying on brute force.

With mixture-of-experts models, data management now takes center stage. To maximize AI effectiveness, shift from hoarding data to selecting, organizing, and refining it. AI is only as good as the data it is trained on, so prioritize curating high-quality data, optimizing data pipelines, and building infrastructure that supports AI. Specifically:

  • Use practices like continuous data enrichment, versioning, and traceability to ensure models are trained on up-to-date, reliable data, improving performance and reducing errors.
  • Enterprises also need systems that can quickly and dynamically organize and categorize data, filter out irrelevant information, and retrieve specific data at scale in real time. DeepSeek's approach has already demonstrated this with a meticulously designed data selection pipeline, in which data sets were filtered and refined instead of the model being trained indiscriminately on all available data. This has not only improved efficiency but also reduced costs.
  • AI-driven intelligent data selection is emerging as the cornerstone of future AI training, ensuring efficiency and precision in model development.
  • As AI shifts toward specialized models and data refinement, infrastructure must evolve to support this new reality with a multi-dimensional approach to performance. It must support thousands of smaller models working in parallel, as well as key-value stores that can efficiently handle data at inference time.
  • These models should be capable of processing and producing results at scale without compromising on speed or accuracy. In addition to performance, the infrastructure must also prioritize high connectivity and always-on availability. Systems need to be able to scale rapidly and manage vast quantities of data in real time.
  • A critical element in achieving this is efficient storage systems that can index, retrieve, filter, and represent large datasets effectively. Storage is no longer just about holding data: it is about enabling effective data use for AI to drive real innovation and unlock opportunities at the intersection of AI, data science, and data management.
  • The new paradigm requires businesses to rethink their approach to data storage, integration, and processing. Simplifying data management while ensuring performance and scalability can pave the way for a smarter AI ecosystem that can help industries drive innovation with data.
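
As a rough illustration of the data selection, versioning, and traceability practices listed above, the sketch below filters out short and duplicate records and then fingerprints the curated set. The `Record` type, the `min_words` threshold, and the helper names are assumptions made for this example; a real pipeline would apply far richer quality checks.

```python
import hashlib
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Record:
    source: str  # where the text came from, kept for traceability
    text: str


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so near-identical copies match."""
    return " ".join(text.lower().split())


def select(records: Iterable[Record], min_words: int = 5) -> List[Record]:
    """Keep records that are long enough and not duplicates of earlier ones."""
    seen = set()
    kept: List[Record] = []
    for record in records:
        norm = normalize(record.text)
        if len(norm.split()) < min_words:
            continue  # too short to be useful (hypothetical quality rule)
        digest = hashlib.sha256(norm.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # duplicate of a record already kept
        seen.add(digest)
        kept.append(record)
    return kept


def dataset_version(records: Iterable[Record]) -> str:
    """Short fingerprint of the curated set, to tag a training run with."""
    h = hashlib.sha256()
    for record in records:
        h.update(record.source.encode("utf-8"))
        h.update(b"\x00")
        h.update(normalize(record.text).encode("utf-8"))
        h.update(b"\x01")
    return h.hexdigest()[:12]


if __name__ == "__main__":
    raw = [
        Record("crm", "Customer reported a login failure after the update"),
        Record("wiki", "customer  reported a LOGIN failure after the update"),
        Record("chat", "ok thanks"),
        Record("docs", "Reset the password from the account settings page"),
    ]
    curated = select(raw)
    print(len(curated), dataset_version(curated))
```

Recording the fingerprint alongside each trained model makes it possible to tell later exactly which curated data set a given model saw.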

By implementing these measures and proactively pressing major software firms to uphold rigorous proactive and preemptive cyber diligence, we can all work and rest easier.
