A Gigamodel in a porcelain store… Part II

Summary

What changes with Gigamodels?

  • The race for size

  • A small handful of actors produce Gigamodels

  • The emergence of specialized actors

The impact on research and industry

  • Research: promise for under-resourced languages; separation of learnings?

  • Industry: promise of speed; schism around data and Cloud?

  • In conclusion: reservations, but a course of action

What changes with Gigamodels?

In each of their fields, Gigamodels establish the new state of the art. Such transitions occur more or less regularly in technology, and actors generally adapt to more efficient new approaches. What makes the transition to Gigamodels potentially different from others?

The race for size

Gigamodels are not only large: ever larger ones emerge every year.

A small handful of actors produce Gigamodels

On the training side, the deep learning revolution had already led laboratories and research teams to equip themselves with GPUs (Graphics Processing Units) to train the many parameters of deep neural networks. Training a Gigamodel, however, requires GPU farms of a size that only a few actors worldwide possess.
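To give an order of magnitude for this GPU requirement, here is a minimal back-of-the-envelope sketch. It rests on assumptions that are not from the article: a rule of thumb of about 16 bytes of memory per parameter for mixed-precision training with the Adam optimizer (activations ignored), a hypothetical 80 GB of memory per GPU, and GPT-3's published size of 175 billion parameters. The function names are illustrative only.

```python
import math

# Assumed rule of thumb for mixed-precision Adam training, per parameter:
# fp16 weights (2) + fp16 gradients (2) + fp32 master weights (4)
# + Adam momentum (4) + Adam variance (4) = 16 bytes. Activations are ignored.
BYTES_PER_PARAM_TRAINING = 16


def training_memory_gb(n_params: float) -> float:
    """Memory needed to hold the training state, in gigabytes (1e9 bytes)."""
    return n_params * BYTES_PER_PARAM_TRAINING / 1e9


def min_gpus(n_params: float, gpu_memory_gb: float = 80.0) -> int:
    """Smallest number of GPUs whose combined memory can hold the training state.

    gpu_memory_gb is a hypothetical per-GPU capacity, not a figure from the article.
    """
    return math.ceil(training_memory_gb(n_params) / gpu_memory_gb)


if __name__ == "__main__":
    gpt3_params = 175e9  # published GPT-3 parameter count
    print(training_memory_gb(gpt3_params))  # 2800.0 GB of training state
    print(min_gpus(gpt3_params))            # 35 GPUs just to hold that state
```

This only bounds the memory needed to store the model's training state. Training in a reasonable time also depends on compute throughput and interconnect, so real clusters are far larger than this lower bound.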

As a result, almost all Gigamodels are produced by a small handful of actors: Google (BERT, T5, LaMDA, Imagen…), Microsoft+OpenAI (GPT-2, GPT-3, DALL-E, Whisper, Turing-NLG…), NVIDIA (Megatron-Turing NLG with Microsoft, Riva), and Meta, formerly Facebook (XLM, RoBERTa, wav2vec…).

The emergence of specialized actors

Regarding the use of Gigamodels, whether fine-tuning them or deploying them efficiently in production, specific know-how is developing among a few actors: HuggingFace, the leading platform for sharing models and tooling; Microsoft, with the ONNX format; and GPU manufacturer NVIDIA, with TensorRT, Triton, NeMo, Riva… This expertise is clearly key to the business models of these actors, which rely either on revenue from deploying models in their cloud or on the sale of their GPUs.

The companies producing Gigamodels, such as Google and Microsoft, generally also have a business model involving the sale or rental of processing capacity.

The impact on research and industry

What is the impact of these Gigamodels on research and the industry of cognitive services, language, and speech technologies?

It is probably too early to accurately assess the impact of Gigamodels, but we can already identify the challenges they pose and possible developments.

Research: promise for under-resourced languages; separation of learnings?

Gigamodels hold the promise of better serving applications in under-resourced languages or domains, even languages with no written form, thanks to pre-trained models complemented by modest-sized supervised corpora, or directly through self-supervised learning. However, experiments are still needed to establish the conditions under which these approaches are effective.

Another question is whether research will split between a few teams working on the design of the Gigamodels themselves and a majority of others focusing on fine-tuning, producing lighter models, working with limited data, and so on.

Industry: promise of speed; schism around data and Cloud?

For industry, the promise of Gigamodels is primarily one of lower barriers to entry and a reduced need for supervised training data.

However, Gigamodels also require specific infrastructures and environments for training and for deployment in production. Two types of users can therefore be seen emerging within the industry: those who will rely on specialized actors offering training and deployment tools in those actors' cloud, and those who will develop their own training and production capacity.

These different types of actors can be represented in the diagram below, taking into account the size of the GPU infrastructures and the size of the data corpora handled.

The result looks like a schism over data and infrastructure, especially when considering the blue blocks. Whether this schism will be confirmed remains to be seen.

The development model of start-ups like HuggingFace seems to bet on this schism. NVIDIA, which supplies HuggingFace and the GAFAM as well as specialized companies and actors, seems ready to support other development models should the taupe blocks, which represent actors regaining control over their data and infrastructure, become more significant.

In conclusion: reservations, but a course of action

In conclusion, a few qualifications are in order:

First of all, Gigamodels, promising as they are, remain a recent phenomenon. In industry they still largely coexist with hybrid models from the previous generation, trained on already collected and annotated data, which continue to guarantee high performance in their operational domain. The adoption and impact of Gigamodels in industry are not yet settled and will depend on how nimbly industry actors take them up and integrate them into business applications.

Moreover, despite their growing prowess, the limitations of large language models (LLMs) must be taken into account: increasing model size does not automatically resolve all the complexities of spoken communication, and each model must find its place and operational relevance, which also depends largely on interactions with human users.

Finally, the legal issues surrounding the data feeding these models cannot be overlooked.

These reservations aside, it is undeniable that Gigamodels are disrupting the state of technology and opening up immense, unexplored application possibilities. If European actors do not seize the opportunity and instead leave it to the large US platforms that dominate production and deployment today, they will have no chance to shape the new ecosystem now taking form.

The End!
