Google announces Gemini: it’s most powerful AI model, powered by TPU chips

Google claims it has developed a new Generative Artificial Intelligence (GenAI) system and Large Language Model (LLM) more powerful than any currently on the market, including technology developed by ChatGPT creator OpenAI.   Gemini can summarize text, create images and answer questions. Gemini was trained on Google’s Tensor Processing Units v4 and v5e.

Google’s Bard is a generative AI based on the PaLM large language mode. Starting today, Gemini will be used to give Bard “more advanced reasoning, planning, understanding and more,” according to a Google blog post.

While global users of Google Bard and the Pixel 8 Pro will be able to run Gemini now, an enterprise product, Gemini Pro, is coming on Dec. 13th.  Developers can sign up now for an early preview in Android AICore.

Gemini comes in three model sizes: Ultra, Pro and Nano. Ultra is the most capable, Nano is the smallest and most efficient, and Pro sits in the middle for general tasks. The Nano version is what Google is using on the Pixel, while Bard gets Pro. Google says it plans to run “extensive trust and safety checks” before releasing Gemini Ultra to select groups.

Gemini can code in Python, Java, C++, Go and other popular programming languages. Google used Gemini to upgrade Google’s AI-powered code generation system, AlphaCode.  Next, Google plans to bring Gemini to Ads, Chrome and Duet AI. In the future, Gemini will be used in Google Search as well.

……………………………………………………………………………………………………………………………………………………

Market Impact:

Gemini’s release and use will present a litmus test for Google’s technology following a push to move faster in developing and releasing AI products. It coincides with a period of turmoil at OpenAI that has sent tremors through the tight knit AI community, suggesting the industry’s leaders is far from settled.

The announcement of the new GenAI software is the latest attempt by Google to display its AI portfolio after the launch of ChatGPT about a year ago shook up the tech industry. Google wanted outside customers to perform testing on the most advanced version of Gemini before releasing it more widely, said Demis Hassabis, chief executive officer of Google DeepMind.

“We’ve been pushing forward with a lot of focus and intensity,” Hassabis said, adding that Gemini likely represented the company’s most ambitious combined science and engineering project to date.

Google said Wednesday it would offer a range of AI programs to customers under the Gemini umbrella. It touted the software’s ability to process various media, from audio to video, an important development as users turn to chatbots for a wider range of needs.

The most powerful Gemini Ultra version outperformed OpenAI’s technology, GPT-4, on a range of industry benchmarks, according to Google. That version is expected to become widely available for software developers early next year following testing with a select group of customers.

………………………………………………………………………………………………………………………………………………………………………………………………

Role of TPUs:

While most GenAI software and LLM’s are processed using NVIDIA’s neural network processors, Google’s tensor processing units (TPUs) will power Gemini.  TPUs are custom-designed AI accelerators, which are optimized for training and inference of large AI models.  Cloud TPUs are optimized for training large and complex deep learning models that feature many matrix calculations, for instance building large language models (LLMs). Cloud TPUs also have SparseCores, which are dataflow processors that accelerate models relying on embeddings found in recommendation models. Other use cases include healthcare, like protein folding modeling and drug discovery.

Google’s custom AI chips, known as tensor processing units, are embedded in compute servers at the company’s data center.  Photo Credit: GOOGLE

…………………………………………………………………………………………………………………………………

Competitors:

Gemini and the products built with it, such as chatbots, will compete with OpenAI’s GPT-4, Microsoft’s Copilot (which is based on OpenAI’s GPT-4), Anthropic’s Claude AI, Meta’s Llama 2 and more. Google claims Gemini Ultra outperforms GPT-4 in several benchmarks, including the massive multitask language understanding general knowledge test and in Python code generation.

…………………………………………………………………………………………………………………………………

References:

Everything to know about Gemini, Google’s new AI model (blog.google)

Google Reveals Gemini, Its Much-Anticipated Large Language Model (techrepublic.com)

https://cloud.google.com/tpu

https://www.wsj.com/tech/ai/google-announces-ai-system-gemini-after-turmoil-at-rival-openai-10835335?mod=ai_more_article_pos1

 

 

 

Generative AI Unicorns Rule the Startup Roost; OpenAI in the Spotlight

Introduction:

Despite mounting pressure on venture capital in a difficult economic environment, money is still flowing into generative Artificial Intelligence (AI) startups.  Indeed, AI startups have emerged as a bright spot for VC investments this year amid a wider slowdown in funding caused by rising interest rates, a slowing economy and high inflation.

VCs have already poured $10.7 billion into Generative AI [1.] start-ups within the first three months of this year, a thirteen-fold increase from a year earlier, according to PitchBook, which tracks start-ups.

Note 1. Generative AI is a type of artificial intelligence that can create new content, such as text, synthetic data, images, and audio.  The recent buzz around Generative AI has been driven by the simplicity of new user interfaces for creating high-quality content in a matter of seconds.

….………………………………………………………………………………….

Tech giants have poured effort and billions of dollars into what they say is a transformative technology, even amid rising concerns about A.I.’s role in spreading misinformation, killing jobs and one day matching human intelligence.  What they don’t publicize is that the results (especially from ChatGPT) may be incorrect or inconclusive.

We take a close look at Generative AI Unicorns with an emphasis on OpenAI (the creator of ChatGPT) and the competition it will face from Google DeepMind.

Generative AI Unicorns and OpenAI:

AI startups make up half of all new unicorns (startups valued at more than $1B) in 2023, says CBInsights.

At Generative AI firms, startups are reaching $1 billion valuations at lightning speed.  There are currently 13 Generative AI unicorns (see chart below), according to CBInsights which said they attained their unicorn status nearly twice as fast as the average $1 billion startup.

Across the 13 Generative AI unicorns, the average time to reach unicorn status was 3.6 years but for the unicorn club as a whole the average is 7 years — almost twice as long.

OpenAI, the poster child for Generative AI with its Chat GPT app, tops the list with a valuation of almost $30 billion.  Microsoft is the largest investor as it provided OpenAI with a $1 billion investment in 2019 and a $10 billion investment in 2023.  Bloomberg reported that the company recently  closed an investment fund, exceeding expectations with a value that surpasses $175 million.

However, OpenAI may have a formidable competitor in Google DeepMind (more details in DeepMind section below).

….……………………………………………………………………………………………………………………………………………………….

Anthropic is #2 with a valuation of $4.4B. It’s an AI safety and research company based in San Francisco, CA.  The company says they “develop large-scale AI systems so that we can study their safety properties at the technological frontier, where new problems are most likely to arise. We use these insights to create safer, steerable, and more reliable models, and to generate systems that we deploy externally, like Claude (to be used with Slack).”

In Q1-2023, Generative AI companies accounted for three of the entrants to the unicorn club with Anthropic, Adept, and Character.AI all gaining valuations of $1B or above.

New Generative AI Unicorns in May:

Ten companies joined the Crunchbase Unicorn Board in May 2023 — double the count for April 2023. Among them were several AI startups:

  • Toronto-basedCohere, a generative AI large language model developer for enterprises, raised $270 million in its Series C funding. The funding was led by Inovia Capital  valuing the 4-year-old company at $2.2 billion.
  • Generative video AI company Runway, based out of New York, raised a $100 million Series D led by Google. The funding valued the 5-year-old company at $1.5 billion.
  • Synthesia, a UK-based artificial intelligence (AI) startup, has raised about $90 million at a valuation of $1 billion from a funding round led by venture capital firms Accel and Nvidia-owned NVentures.  “While we weren’t actively looking for new investment, Accel and NVIDIA share our vision for transforming traditional video production into a digital workflow,” said Victor Riparbelli, co-founder and CEO of Synthesia.

….…………………………………………………………………………………………………………………………………………………..

Google DeepMind:

Alphabet CEO Sundar Pichai said in a blog post, “we’ve been an AI-first company since 2016, because we see AI as the most significant way to deliver on our mission.”

In April, Alphabet Inc. created “Google DeepMind,” in order to bring together two leading research groups in the AI field: the Brain team from Google Research, and DeepMind (the AI startup Google acquired in 2014). Their collective accomplishments in AI over the last decade span AlphaGo, Transformers, word2vec, WaveNet, AlphaFold, sequence to sequence models, distillation, deep reinforcement learning, and distributed systems and software frameworks like TensorFlow and JAX for expressing, training and deploying large scale Machine Learning (ML) models.

By launching DeepMind as Google’s Generative AI solution, there could be a new battle front opening in quantum computing, machine learning perception, gaming and mobile systems, NLP and human-computer interaction and visualization.

A recent DeepMind paper says the Alphabet unit has extended AI capabilities with faster sorting algorithms to create ordered lists.  Their paper says it shows “how artificial intelligence can go beyond the current state of the art,” because ultimately AlphaDev’s sorts use fewer lines of code for sorting sequences with between three elements and eight elements — for every number of elements except four. And these shorter algorithms “do indeed lead to lower latency,” the paper points out, “as the algorithm length and latency are correlated.”

Their researchers created a program based on DeepMind’s AlphaZero program, which beat the world’s best players in chess and Go. That program trained solely by playing games against itself, getting better and better using a kind of massively automated trial-and-error that eventually determines the most optimal approach.

DeepMind’s researchers modified into a new coding-oriented program called AlphaDev, calling this an important next step. “With AlphaDev, we show how this model can transfer from games to scientific challenges, and from simulations to real-world applications,” they wrote on the DeepMind blog.  The newly-discovered sorting algorithms “contain new sequences of instructions that save a single instruction each time they’re applied. AlphaDev skips over a step to connect items in a way that looks like a mistake, but is actually a shortcut.”

….………………………………………………………………………………………………………………………………………………………..

Conclusions:

While many luminaries, such as Henry Kissinger, Eric Schmidt and Daniel Huttenlocher, have lauded Generative AI as the greatest invention since the printing press, the technology has yet to prove itself worthy of the enormous praise.  Their central thesis, that a computer program could “transform the human cognitive process” in a way tantamount to the Enlightenment, is a huge stretch.

Gary Marcus, a well-known professor and frequent critic of A.I. technology, said that OpenAI hasn’t been transparent about the data its uses to develop its systems. He expressed doubt in CEO Sam Altman’s prediction that new jobs will replace those killed off by A.I.

“We have unprecedented opportunities here but we are also facing a perfect storm of corporate irresponsibility, widespread deployment, lack of adequate regulation and inherent unreliability,” Dr. Marcus said.

The promise and potential of Generative AI will not be realized for many years.  Think of it as a “research work in progress” with many twists and turns along the way.

….………………………………………………………………………………………………………………………………..

References:

https://www.cbinsights.com/research/generative-ai-unicorns-valuations-revenues-headcount/

https://pitchbook.com/news/articles/Amazon-Bedrock-generative-ai-q1-2023-vc-deals

Curmudgeon/Sperandeo:  Impact of Generative AI on Jobs and Workers

Generative AI in telecom; ChatGPT as a manager? ChatGPT vs Google Search

Generative AI could put telecom jobs in jeopardy; compelling AI in telecom use cases