Highlights:

  • Google also announced an ambitious project to develop a single AI language model that supports the world’s 1,000 most spoken languages.

At a Google AI event held at the company’s offices at Pier 57 in New York City, Google unveiled several advances in Artificial Intelligence (AI), including generative AI, language translation, health AI, and disaster management.

The event also covered Google's efforts to build responsible AI, particularly around control and safety, identifying AI-generated content, and "creating for everyone."

Google CEO Sundar Pichai said, "Like any transformational technology, we see so much opportunity ahead and are committed to making sure the technology is built in service of helping people," urging the audience to reimagine how technology can be helpful in people's lives.

Furthermore, Pichai highlighted the risks and difficulties associated with AI. He stated, “That’s why Google is focused on responsible AI from the beginning, publishing AI principles which prioritize the safety and privacy of people over anything else.”

Google debuts Imagen Video — Phenaki combo

Douglas Eck, principal scientist at Google Research and research director of Google's Brain Team, walked through a range of Google generative AI announcements, including the company's cautious, slower (relative to DALL-E 2 or Stability AI) approach to releasing its text-to-image AI systems.

While Google Imagen is not yet available to the general public, the company said it would add a restricted version of the technology to its AI Test Kitchen app (which demonstrated LaMDA earlier this year) as a way to collect early feedback. Google showed a demo, City Dreamer, which lets users conjure images of a city built around a theme, such as pumpkins.

In addition, building on its text-to-video work unveiled last month, Google released the first rendering of a video that combines both of the company's complementary text-to-video research methodologies, Imagen Video and Phenaki. The result pairs Phenaki's ability to generate video from a sequence of text prompts with Imagen Video's high-resolution detail.

Eck said, "I think it is amazing that we can talk about telling long-form stories like this with super-resolution video, not just from one prompt but a sequence of prompts, with a new way of storytelling." He added that he was excited about how filmmakers and video storytellers might use this technology.

Other generative AI advances

On the text side, Eck also presented the LaMDA dialogue engine and the Wordcraft Writers Workshop, which challenged established authors to write experimental fiction using LaMDA as a tool.

Eck stated that Google would shortly publish a research paper on this topic.

Eck said, "One clear finding is that using LaMDA to write full stories is a dead end. It's more useful to use LaMDA to add spice." He added that the user interface also has to be right, serving as a "text editor with a purpose."

Eck also highlighted Google's efforts to use AI to generate code, along with recently introduced AudioLM research, which extends any input audio clip without needing a musical score. He also pointed to DreamFusion, the just-announced text-to-3D system that combines Imagen with NeRF's 3D rendering capabilities.

Google is building a universal speech translator

After reviewing Google's progress in language AI research, Google Brain lead Zoubin Ghahramani outlined the company's endeavor to represent the diversity of the world's languages and its ambitious attempt to build a model that supports the world's 1,000 most spoken languages.

In addition, Google asserts that it is training a universal speech model on more than 400 languages, claiming that this is the “largest language model coverage seen in a speech model today.”

A strong focus on responsible AI

The AI announcements, which included remarks from James Manyika, SVP at Google-Alphabet, and Marian Croak, VP of engineering at Google, addressed Google's emphasis on responsible AI.

Croak said, “I think if we’re going to be leaders, it’s imperative that we push state-of-the-art on responsible AI technology. I’m passionate about wanting to discover ways to make things work in practice.”