8 Google Employees Invented Modern AI. Here’s the Inside Story

Date:

Share:


The last two weeks before the deadline were frantic. Though officially some of the team still had desks in Building 1945, they mostly worked in 1965 because it had a better espresso machine in the micro-kitchen. “People weren’t sleeping,” says Gomez, who, as the intern, lived in a constant debugging frenzy and also produced the visualizations and diagrams for the paper. It’s common in such projects to do ablations—taking things out to see whether what remains is enough to get the job done.

“There was every possible combination of tricks and modules—which one helps, which doesn’t help. Let’s rip it out. Let’s replace it with this,” Gomez says. “Why is the model behaving in this counterintuitive way? Oh, it’s because we didn’t remember to do the masking properly. Does it work yet? OK, move on to the next. All of these components of what we now call the transformer were the output of this extremely high-paced, iterative trial and error.” The ablations, aided by Shazeer’s implementations, produced “something minimalistic,” Jones says. “Noam is a wizard.”

Vaswani recalls crashing on an office couch one night while the team was writing the paper. As he stared at the curtains that separated the couch from the rest of the room, he was struck by the pattern on the fabric, which looked to him like synapses and neurons. Gomez was there, and Vaswani told him that what they were working on would transcend machine translation. “Ultimately, like with the human brain, you need to unite all these modalities—speech, audio, vision—under a single architecture,” he says. “I had a strong hunch we were onto something more general.”

In the higher echelons of Google, however, the work was seen as just another interesting AI project. I asked several of the transformers folks whether their bosses ever summoned them for updates on the project. Not so much. But “we understood that this was potentially quite a big deal,” says Uszkoreit. “And it caused us to actually obsess over one of the sentences in the paper toward the end, where we comment on future work.”

That sentence anticipated what might come next—the application of transformer models to basically all forms of human expression. “We are excited about the future of attention-based models,” they wrote. “We plan to extend the transformer to problems involving input and output modalities other than text” and to investigate “images, audio and video.”

A couple of nights before the deadline, Uszkoreit realized they needed a title. Jones noted that the team had landed on a radical rejection of the accepted best practices, most notably LSTMs, for one technique: attention. The Beatles, Jones recalled, had named a song “All You Need Is Love.” Why not call the paper “Attention Is All You Need”?

The Beatles?

“I’m British,” says Jones. “It literally took five seconds of thought. I didn’t think they would use it.”

They continued collecting results from their experiments right up until the deadline. “The English-French numbers came, like, five minutes before we submitted the paper,” says Parmar. “I was sitting in the micro-kitchen in 1965, getting that last number in.” With barely two minutes to spare, they sent off the paper.



Source link

━ more like this

Living alone for the first time in Bristol – London Business News | Londonlovesbusiness.com

Moving out and living alone for the first time is a significant milestone—a mix of excitement, nerves, and newfound freedom. It’s a chance...

Google adds Spanish and French to NotebookLM in huge language update

NotebookLM is one of Google’s lesser-used AI products but it introduced a feature that’s becoming increasingly popular — Audio Overviews. The company already...

South African equities momentum growth ahead of key trade data – London Business News | Londonlovesbusiness.com

South African equities are set for further upside, extending gains from Tuesday’s rally, with the JSE FTSE Top 40 Index...

Finance leaders are ‘losing visibility as expense management grows more complex’ – London Business News | Londonlovesbusiness.com

ExpenseIn, a leading expense management solution provider and part of the AccountsIQ Group, has today warned that many businesses are...

Google I/O 2025: Everything you need to know

Table of Contents Table of Contents Google I/O 2025: When will it happen? How and where can I watch Google I/O 2025? Google I/O 2025: What to...
spot_img