kb/Ashish_Vaswani-0.md at 2fbd85a83fe048bfb2e2feab15b77d5a79c2afae

turtle89431 d7ea9dde97 Scrape wikipedia-science: 21183 new, 4876 updated, 26697 total (kb-cron)

2026-05-05 09:49:51 -07:00

2.0 KiB

Raw Blame History

title	chunk	source	category	tags	date_saved	instance
Ashish Vaswani	1/1	https://en.wikipedia.org/wiki/Ashish_Vaswani	reference	science, encyclopedia	2026-05-05T16:48:10.620477+00:00	kb-cron

Ashish Vaswani (born 1986) is an Indian computer scientist. Vaswani conducted research at Google Brain and, earlier in his career, was affiliated with the Information Sciences Institute at the University of Southern California. Vaswani is a co-author of the 2017 paper "Attention Is All You Need", which introduced the Transformer neural network architecture. The Transformer model has been used in the development of subsequent NLP models BERT, ChatGPT, and their successors.

== Career == Vaswani completed his engineering in Computer Science from Birla Institute of Technology, Mesra (BIT Mesra) in 2002. In 2004, he enrolled at the University of Southern California for graduate studies. He earned his PhD in Computer Science at the University of Southern California supervised by David Chiang. During his research career at Google, Vaswani was part of the Google Brain team, where he conducted the work leading to the 'Attention Is All You Need' publication. Prior to joining Google, he was affiliated with the Information Sciences Institute at the University of Southern California. After Google, Vaswani co-founded Adept AI, a machine learning-focused startup that developed AI agents and tools for software automation. He has since left the company. He is currently co-founder and CEO of Essential AI.

== Notable works == Vaswani's most notable paper, "Attention Is All You Need", was published in 2017. The paper introduced the Transformer model, which uses self-attention mechanisms instead of recurrence for sequence-to-sequence tasks. The Transformer architecture has become foundational to modern language models and NLP systems, including BERT (2018), GPT-2, GPT-3 (2019–2020) and many more recent models. The "Attention Is All You Need" paper is among the most cited papers in machine learning.

== References ==

2.0 KiB Raw Blame History Unescape Escape

2.0 KiB

Raw Blame History