Alexander Waibel (born 2 May 1956 in Heidelberg, Germany) is a professor of Computer Science at Carnegie Mellon University[1] and Karlsruhe Institute of Technology (KIT).[2] Waibel’s research focuses on automatic speech recognition, translation and human-machine interaction. His work has introduced cross-lingual communication systems, such as consecutive and simultaneous interpreting systems on a variety of platforms.[3] In fundamental research on machine learning, he is known for the Time Delay Neural Network (TDNN),[4] the first Convolutional Neural Network (CNN) trained by gradient descent, using backpropagation. Alex Waibel introduced the TDNN in 1987 at ATR[5] in Japan.[6]

Alex Waibel
Alex Waibel (2018)
Born2 May 1956
Heidelberg, Germany
AwardsIEEE Senior Best Paper Award (1990), Allen Newell Award for Research Excellence (2002), Antonio Zampolli Prize (2014), Meta Prize (2011 & 2016), James L. Flanagan Speech and Audio Processing Award (2023)
Academic background
Alma materMassachusetts Institute of Technology (BS), Carnegie Mellon University (MS, PhD)
Doctoral advisorRaj Reddy
Academic work
DisciplineComputer Science
Sub-disciplineArtificial Intelligence, Machine Learning, Deep Learning
InstitutionsCarnegie Mellon University, Karlsruhe Institute of Technology
Notable studentsLaurence Devillers

Early life and education

edit

Alex Waibel spent part of his schooling in Barcelona, before entering the humanistisches Gymnasium in Ludwigshafen. From 1976 to 1979, he studied Electrical Engineering and Computer Science at the Massachusetts Institute of Technology. He was presented with the Guillamin Award for the best undergraduate thesis in 1979. In the same year, he joined Carnegie Mellon University, where he received an MS degree in 1980 and PhD degrees in computer science and cognitive science in 1986.

Academic career and research

edit

Waibel is the director of interACT,[7] the International Center for Advanced Communication Technologies at Karlsruhe Institute of Technology. He was one of the founders of C-STAR,[8] an international consortium for speech translation research, and served as its chairman from 1998 to 2000. In 2003, C-STAR evolved into IWSLT, the International Conference on Spoken Language Translation. Waibel has served as Chairman of its Steering Committee since its inception[1]. He directed and coordinated several multisite research programs in Europe and the US, including the CHIL program[9] (FP-6 Integrated Project on multimodality) in Europe and NSF-ITR project STR-DUST (the first domain independent speech translation project) in the U.S. He was project coordinator of the IP EU-BRIDGE,[10] funded by the EC (2012-2014).

In C-STAR, his team developed the JANUS speech translation system, the first American and European Speech Translation system, and in 2005, the first real-time simultaneous speech translation system for lectures. He developed with his lab several multimodal systems including face tracking, lip reading, emotion recognition from speech, perceptual meeting rooms, meeting recognizers, meeting browsers, and multimodal dialog systems for humanoid robots.

He demonstrated the first automatic interpreting services at the European Parliament in 2012.[11]

From 2019 to 2023, he directed OML (Organic Machine Learning) funded by the Federal Ministry of Education and Research (Germany), a fundamental research project to develop incremental and interactive machine learning, aiming to help AI better handle surprise in language and robotics.

Entrepreneurship

edit

In the areas of speech, speech translation, and multimodal interfaces Waibel holds several patents[12] and has founded and co-founded successful commercial ventures. He was the founder and chairman of Mobile Technologies, LLC, maker of the Jibbigo mobile speech-to-speech app to translate speech on a phone.

In 2005, Waibel unveiled the world's first automatic simultaneous translation service at a press conference at Carnegie Mellon University[13] stating that "the lecture translator automatically records, transcribes and translates the speech of a lecturer in real-time, and students can follow the lecture in their language on their PC or mobile phone."[14] Deployed in 2012,[15] it serves foreign students[16] as the pioneering service of its kind.

In 2013, Jibbigo was acquired by Facebook Inc. and Waibel joined the company to start the Language Technology Group which became part of Facebook's broader Applied Machine Learning efforts.[17] He was co-founder and director of MultiModal Technologies, Inc.[18] and M*Modal, specializing in medical records, which merged with 3M in 2019.[19]

In 2015, he cofounded KITES GmbH, to deploy simultaneous speech translation services to Universities and to the European Parliament. KITES was acquired by Zoom in 2021[2] and now delivers automatic subtitling and simultaneous translation during Zoom video conferencing calls. Waibel serves as Research Fellow at Zoom and on Advisory Boards in related enterprises.

In October 2018, Waibel closed out a successful legal case[20] against Wikimedia Foundation[21] citing German libel laws.[22]

Awards and honours

edit

Alex Waibel is a recipient of the IEEE Senior Best Paper Award for work on the TDNN (1990). He was awarded the Alcatel-SEL "Forschungspreis Technische Kommunikation" in 1994 for his work on computer speech translation systems. In 2002, he received the Allen Newell Award for Research Excellence and the Meta Prize Archived 2016-09-02 at the Wayback Machine in 2011 with Jibbigo Mobile Translators for outstanding mobile voice translators bringing speech translation to mobile devices. In 2014, he was the recipient of the Antonio Zampolli Prize for "outstanding contributions to the advancement of Language Resources and Language Technology Evaluation within Human Language Technology", LREC.[23] With InterACT, he was awarded a second Meta Prize Archived 2016-09-02 at the Wayback Machine for "Outstanding and Innovative Contributions to Cross-Lingual and Multilingual Communication Technologies" (2016). He received the Sustained Accomplishment Award of the ACM-ICMI for his work on multimodal interfaces (2019).[24] In 2023, he became the 21st honoree to receive the IEEE James L. Flanagan Speech and Audio Processing Award for "pioneering work on speech translation and supporting technologies".[25]

Waibel has been a Life Fellow of the IEEE, a Fellow of the International Speech Communication Association (ISCA) and a Member of the National Academy of Sciences of Germany, Leopoldina[26] since 2017. In 2023, Waibel was inducted as Fellow into the Explorers Club citing aviation expeditions and deep sea exploration.[27]

References

edit
  1. ^ "Alex Waibel Biography". www.cs.cmu.edu. Retrieved 2024-06-29.
  2. ^ "KIT". KIT. Retrieved 2024-04-22.
  3. ^ "Sprachbarrieren durchbrechen: Traum oder Wirklichkeit?" (PDF). Leopoldina. 2015.
  4. ^ "Phoneme recognition using time delay neural networks". Université de Toronto. March 1989. Retrieved 29 June 2024.
  5. ^ "Phoneme Recognition Using Time- Delay Neural Networks" (PDF). 30 October 1987. Retrieved 29 June 2024.
  6. ^ Waibel, Alex (December 1987). "Phoneme Recognition Using Time Delay Neural Networks SP87" (PDF). ATR.
  7. ^ "InterACT". interact.kit.edu. 30 September 2016. Retrieved July 7, 2024.
  8. ^ (inaktiv), Schweizer, Dorothea (28 August 2012). "KIT - C-STAR". isl.anthropomatik.kit.edu.{{cite web}}: CS1 maint: multiple names: authors list (link)
  9. ^ CHIL - Computers In the Human Interaction Loop. doi:10.1007/978-1-84882-054-8_1. ISBN 978-1-84882-054-8. Retrieved 2024-04-23.
  10. ^ Daroussi, Younes. "EU-BRIDGE". www.eu-bridge.eu.
  11. ^ 2012: Alex Waibel presenting the Lecture Translator at the European Parliament (English). Retrieved 2024-04-25 – via www.youtube.com.
  12. ^ "Alex Waibel Inventions, Patents and Patent Applications - Justia Patents Search". patents.justia.com.
  13. ^ "Demo of breakthroughs in cross lingual communication and speech-to-speech translation". The Interactive Systems Lab. March 2021. Retrieved 2024-06-27.
  14. ^ Waibel, Alex (2012). "Simultaneous Interpretation by Machines - Simultanübersetzung durch Maschinen - the world's first automatic simultaneous translation service at a university". Youtube. Retrieved 2019-03-26.
  15. ^ United States Patent
  16. ^ "Simultanübersetzung durch Maschinen". YouTube. 12 June 2012.
  17. ^ "Facebook to acquire Pittsburgh-based Mobile Technologies | TribLIVE.com". archive.triblive.com.
  18. ^ "MultiModal Technologies Inc - Company Profile and News". Bloomberg.com. Retrieved 2024-04-28.
  19. ^ "3M Completes Acquisition of M*Modal's Technology Business". 3M News Center. Retrieved 2024-04-28.
  20. ^ "Urteil im Fall Waibel" (PDF). Wikimedia Foundation. September 12, 2018.
  21. ^ "Raue LLP successful against Wikipedia". October 31, 2018.
  22. ^ "A German court forced us to remove part of a Wikipedia article's 'history.' Here's what that means". Wikimedia Foundation. April 11, 2019.
  23. ^ "And the winner is..." lrec2014.lrec-conf.org.
  24. ^ "ICMI 2019 Awards". icmi.acm.org.
  25. ^ "IEEE James L. Flanagan Speech and Audio Processing Award Recipients" (PDF). IEEE.
  26. ^ "List of Members". www.leopoldina.org.
  27. ^ "Waibel Named Explorers Club Fellow". Carnegie Mellon School of Computer Science. Retrieved 2024-01-20.
  NODES
News 2
Story 1