Prototype telephone for the Network Voice Protocol, signed by John Makhoul

John Makhoul is a Lebanese-American computer scientist who works in the field of speech and language processing. Dr. Makhoul's work on linear predictive coding was used in the establishment of the Network Voice Protocol, which enabled the transmission of speech signals over the ARPANET.[1] Makhoul is recognized in the field for his vital role in the areas of speech and language processing, including speech analysis, speech coding, speech recognition and speech understanding. He has made a number of significant contributions to the mathematical modeling of speech signals, including his work on linear prediction, and vector quantization. His patented work on the direct application of speech recognition techniques for accurate, language-independent optical character recognition (OCR) has had a dramatic impact on the ability to create OCR systems in multiple languages relatively quickly.[2]

Dr. Makhoul is a Chief Scientist at BBN Technologies, where he has led several successful research projects including the DARPA GALE program.[3]

Early life and education

Makhoul was born in Deirmimas, a village in southern Lebanon. He did his early schooling in Lebanon. During his high school years, he spent one year as an exchange student in a high school in Foley, Minnesota. He went to college at the American University of Beirut, where he graduated with a Bachelor of Engineering degree in Electrical Engineering in the year 1964. Makhoul then received his Master of Science degree in Electrical Engineering from Ohio State University in 1965, and finished his PhD from MIT in the year 1970. Makhoul has since been working at BBN Technologies.[4][5]

Awards and honors

Throughout his career, Makhoul has received several awards and honors. He is a Fellow of the IEEE for contributions to the theory of linear prediction and its applications to spectral estimation, speech analysis and data compression[6] and a Fellow of the Acoustical Society of America.[7][8] In 2013, he became a Fellow of the International Speech Communication Association (ISCA).[9]

Makhoul's 1975 IEEE Proceedings paper on linear prediction was named a "Citation Classic" by the Institute for Scientific Information. His other honors include the 1978 IEEE Senior Award, the 1982 IEEE Technical Achievement Award, the 1988 Society Award of the IEEE Signal Processing Society, and the 2000 IEEE Third Millennium Medal.[10]

In 2009, Makhoul was awarded the IEEE James L. Flanagan Speech and Audio Processing Award, which is awarded for an outstanding contribution to the advancement of speech and/or audio signal processing.[11]

In 2016, he received the ISCA Medal for "leadership and extensive contributions to speech and language processing ".[12]

References

  1. "Linear Predictive Coding and the Internet Protocol, A survey of LPC and a History of Realtime Digital Speech on Packet Networks" (PDF).
  2. "Citations for ISCA Medalists".
  3. Anderson, Nate. "Defense Department funds massive speech recognition and translation program".
  4. "Understanding speech: an interview with John Makhoul". IEEE Signal Processing Magazine. 22 (3): 76–79. 2005-05-09. doi:10.1109/MSP.2005.1425901. ISSN 1053-5888.
  5. "BBN Technologies' John Makhoul, Pioneer in Speech Signal Processing, Receives 2009 IEEE James L. Flanagan Speech and Audio Processing Award" (Press release). Retrieved 23 January 2018.
  6. "IEEE Fellows 1980 | IEEE Communications Society".
  7. "IEEE Fellows 1980".
  8. "New Fellows of the Acoustical Society of America—65 (3–6), 851(N), 1071(N), 1344(N), 1591(N)". The Journal of the Acoustical Society of America. 65 (3): 851–851. 1979-03-01. doi:10.1121/1.382511. ISSN 0001-4966.
  9. "ISCA Fellows, 2013".
  10. "John Makhoul, BBN Technologies Chief Scientist, Awarded IEEE's Highest Award in Speech" (Press release).
  11. "IEEE James L. Flanagan Speech and Audio Processing Award Recipients". Institute of Electrical and Electronics Engineers (IEEE).
  12. "ISCA Medalists".
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.