Hain, Thomas, Professor

Professor Thomas Hain

School of Computer Science

Professor of Speech and Audio Technology

Director of CDT in Speech and Language Technologies

Director of Liveperson Centre

Member of the Speech and Hearing (SpandH) research group

t.hain@sheffield.ac.uk

Regent Court (DCS)

Full contact details

Professor Thomas Hain
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP

Profile

Thomas Hain obtained the degree 'Dipl.-Ing' in Electrical/Communication Engineering in 1994 from the University of Technology, Vienna. He then joined the Speech Technology Group at Philips Speech Processing, which he left in a senior position.

In 1997 he joined the Speech, Vision and Robotics Group at the Cambridge University Engineering Department as Research Associate and PhD Student. He took up a Lectureship at the SVR group in 2001.

In 2004 he joined the Speech and Hearing Group to work as Lecturer in Computer Science. He was promoted to Senior Lecturer in 2008 and Reader in 2011.

Research interests: Thomas' research covers many areas in natural language processing, speech, audio and multimedia technology, machine learning, and complex system optimisation and design.

His interests include: large vocabulary continuous speech recognition, non-linear methods in speech processing, low bit-rate speech coding, machine learning, multi-modal systems, image classification, microphone arrays, and system and resource optimisation.

Publications

Books

Young SJ, Evermann G, Gales MJF, Hain T, Kershaw D, Moore GL, Odell JJ, Ollason D, Povey D, Valtchev V & Woodland PC (2004) The HTK Book. Cambridge, England: Cambridge University Engineering Department.
Young S, Evermann G, Gales M, Hain T, Kershaw D, Xunying L, Moore G, Odell J, Ollason D, Povey D, Ragni A et al () The HTK Book (for HTK Version 3.5, documentation alpha version). Cambridge University Engineering Department.

Journal articles

Song H, Zhang L, Gao M, Zhang H, Hain T & Shan L (2025) . Scientific Reports, 15(1).
Hasan M, Jefferson N, Hain T & Dawson J (2022) . Computer Speech & Language, 74, 101339-101339.
Ravenscroft W, Goetze S & Hain T (2022) . Frontiers in Signal Processing, 2.
Shi Y, Huang Q & Hain T (2021) . Neural Networks, 142, 329-339.
El Hannani A, Errattahi R, Salmam FZ, Hain T & Ouahmane H (2021) . Journal of Big Data, 8.
Errattahi R, El Hannani A, Hain T & Ouahmane H (2019) . Computer Speech and Language, 55, 187-199.
Deena S, Hasan M, Doulaty M, Saz O & Hain T (2019) . IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(3), 572-582.
Saz Torralba O, Deena S, Doulaty M, Hasan M, Khaliq B, Milner R, Ng RWM, Olcoz J & Hain T (2018) . Multimedia Systems, 77(23), 30533-30550.
Ng W, Nicolao M & Hain T (2017) . Computer Speech and Language, 46, 327-342.
Saz O & Hain T (2017) . Computer Speech & Language, 41, 180-194.
Kamper H, De Wet F, Hain T & Niesler T (2014) . Computer Speech and Language, 28(6), 1255-1268.
Fox C & Hain T (2013) . ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 8086-8090.
Gibson M & Hain T (2012) . ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 4341-4344.
Lecorvé G, Dines J, Hain T & Motlicek P (2012) Supervised and unsupervised Web-based language model domain adaptation. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, 1, 182-185.
Gibson M & Hain T (2012) . IEEE Transactions on Audio, Speech and Language Processing, PP(99).
Furui S, Fiscus J, Friedland G & Hain T (2012) . IEEE Transactions on Audio, Speech and Language Processing, 20(2), 353-355.
Alharbi G & Hain T (2012) . 2012 IEEE Workshop on Spoken Language Technology SLT 2012 Proceedings, 398-403.
Hain T, Burget L, Dines J, Garner PN, Grezl F, el Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2011) . IEEE Transactions on Audio, Speech and Language Processing.
El Hannani A & Hain T (2010) . IEEE Signal Processing Letters, 17(1), 95-98.
Gibson M & Hain T (2010) . IEEE Transactions on Audio Speech and Language Processing, 18(6), 1269-1279.
Karafiát M, Burget L, Hain T & Černocký J (2008) Discriminative training of narrow band - wide band adapted systems for meeting recognition. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, 1217-1220.
Hain T, El Hannani A, Wrigley SN & Wan V (2008) Automatic speech recognition for scientific purposes - WebASR. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, 504-507.
Karafiát M, Burget L, Černocký J & Hain T (2007) Application of CMLLR in narrow band wide band adapted systems. International Speech Communication Association 8th Annual Conference of the International Speech Communication Association Interspeech 2007, 4, 2860-2863.
Renals S, Hain T & Bourlard H (2007) Recognition and understanding of meetings: the AMI and AMIDA projects. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding ASRU 2007 Proceedings, 238-247.
Hain T, Burget L, Dines J, Garau G, Wan V, Karafiat M, Vepa J & Lincoln M (2007) . ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 4, IV357-IV360.
Wan V & Hain T (2006) Strategies for language model web-data collection. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 1, I1069-I1072.
Hain T, Woodland PC, Evermann G, Gales MJF, Liu X, Moore GL, Povey D & Wang L (2006) Corrections to "Automatic Transcription of Conversational Telephone Speech". IEEE Trans. Speech Audio Process., 14, 727-727.
Hain T, Woodland PC, Evermann G, Gales MJF, Liu XY, Moore GL, Povey D & Wang L (2005) . IEEE Transactions on Speech and Audio Processing, 13(6), 1173-1185.
Hain T (2005) . Speech Communication, 46(2), 171-188.

Book chapters

Saenz JAL & Hain T (2021) , Lecture Notes in Computer Science (pp. 61-72). Springer International Publishing
Hain T & Garner PN (2012) , Multimodal Signal Processing (pp. 56-83). Cambridge University Press
Renals S & Hain T (2010) In Clark A, Fox C & Lappin S (Eds.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 299-332). Wiley-Blackwell
Moore D, Dines J, Doss MM, Vepa J, Cheng O & Hain T (2006) (pp. 285-296).
Carletta J, Ashby S, Bourban S, Guillemot M, Kronenthal M, Lathoud G, Lincoln M, McCowan I, Hain T, Kraaij W, Post W et al (2005) , Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science (pp. 28-39). Edinburgh: Springer.

Conference proceedings

Close G, Hong K, Hain T & Goetze S (2025) . Speech and Computer, Vol. 16187(Part 1) (pp 39-51). Szeged, Hungary, 13 October 2025 - 13 October 2025.
Park C & Hain T (2025) . Proceedings of Interspeech 2025 (pp 3663-3667). Rotterdam, The Netherlands, 17 August 2025 - 17 August 2025.
Park C, Lu C, Chen M & Hain T (2025) . ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 1-5). Hyderabad, India, 6 April 2025 - 6 April 2025.
Do C-T, Imai S, Doddipatla R & Hain T (2024) . 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 136-140). Lyon, France, 26 August 2024 - 26 August 2024.
Park C, Kang H & Hain T (2024) . Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 131-135). Lyon, France, 26 August 2024 - 26 August 2024.
Close G, Hain T & Goetze S (2024) . Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 21-25). Lyon, France, 26 August 2024 - 26 August 2024.
Sutherland R, Close G, Hain T, Goetze S & Barker J (2024) . Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 421-425). Lyon, France, 26 August 2024 - 26 August 2024.
Meghanani A & Hain T (2024) . Interspeech 2024 (pp 2835-2839). Kos, Greece, 1 September 2024 - 1 September 2024.
Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC (2024) . Proceedings of Interspeech 2024 (pp 4998-5002). Kos Island, Greece, 1 September 2024 - 1 September 2024.
Ma Z, Chen M, Zhang H, Zheng Z, Chen W, Li X, Ye J, Chen X & Hain T (2024) . Interspeech 2024 (pp 1580-1584). Kos Island, Greece, 1 September 2024 - 1 September 2024.
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models. ICASSP (pp 306-310)
Ahmad R, Farooq MU & Hain T (2024) . ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 11466-11470). Seoul, Korea, 14 April 2024 - 14 April 2024.
Meghanani A & Hain T (2024) . ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 12086-12090). Seoul, Korea, 14 April 2024 - 14 April 2024.
Close G, Ravenscroft W, Hain T & Goetze S (2024) . ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 351-355). Seoul, Korea, 14 April 2024 - 14 April 2024.
Farooq MU, Ahmad R & Hain T (2024) . Proceedings of 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 1-6). Taipei, Taiwan, 16 December 2023 - 16 December 2023.
Ravenscroft JW, Goetze S & Hain T (2024) . 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei, Taiwan, 16 December 2023 - 16 December 2023.
Meghanani A & Hain T (2024) . 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) Proceedings. Taipei, Taiwan, 16 December 2023 - 16 December 2023.
Islam E, Hain T & Nomo Sudro P (2024) . 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei, Taiwan, 16 December 2023 - 16 December 2023.
Park C, Chen M & Hain T (2024) Automatic speech recognition system-independent word error rate estimation. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) (pp 1979-1987). Torino, Italy, 20 May 2024 - 20 May 2024.
Meghanani A & Hain T (2024) . Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1 (pp 1959-1967). St. Julian’s, Malta, 17 March 2024 - 17 March 2024.
Iakovenko O & Hain T (2024) . Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp 5791-5800). Miami, Florida, USA, 12 November 2024 - 12 November 2024.
Ravenscroft J, Goetze S & Hain T (2023) . 2023 31st European Signal Processing Conference (EUSIPCO). Helsinki, Finland, 4 September 2023 - 4 September 2023.
Nomo Sudro P, Ragni A & Hain T (2023) . 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings (pp 271-275). Helsinki, Finland, 4 September 2023 - 4 September 2023.
Ollerenshaw A, Jalal MA & Hain T (2023) . 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings (pp 401-405). Helsinki, Finland, 4 September 2023 - 4 September 2023.
Close G, Hain T & Goetze S (2023) . Proceedings of 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). New Paltz, NY, USA, 22 October 2023 - 22 October 2023.
Close GL, Ravenscroft W, Hain T & Goetze S (2023) . Proc. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023) (pp 33-38). Dublin, Ireland, 25 August 2023 - 25 August 2023.
Farooq MU & Hain T (2023) . Interspeech 2023 Proceedings (pp 5072-5076). Dublin, Ireland, 20 August 2023 - 20 August 2023.
Islam E, Park C & Hain T (2023) . 9th Workshop on Speech and Language Technology in Education (SLaTE) Proceedings (pp 151-155). Dublin, Ireland, 18 August 2023 - 18 August 2023.
Close G, Hain T & Goetze S (2023) PAMGAN+/-: Improving phase-aware speech enhancement performance via expanded discriminator training. AES Convention Europe 2023: 154th Audio Engineering Society Conference (pp 10656). Espoo, Helsinki, Finland, 13 May 2023 - 13 May 2023.
Ahmad R, Jalal MA, Umar Farooq M, Ollerenshaw A & Hain T (2023) Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation. Proceedings of ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
Ravenscroft W, Goetze S & Hain T (2023) . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
Close G, Ravenscroft W, Hain T & Goetze S (2023) . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
Park B, Park C & Li G (2022) . 2022 29th IEEE International Conference on Electronics, Circuits and Systems (ICECS) (pp 1-4), 24 October 2022 - 26 October 2022.
Ravenscroft W, Goetze S & Hain T (2022) Receptive field analysis of temporal convolutional networks for monaural speech dereverberation. Proceedings of 30th European Signal Processing Conference (EUSIPCO 2022) (pp 80-84). Belgrade, Serbia, 29 August 2022 - 29 August 2022.
Ollerenshaw A, Jalal MA & Hain T (2022) Insights of neural representations in multi-banded and multi-channel convolutional transformers for end-to-end ASR. Proceedings of 2022 30th European Signal Processing Conference (EUSIPCO) (pp 434-438). Belgrade, Serbia, 29 August 2022 - 29 August 2022.
Close G, Hain T & Goetze S (2022) MetricGAN+/-: increasing robustness of noise reduction on unseen data. Proceedings of 2022 30th European Signal Processing Conference (EUSIPCO) (pp 165-169). Belgrade, Serbia, 29 August 2022 - 29 August 2022.
Ravenscroft W, Goetze S & Hain T (2022) . Proceedings of 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg, Germany, 5 September 2022 - 5 September 2022.
Farooq MU, Haniya Narayana DA & Hain T (2022) . Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association (pp 4850-4854). Incheon, Korea, 18 September 2022 - 18 September 2022.
Farooq MU & Hain T (2022) . Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association (pp 3849-3853). Incheon, Korea, 18 September 2022 - 18 September 2022.
Close G, Hollands S, Hain T & Goetze S (2022) . Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association (pp 3483-3487). Incheon, Korea, 18 September 2022 - 18 September 2022.
Lopez Saenz JA & Hain T (2022) . ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7267-7271), 23 May 2022 - 27 May 2022.
Park C, Ahmad R & Hain T (2022) . ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 8587-8591). Singapore, Singapore, 23 May 2022 - 23 May 2022.
Saenz JAL, Jalal MA, Milner R & Hain T (2021) . 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 725-732), 13 December 2021 - 17 December 2021.
Huang S, Chen M, Xu Y, Ke D & Hain T (2021) . PRICAI 2021: Trends in Artificial Intelligence 18th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2021, Hanoi, Vietnam, November 8–12, 2021, Proceedings, Part II (pp 559-573). Hanoi, Vietnam (virtual), 8 November 2021 - 8 November 2021.
Huang Q & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6473-6477). Toronto, ON, Canada, 6 June 2021 - 11 June 2021.
Chen M, Shi Y & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, ON, Canada, 6 June 2021 - 6 June 2021.
Do C-T, Doddipatla R & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2021 (pp 6978-6982). Toronto, Ontario, Canada, 6 June 2021 - 6 June 2021.
Shi Y & Hain T (2021) . 2021 IEEE Spoken Language Technology Workshop (SLT) (pp 758-765). Shenzhen, China, 19 January 2021 - 22 January 2021.
Shi Y & Hain T (2021) . 2021 IEEE Spoken Language Technology Workshop (SLT) (pp 750-757). Shenzhen, China, 19 January 2021 - 19 January 2021.
Do C-T, Zhang S & Hain T (2021) . 2020 28th European Signal Processing Conference (EUSIPCO) (pp 321-325), 18 January 2021 - 21 January 2021.
Friedl K, Rizos G, Stappen L, Hasan M, Specia L, Hain T & Schuller B (2021) . Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp 5004-5009), August 2021 - August 2021.
Ollerenshaw A, Jalal MA & Hain T (2021) . Interspeech 2021 (pp 4079-4083). Brno, Czechia, 30 August 2021 - 30 August 2021.
Shi Y, Huang Q & Hain T (2020) . The Speaker and Language Recognition Workshop (Odyssey 2020) (pp 451-458). Tokyo, Japan, 1 November 2020 - 5 November 2020.
Chen M & Hain T (2020) . Interspeech 2020 (pp 4866-4870). Shanghai, China, 25 October 2020 - 25 October 2020.
Jalal MA, Milner R & Hain T (2020) . Proceedings of Interspeech 2020 (pp 4113-4117). Shanghai, China (Online), 25 October 2020 - 25 October 2020.
Jalal MA, Milner R, Hain T & Moore RK (2020) . Interspeech 2020 (pp 4084-4088). Shanghai, China, 25 October 2020 - 29 October 2020.
Stappen L, Rizos G, Hasan M, Hain T & Schuller BW (2020) . Interspeech 2020 (pp 1808-1812). Shanghai, China, 25 October 2020 - 25 October 2020.
Shi Y, Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 2992-2996). Shanghai, China, 25 October 2020 - 29 October 2020.
Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 4611-4615). Shanghai, China, 25 October 2020 - 29 October 2020.
Shi Y, Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 1530-1534). Shanghai, China, 25 October 2020 - 29 October 2020.
Shi Y, Huang Q & Hain T (2020) . ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7579-7583). Barcelona, Spain (virtual), 4 May 2020 - 8 May 2020.
Sailor HB, Deena S, Jalal MA, Lileikyte R & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 980-987), 14 December 2019 - 18 December 2019.
Jalal MA, Moore RK & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 853-859), 14 December 2019 - 18 December 2019.
Milner R, Jalal MA, Ng RWM & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 304-311), 14 December 2019 - 18 December 2019.
Doulaty M & Hain T (2019) . Interspeech 2019 (pp 3228-3232). Graz, Austria, 15 September 2019 - 15 September 2019.
Jalal MA, Loweimi E, Moore RK & Hain T (2019) . Proceedings of Interspeech 2019 (pp 1701-1705). Graz, Austria, 15 September 2019 - 15 September 2019.
Hain T & Schuller B (2019) Message from the technical program chairs. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, Vol. 2019-September (pp 13-15)
Loweimi E, Barker JP & Hain T (2018) . 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings. Calgary, Alberta, Canada, 15 April 2018 - 15 April 2018.
Nicolao M, Sanders M & Hain T (2018) . Proceedings of Interspeech 2018 (pp 1666-1670). Hyderabad, India, 2 September 2018 - 2 September 2018.
Errattahi R, Deena S, El Hannani A, Ouahmane H & Hain T (2018) . 2018 IEEE Spoken Language Technology Workshop (SLT) (pp 190-196), 18 December 2018 - 21 December 2018.
Errattahi R, El Hannani A, Hain T & Ouahmane H (2018) . 2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) (pp 1-6), 21 March 2018 - 24 March 2018.
Loweimi E, Barker J & Hain T (2018) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2018-S (pp 696-700). Hyderabad, India, 2 September 2018 - 2 September 2018.
Deena S, Ng RWM, Madhyastha P, Specia L & Hain T (2018) . Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop. Okinawa, Japan
Deena S, Ng RWM, Madhyastha P, Specia L & Hain T (2017) . Proceedings of INTERSPEECH 2017: Conference of the International Speech Communication Association (pp 2715-2719). Stockholm, 20 August 2017 - 20 August 2017.
Loweimi E, Barker J & Hain T (2017) . Interspeech 2017 (pp 2466-2470). Stockholm, 20 August 2017 - 20 August 2017.
Loweimi E, Barker J, Torralba OS & Hain T (2017) . Proceedings of the Annual Conference of the International Speech Communication Association. Stockholm, 20 August 2017 - 20 August 2017.
Ng WM, Kwan ACM, Lee T & Hain T (2017) . 2017 IEEE International Conference on Acoustics, Speech and Signal Processing. New Orleans, USA
Loweimi E, Barker J & Hain T (2017) . ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp 5310-5314)
Milner R & Hain T (2017) . Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing (pp 4925-4929)
Wu C, Ng RWM, Torralba OS & Hain T (2017) . International Conference on Systems, Signals and Image Processing (IWSSIP). Poznań, Poland, 22 May 2017 - 22 May 2017.
Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2016) . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding Asru 2015 Proceedings (pp 624-631)
Olcoz J, Saz Torralba O & Hain T (2016) . Proceedings of Interspeech 2016 (pp 2110-2114). San Francisco, CA, 8 September 2016 - 8 September 2016.
Hain T, Christian J, Saz O, Deena S, Hasan M, Ng RWM, Milner R, Doulaty M & Liu Y (2016) . Proceedings of Interspeech 2016. San Francisco, CA, 8 September 2016 - 8 September 2016.
Casanueva I, Hain T, Nicolao M & Green P (2016) Using phone features to improve dialogue state tracking generalisation to unseen states. Proceedings of SIGDIAL 2016. Los Angeles, USA, 13 September 2016 - 13 September 2016.
Loweimi E, Barker J & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, Vol. 08-12-September-2016 (pp 3798-3802)
Ng R, Hain T & Chettri B (2016) Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting (pp 2939-2943), 9 September 2016 - 12 September 2016.
Deena S, Hasan M, Doulaty M, Saz O & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2343-2347). San Francisco, USA, 8 September 2016 - 8 September 2016.
Al-Shareef S & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1345-1349). San Francisco, USA, 8 September 2016 - 8 September 2016.
Liu Y, Fox C, Hasan M & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 3833-3837). San Francisco, USA, 8 September 2016 - 8 September 2016.
Casanueva I, Hain T & Green P (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2726-2730). San Francisco, USA, 8 September 2016 - 8 September 2016.
Milner R & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2185-2189). San Francisco, USA, 8 September 2016 - 8 September 2016.
Doulaty M, Saz O, Ng RWM & Hain T (2016) . Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech). San Francisco, 8 September 2016 - 8 September 2016.
Ng W, Nicolao M, Saz O, Hasan M, Chettri B, Doulaty M, Lee T & Hain T (2016) . Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016 (pp 181-187). Bilbao, Spain, 21 June 2016 - 21 June 2016.
Errattahi R, El Hannani A, Ouahmane H & Hain T (2016) . 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA). Agadir, Morocco
Nicolao M, Christensen H, Cunningham S, Green P & Hain T (2016) A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. Proceedings of LREC 2016. Portorož, Slovenia, 24 May 2016 - 24 May 2016.
Ng RWM, Shah K, Specia L & Hain T (2016) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-M (pp 6120-6124). Shanghai, 20 March 2016 - 20 March 2016.
Milner R & Hain T (2016) . 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Shanghai, China, 20 March 2016 - 20 March 2016.
Alharbi G & Hain T (2016) The OpenCourseWare metadiscourse (OCWMD) corpus. Proceedings of the 10th International Conference on Language Resources and Evaluation LREC 2016 (pp 1770-1776)
Milner R, Saz O, Deena S, Doulaty M, Ng R & Hain T (2015) . Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (pp 632-638). Scottsdale, AZ, 13 December 2015 - 13 December 2015.
Loweimi E, Barker J & Hain T (2015) Source-filter Separation of Speech Signal in the Phase Domain. 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Vols 1-5 (pp 598-602). Dresden, Germany, 6 September 2015 - 6 September 2015.
Loweimi E, Doulaty M, Barker J & Hain T (2015) . USES 2015 - The University of Sheffield Engineering Symposium. The Octagon Centre, University of Sheffield, 24 June 2015 - 24 June 2015.
Doulaty Bashkand M, Saz O & Hain T (2015) Unsupervised Domain Discovery Using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 3640-3644). Dresden, Germany, 6 September 2015 - 6 September 2015.
Doulaty M, Saz O, Ng RWM & Hain T (2015) . Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (pp 130-136). Scottsdale, AZ, 13 December 2015 - 13 December 2015.
Bell P, Gales M, Hain T, Kilgour J, Lanchantin P, Liu A, McParland A, Renals S, Saz O, Wester M & Woodland P (2015) . Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (pp 687-693). Scottsdale, AZ, 13 December 2015 - 13 December 2015.
Loweimi E, Doulaty M, Barker J & Hain T (2015) (pp 173-184)
Doulaty Bashkand M, Saz O & Hain T (2015) Data-Selective Transfer Learning for Multi-Domain Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2897-2901). Dresden, Germany, 6 September 2015 - 6 September 2015.
Ng RWM, Shah K, Aziz W, Specia L & Hain T (2015) . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5226-5230), 19 April 2015 - 24 April 2015.
Liu Y, Karanasou P & Hain T (2015) . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE Xplore, 19 April 2015 - 19 April 2015.
Nicolao M, Beeston AV & Hain T (2015) . Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp 5351-5355). Brisbane, Australia, 19 April 2015 - 19 April 2015.
Liu Y, Karanasou P & Hain T (2015) An investigation into speaker informed DNN front-end for LVCSR. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 4300-4304)
Ng RWM, Shah K, Aziz W, Specia L & Hain T (2015) Quality estimation for ASR k-best list rescoring in spoken language translation. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 5226-5230)
Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2015) The 2015 Sheffield system for transcription of Multi-Genre Broadcast media. ASRU (pp 624-631)
Christensen H, Nicolao M, Cunningham S, Green P, Deena S & Hain T (2015) . IET International Conference on Technologies for Active and Assisted Living (TechAAL) (pp 6)
Casanueva I, Hain T, Christensen H, Marxer R & Green P (2015) . Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue (pp 12-21), September 2015 - September 2015.
AlHarbi G & Hain T (2015) Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources. EDM (pp 524-527)
Loweimi E, Barker J & Hain T (2014) . The University of Sheffield Engineering Symposium Conference Proceedings, Vol. 1. The Octagon Centre, University of Sheffield
Zhang P, Liu Y & Hain T (2014) Semi-Supervised DNN Training in Meeting Recognition. Proceedings of. South Lake Tahoe, California and Nevada, USA, 7 December 2014 - 7 December 2014.
Ng RWM, Doulaty M, Doddipatla R, Aziz W, Shah K, Saz O, Hasan M, AlHarbi G, Specia L & Hain T (2014) The USFD SLT System for IWSLT 2014. Proceedings of the International Workshop on Spoken Language Translation. http://workshop2014.iwslt.org/64.php, 4 December 2014 - 4 December 2014.
Liu Y, Zhang P & Hain T (2014) . 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Zhang P, Liu Y & Hain T (2014) . 2014 IEEE Workshop on Spoken Language Technology SLT 2014 Proceedings (pp 141-146)
Saz O, Doulaty M & Hain T (2014) . Proceedings of the 2014 Spoken Language Technology (SLT) Workshop (pp 118-123)
Christensen H, Casanueva I, Cunningham S, Green P & Hain T (2014) . 2014 IEEE Spoken Language Technology Workshop (SLT) (pp 254-259), 7 December 2014 - 10 December 2014.
Saz O & Hain T (2014) . Proceedings of the 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Florence, Italy, 4 May 2014 - 4 May 2014.
Ng R, Cohn T & Hain T (2013) Adaptation of lecture speech recognition system with machine translation output. Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver, Canada
Saz O & Hain T (2013) Asynchronous factorisation of speaker and background with feature transforms in speech recognition. INTERSPEECH-2013 (pp 1238-1242). Lyon, France, 25 August 2013 - 25 August 2013.
Fox C, Liu Y, Zwyssig E & Hain T (2013) The Sheffield Wargames Corpus. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Vols 1-5 (pp 1115-1119)
Saz O & Hain T (2013) Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Vols 1-5 (pp 1237-1241)
Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Vols 1-5 (pp 1158-1162)
Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Vols 1-5 (pp 3609-3612)
Lanchantin P, Bell PJ, Gales MJF, Hain T, Liu X, Long Y, Quinnell J, Renals S, Saz O, Seigel MS, Swietojanski P et al (2013) Automatic Transcription of Multi-Genre Media Archives. CEUR Workshop Proceedings (pp 26-31). Marseille, France
Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1159-1163)
Fox C, Liu Y, Zwyssig E & Hain T (2013) The Sheffield Wargames Corpus. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon, France, 25 August 2013 - 29 August 2013.
Christensen H, Casanueva I, Cunningham S, Green P & Hain T (2013) HomeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. SLPAT 2013 4th Workshop on Speech and Language Processing for Assistive Technologies SLPAT 2013 Workshop Proceedings (pp 29-34)
Christensen H, Cunningham S, Fox C, Green P & Hain T (2012) A comparative study of adaptive, automatic recognition of disordered speech. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 2 (pp 1774-1777)
Al-Shareef S & Hain T (2012) CRF-based diacritisation of colloquial Arabic for automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 3 (pp 1822-1825)
Ng RWM, Hain T & Hirose K (2012) An alignment matching method to explore pseudosyllable properties across different corpora. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 1 (pp 862-865)
Kamper H, de Wet F, Hain T & Niesler T (2012) RESOURCE DEVELOPMENT AND EXPERIMENTS IN AUTOMATIC SOUTH AFRICAN BROADCAST NEWS TRANSCRIPTION. 3rd Workshop on Spoken Language Technologies for Under Resourced Languages (SLTU 2012) (pp 102-106)
Wrigley SN & Hain T (2011) Making an automatic speech recognition service freely available on the web. Interspeech’11
Tucker R, Fry D, Wan V, Wrigley S & Hain T (2011) Extending Audio Notetaker to Browse WebASR Transcriptions. Interspeech’11
Wrigley SN & Hain T (2011) Web-based automatic speech recognition service - webASR. Interspeech’11
Kempton T, Moore RK & Hain T (2011) Cross-language phone recognition when the target language phoneme inventory is not known. Interspeech’11. Florence
Al-Shareef S & Hain T (2011) An Investigation in Speech Recognition for Colloquial Arabic. Interspeech’11
Marino D & Hain T (2011) An Analysis of Automatic Speech Recognition with Multiple Microphones. Interspeech’11. Florence
Tucker R, Fry D, Wan V, Wrigley S & Hain T (2011) Extending Audio Notetaker to Browse WebASR Transcriptions. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3336-+)
Hain T, Burget L, Dines J, Garner PN, el Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. Interspeech’10 (pp 358-361)
Hain T & Renals S (2010) Meeting Recognition. Tutorial, Interspeech 2010
Hain T, Burget L, Dines J, Garner PN, El Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 358-361)
Garner PN, Dines J, Hain T, El Hannani A, Karafiát M, Korchagin D, Lincoln M, Wan V & Zhang L (2009) Real-time ASR from meetings. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 2119-2122)
Garner PN, Dines J, Hain T, El Hannani A, Karafiat M, Korchagin D, Lincoln M, Wan V & Zhang L (2009) Real-Time ASR from Meetings. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2067-+)
Hain T, El Hannani A, Wrigley SN & Wan V (2008) Automatic speech recognition for scientific purposes - webASR. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 504-507)
Renals S, Hain T & Bourlard H (2008) . 2008 Hands Free Speech Communication and Microphone Arrays Proceedings, HSCMA 2008 (pp 115-118)
Hain T, Burget L, Dines J, Garau G, Karafiat M, van Leeuwen D, Lincoln M & Wan V (2008) The 2007 AMI(DA) system for meeting transcription. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, Vol. 4625 (pp 414-428)
Renals S, Hain T & Bourlard H (2008) Interpretation of multiparty meetings the AMI and AMIDA projects. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 116-+)
Wan V, Dines J, El Hannani A & Hain T (2008) BOB: A LEXICON AND PRONUNCIATION DICTIONARY GENERATOR. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS (pp 217-220)
Gibson M & Hain T (2007) Temporal Masking for Unsupervised Minimum Bayes Risk Speaker Adaptation. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 1577-1580)
Karafiat M, Burget L, Hain T & Cernocky J (2007) Application of CMLLR in narrow band wide band adapted systems. Interspeech’07 (pp 282-285). Antwerp
Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepa J & Wan V (2007) The AMI system for the transcription of speech in meetings. 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3 (pp 357-360)
Al-Hames M, Hain T, Cernocky J, Schreiber S, Poel M, Muller R, Marcel S, van Leeuwen D, Odobez JM, Ba S, Bourlard H et al (2006) Audio-visual processing in meetings: Seven questions and current AMI answers. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 24-35)
Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepa J & Wan V (2006) The AMI meeting transcription system: Progress and performance. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 419-431)
Dines J, Vepa J & Hain T (2006) The segmentation of multi-channel meeting recordings for automatic speech recognition. Interspeech 2006 and 9th International Conference on Spoken Language Processing Interspeech 2006 ICSLP, Vol. 3 (pp 1213-1216)
Gibson M & Hain T (2006) Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 2406-2409)
Uraga E & Hain T (2006) Automatic Speech Recognition Experiments with Articulatory Data. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 353-356)
Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 1069-1072). Toulouse, FRANCE, 14 May 2006 - 19 May 2006.
Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 1069-1072)
Hain T, Burget L, Dines J, McCowan I, Garau G, Karafiat M, Lincoln M, Moore D, Wan V, Ordelman R & Renals S (2005) The development of the AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 344-356)
Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, McCowan I, Moore D, Wan V, Ordelman R & Renals S (2005) The 2005 AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 450-462)
Garau G, Renals S & Hain T (2005) Applying vocal tract length normalization to meeting recordings. 9th European Conference on Speech Communication and Technology (pp 265-268)
Hain T, Dines J, Garau G, Karafiat M, Moore D, Wan V, Ordelman R & Renals S (2005) Transcription of conference room meetings: An investigation. 9th European Conference on Speech Communication and Technology (pp 1661-1664)
McCowan I, Carletta J, Kraaij W, Ashby S, Bourban S, Flynn M, Guillemot M, Hain T, Kadlec J, Karaiskos V, Kronenthal M et al (2005) The AMI Meeting Corpus. 5th International Conference on Methods and Techniques in Behavioral Research
Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland PC (2004) Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, Vol. 1 (pp I249-I252)
Kim DY, Gales MJF, Chan HY, Woodland PC, Umesh S & Hain T (2004) Progress in Broadcast News English Transcription. EARS STT Technical Meeting 2004. Montreal, Canada
Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Jia B, Kim DY, Liu X, Mrva D, Sim KC, Tranter SE et al (2004) Cambridge STT Overview. EARS Mid-year Meeting 2004
Kim DY, Umesh S, Gales MJF, Hain T & Woodland PC (2004) Using VTLN for Broadcast News Transcription. ICSLP’04. Cambridge University, UK
Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland P (2004) Development of the 2003 CU-HTK Conversational Telephone Speech transcription system. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS (pp 249-252)
Hain T (2003) Single Pronunciation Dictionaries - Construction and Performance. EARS STT Technical Meeting 2004
Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland PC (2003) 2003 CU-HTK Broadcast News English System Development. Rich Transcription Workshop 2003
Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Kim DY, Liu X, Mrva D, Povey D, Tranter SE, Wang L et al (2003) 2003 CU-HTK English CTS Systems. Rich Transcription Workshop 2003. Boston, MA
Jia B, Sim KC, Gales MJF, Hain T, Liu X, Woodland PC & Yu K (2003) CU-HTK RT-03 Mandarin CTS System. Rich Transcription Workshop 2003
Woodland PC, Evermann G, Gales MJF, Hain T, Chan HY, Jia B, Kim DY, Liu X, Mrva D, Povey D, Sim KC et al (2003) Recent Experiments with HTK Broadcast News and Conversational Telephone Systems. EARS Mid-year meeting 2003
Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland P (2003) Recent advances in broadcast news transcription. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 (pp 105-110)
Hain T (2002) Implicit Pronunciation Modelling in ASR. ITRW PMLA 2002. Estes Park, Colorado
Woodland PC, Evermann G, Gales MJF, Hain T, Liu X, Moore GL, Povey D & Wang L (2002) CU-HTK APRIL 2002 SWITCHBOARD SYSTEM. Rich Transcription Workshop 2002. Vienna, VA
Hain T, Woodland PC, Evermann G & Povey D (2001) New features in the CU-HTK system for transcription of conversational telephone speech. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS (pp 57-60)
Hain T, Woodland PC, Evermann G & Povey D (2000) The CU-HTK March 2000 HUB5E Transcription System. Speech Transcription Workshop 2000. College Park, Maryland
Hain T & Woodland PC (2000) Modelling sub-phone insertions and deletions in continuous speech recognition. ICSLP 2000
Woodland PC, Odell JJ, Hain T, Moore GL, Niesler TR, Tuerk A & Whittaker EWD (1999) Improvements in Accuracy and Speed in the HTK Broadcast News Transcription System. Eurospeech’99
Woodland PC, Hain T, Moore GL, Niesler TR, Povey D, Tuerk A & Whittaker EWD (1999) The 1998 HTK Broadcast News Transcription System: Development and Results. Proc. of the 1999 DARPA Broadcast News Transcription and Understanding Workshop. Herndon, VA
Odell JJ, Woodland PC & Hain T (1999) The CUHTK-Entropic 10xRT Broadcast News Transcription System. 1999 DARPA Broadcast News Transcription and Understanding Workshop (pp 271-275). Herndon, VA
Hain T & Woodland PC (1999) Hidden model sequences. Hub5 Workshop’99
Hain T & Woodland PC (1999) RECENT EXPERIMENTS WITH THE CU-HTK HUB5 SYSTEM. Hub5 Workshop’99
Hain T & Woodland PC (1999) Dynamic HMM selection for continuous speech recognition. Eurospeech’99 (pp 1327-1330). Budapest
Hain T, Woodland PC, Niesler TR & Whittaker EWD (1999) The 1998 HTK system for transcription of conversational telephone speech. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI (pp 57-60)
Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A & Young SJ (1998) Experiments in broadcast news transcription. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 (pp 909-912)
Hain T & Woodland PC (1998) SEGMENTATION AND CLASSIFICATION OF BROADCAST NEWS AUDIO. ICSLP’98
Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A, Whittaker EWD & Young SJ (1998) The 1997 HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 41-48)
Hain T & Woodland PC (1998) CU-HTK Acoustic modeling experiments. Hub5 Workshop 98
Hain T, Johnson SE, Tuerk A, Woodland PC & Young SJ (1998) Segment Generation and Clustering in the HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 133-137)
Huertgen B & Hain T (1994) . ICASSP’94 (pp 561-564)
Chen M, Zhang H, Li Y, Luo J, Wu W, Ma Z, Bell P, Lai C, Reiss JD, Wang L, Woodland PC et al () . The Speaker and Language Recognition Workshop (Odyssey 2024) (pp 260-265). Quebec City, Canada, 18 June 2024 - 18 June 2024.
Ravenscroft W, Goetze S & Hain T () . ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 11491-11495). Seoul, Korea, 14 April 2024 - 14 April 2024.
Do C-T, Doddipatla R, Li M & Hain T () . INTERSPEECH 2023 (pp 4389-4393)
Sailor HB & Hain T () . Interspeech 2020 (pp 4756-4760)
Huang Q & Hain T () . Interspeech 2019 (pp 584-588)
Hasan M, Doddipatla R & Hain T () . Interspeech 2015 (pp 349-353)
Ng RWM, Shah K, Specia L & Hain T () . Interspeech 2015 (pp 2257-2261)
Alharbi G, Ng RWM & Hain T () . Speech and Language Technology in Education (SLaTE 2015) (pp 161-166)
Fox C & Hain T () . Interspeech 2014 (pp 2440-2444)
Hasan M, Doddipatla R & Hain T () . Interspeech 2014 (pp 2902-2906)
Doddipatla R, Hasan M & Hain T () . Interspeech 2014 (pp 2199-2203)
Casanueva I, Christensen H, Hain T & Green PD () . Interspeech 2014 (pp 1033-1037)
Christensen H, Aniol MB, Bell P, Green PD, Hain T, King S & Swietojanski P () . Interspeech 2013 (pp 3642-3645)
Woodland PC, Odell JJ, Hain T, Moore GL, Niesler TR, Tuerk A & Whittaker EWD () . 6th European Conference on Speech Communication and Technology (pp 1043-1046)

Reports

Close G, Hollands S, Goetze S & Hain T (2022) Clarity Prediction Challenge 1 Entry: Non-intrusive Speech Intelligibility Metric Prediction - Technical Report
el Hannani A & Hain T (2011) Data Dependence of Speech Decoder Parameters
Gibson M & Hain T (2011) Confidence-informed unsupervised Minimum Bayes Risk acoustic model adaptation
Hain T, Dines J & McCowan I (2006) Conversational multi-party speech recognition using remote microphones
Hain T, Woodland PC, Evermann G, Liu X, Moore GL, Povey D & Wang L (2003) Automatic Transcription of Conversational Telephone Speech. Development of the CU-HTK 2002 System

Theses

Hain T (2001) Hidden Model Sequence Models for Automatic Speech Recognition.
Hain T (1993) On the Use of Iterated Function Systems for Coding of Grayscale Images.

Datasets

Nicolao M, Hain T, Christensen H, Green P & Cunningham S .
Deena S, Hasan M, Bashkand MD, Torralba OS & Hain T .
Torralba OS, Hain T & Martinez JO .
Torralba OS, Hain T, Deena S, Bashkand MD, Hasan M, Ng WM, Milner R & Liu Y .
Torralba OS & Hain T .
Torralba OS, Hain T, Deena S, Bashkand MD, Khaliq B, Ng WM, Milner R, Hasan M & Martinez JO .
Deena S, Hasan M, Bashkand MD, Torralba OS & Hain T .
Hain T, Liu Y & Hasan M .
Specia L, Hain T, Ng W & Shah K .

Other

Ng WM, Kwan ACM, Lee T & Hain T () .

Preprints

Close G, Hong K, Hain T & Goetze S (2025) WhiSQA: Non-Intrusive Speech Quality Prediction Using Whisper Encoder Features.
Kheir YE, Ibrahim O, Meghanani A, Almarwani N, Toyin HO, Alharbi S, Alfadly M, Alkanhal L, Selim I, Elbatal S, Mdhaffar S et al (2025) , arXiv.
Iakovenko O & Hain T (2024) , arXiv.
Do C-T, Imai S, Doddipatla R & Hain T (2024) , arXiv.
Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC (2024) , arXiv.
Meghanani A & Hain T (2024) , arXiv.
Chen M, Zhang H, Li Y, Luo J, Wu W, Ma Z, Bell P, Lai C, Reiss J, Wang L, Woodland PC et al (2024) , arXiv.
Park C, Chen M & Hain T (2024) Automatic Speech Recognition System-Independent Word Error Rate Estimation.
Close G, Hain T & Goetze S (2024) , arXiv.
Meghanani A & Hain T (2024) , arXiv.
Meghanani A & Hain T (2024) , arXiv.
Ahmad R, Farooq MU & Hain T (2024) Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training.
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) , arXiv.
Close G, Ravenscroft W, Hain T & Goetze S (2023) , arXiv.
Park C, Lu C, Chen M & Hain T (2023) Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text.
Ravenscroft W, Goetze S & Hain T (2023) , arXiv.
Close G, Hain T & Goetze S (2023) , arXiv.
Close G, Hain T & Goetze S (2023) The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions.
Ollerenshaw A, Jalal MA, Milner R & Hain T (2023) , arXiv.
Ahmad R, Jalal MA, Farooq MU, Ollerenshaw A & Hain T (2023) , arXiv.
Ollerenshaw A, Jalal MA & Hain T (2022) , arXiv.
Ollerenshaw A, Jalal MA & Hain T (2022) , arXiv.
Ravenscroft W, Goetze S & Hain T (2022) , arXiv.
Park C, Ahmad R & Hain T (2022) , arXiv.
Farooq MU, Narayana DAH & Hain T (2022) , arXiv.
Farooq MU & Hain T (2022) Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition.
Milner R, Jalal MA, Ng RWM & Hain T (2022) , arXiv.
Ollerenshaw A, Jalal MA & Hain T (2022) , arXiv.
Ravenscroft W, Goetze S & Hain T (2022) , arXiv.
Chen M, Zhou Y, Huang H & Hain T (2022) Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution.
Do C-T, Doddipatla R & Hain T (2021) , arXiv.
Chen M, Shi Y & Hain T (2020) Towards Low-Resource StarGAN Voice Conversion using Weight Adaptive Instance Normalization.
Chen M & Hain T (2020) Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders.
Doulaty M, Saz O, Ng RWM & Hain T (2016) , arXiv.
Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2015) , arXiv.
Doulaty M, Saz O, Ng RWM & Hain T (2015) , arXiv.
Saz O, Doulaty M & Hain T (2015) , arXiv.
Doulaty M, Saz O & Hain T (2015) , arXiv.
Doulaty M, Saz O & Hain T (2015) , arXiv.
Ng RWM, Doulaty M, Doddipatla R, Aziz W, Shah K, Saz O, Hasan M, AlHarbi G, Specia L & Hain T (2014) The USFD Spoken Language Translation System for IWSLT 2014.
Ravenscroft W, Goetze S & Hain T () .

Grants

, EPSRC, 04/2019 - 09/2027, £5,508,850, as PI
VoiceBase Centre, VoiceBase Inc./Liveperson, 04/2018 - 03/2026, £2,488,691, as PI
WFST-based integration of ASR and MT in Spoken Language Translation, Industrial, 03/2014 - 12/2026, £63,588, as PI
Automatic voice conversion for transforming professional adult voice actors to artificial child voice actors, Innovate UK, 01/2021 - 01/2023, £173,605, as PI
MAUDIE: Multimedia Analysis for Unsupervised Dubbing In Entertainment, Innovate UK, 05/2018 - 07/2021, £393,115, as PI
TUTO II: Reading skills tutoring system, ITSLANGUAGE BV, 08/2017 - 12/2019, £121,439, as PI
Sound Source Separation Based on Deep Learning, Industrial, 05/2019 - 04/2020, £48,000, as PI
Acoustic correlates of emotions for automatic recognition, Industrial, 10/2018 - 09/2019, £48,900, as PI
Bridge Project, VoiceBase Inc., 09/2017 - 03/2018, £61,200, as PI
STATUS IV: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2017 - 10/2017, £60,000, as PI
TUTO: Reading skills tutoring system, ITSLANGUAGE BV, 09/2016 - 08/2017, £61,983, as PI
STATUS III: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2015 - 07/2016, £78,684, as PI
STATUS II: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 11/2013 - 05/2014, £98,982, as PI
ItsLanguage, ITSLANGUAGE BV, 11/2012 - 03/2015, £68,333, as PI
German System Adaptation, ITSLANGUAGE BV, 11/2012 - 03/2015, £42,373, as PI
DocuMeet: , EC FP7, 11/2012 - 10/2014, £368,433, as PI
STATUS: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 10/2012 - 08/2013, £73,726, as PI
A Joint Model of Spoken Language Translation, Google, 09/2011 - 12/2016, £43,014, as PI
, EPSRC, 05/2011 - 07/2016, £1,798,665, as PI
Unsupervised Domain Adaptation, CISCO, 11/2010 - 04/2012, £121,745, as PI
, EC FP6, 10/2006 - 12/2009, £467,074, as PI
, EC FP6, 10/2006 - 12/2009, £345,350, as PI

Professional activities and memberships

Head of the research group
Editorial Board member,
Associate Editor,
Organising committee member, ASRU 2013
Area Chair, Interspeech 2014, Speech Recognition - Signal Processing, Acoustic Modelling, Robustness and Adaptation.
Area Chair, ICPR 2014, Track 3: Image, Speech, Signal and Video Processing
Programme Committee,
