INTERNSHIP - Create a benchmarking tool for Speech-to-Text solutions

Domain
Engineer - Research
Market
Rail
Location

Izegem, Belgium

Televic Rail

With over 30 years of experience in designing, manufacturing and maintaining on-board communication and control systems, Televic Rail is a leading, trusted partner for railway operators and train builders worldwide.

Its Passenger Information Systems and Control Systems are high quality, tailor-made solutions that offer the flexibility, user-friendliness and stability that our clients ask for. Our various types of on-board control systems such as our bogie monitoring systems are innovative yet reliable products which are designed specifically for the railway business.

Trains and trams all around the world are equipped with Televic Rail solutions, from New Zealand to Canada, from China to the United States, from India to Belgium, England and France.

Topic

In most trains, conductors can relay new information to passengers using the audio systems. However, if a passenger misses this announcement, or can't understand it very well (e.g. they are wearing headphones, there's a language barrier, they are hard of hearing, ...), the passenger might miss crucial information. For this reason, Televic GSP is currently developing a speech-to-text engine, which automatically transcribes these passengers calls and shows them on the train displays.Since, this is still in active development, our model frequently changes. To determine the accuracy of each iteration, we would require a benchmark tool. This benchmark tool would allow us to plug in our model, and some labeled test-data, and then display useful metrics of the model. Useful metrics could be the word error rate (WER), the character error rate (CER), the accuracy on certain types of words (e.g. locations), the time it took to transcribe audio, etc... The goal of the student would be to:

* If this is taken as an internship:

** develop a benchmark tool, in which our model can be plugged in.

** create a representative test dataset from a set of recorded passenger files.

* If this is taken as a thesis (or the intern is working very quickly):

** investigate useful metrics to output

** allow third-party models to be plugged into the benchmark, so a useful comparison can be made.

If you are interested in this topic, please also register this on the Televic website at: https://www.televic.com/en/careers/internships-and-students so we can confirm the topic is still available.

Specifics

  • Level: Academic Master/Master
  • Specialty: AI/Machine Learning/ Software
  • Type of work: Research 15%, Implem. 75%, Experim. 10%
  • Location: Televic/University
  • Type of activities: Design, Implementation, Programming
  • Number of students: 1 or 2

APPLY HERE!

Personal information
How would you like us to contact you?
Profile
Browse
One file only. 15 MB limit. Allowed types: pdf, docx, doc.
Browse
One file only. 15 MB limit. Allowed types: pdf, docs, doc.
Extra information
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.