The input form offers audio language selection (Luxembourgish or German), a diarization switch, and an optional number of speakers field to guide diarization. For output, the web interface supports Text (enriched text, downloadable as TXT/DOCX), Word DOCX, Subtitles (SRT), MAXQDA transcript, Praat TextGrid, Praat TextGrid aligned (based on MMS forced alignment and MFA resources for Luxembourgish), and JSON. When srt is selected, you can also set SRT line length.
These files can be downloaded through the link below the transcription. The recognition duration takes up to 5% of the audio file’s duration (e.g., 3 minutes for 60 minutes of audio). Once the recognition process has started, an estimated time and a timer will be displayed to keep track of the progress.
As an experimental feature, translation can output the recognized text in English, French, German, Spanish, Portuguese, or Luxembourgish. Note that translations take more time to run and will run only for short audios (max. 3 minutes). The quality of these translations may vary. You can try also our stand-alone translation LuxMT.
The maximal size for upload is 500 MB. Audio files should be in WAV, MP3 or M4A format, video files in MP4 format.
maxqda to generate transcript text for MAXQDA import. Each speaker paragraph ends with a timestamp in MAXQDA-compatible form [hh:mm:ss.x].
We are opening API access now for limited access. We reserve the right to modify or suspend access to the API at any time. If you plan to integrate our service into another application, contact us first for permission and conditions. The LuxASR API can be reached via:
curl -X POST "https://luxasr.uni.lu/v2/asr?diarization=Enabled&outfmt=colored_text" \
-H "accept: application/json" \
-F "audio_file=@PATH/TO/AUDIO FILE;type=audio/wav"
The API returns the transcription in the specified output format.
lb (Luxembourgish, default) and de (German).Enabled (default) or Disabled to include or exclude speaker diarization.2, 3) to set the expected number of speakers when diarization is enabled.colored_text – enriched text with interactive features and confidence highlightingjson – JSON outputsrt – SubRip subtitle formatmaxqda – transcript text with MAXQDA-compatible #hh:mm:ss.x# timestampstextgrid – Praat TextGrid formattextgrid_aligned – aligned TextGrid format using MMS and MFA Luxembourgish resourcesoutfmt=srt; web default: 42).en, fr, de, es, pt, lb) for supported output formats and short audios.Accepted audio formats are .wav, .mp3, and .m4a.
Note that the transcription and the translation are run on a dedicated server at the University of Luxembourg. All data thus stays within Luxembourg and the University’s network. Nobody has access to the uploaded audio or the text output. The audio data is streamed to this server and no files are stored on this server or in the network. No data is used to further train the model and no data is transferred to third parties.
Learn more about LuxASR. LuxASR is under constant development by Peter Gilles, Léopold Hillah, and Nina Hosseini-Kivanani at the University of Luxembourg and is supported by the Chambre des Députes du Grand-Duché de Luxembourg.