Projects
Speechbrain
An open-source and all-in-one speech toolkit relying on PyTorch. My main work is on porting WFST-based training (CTC and LF-MMI topologies) and decoding into the existing codebase.
speechbrain-cl
Code for my `Comparison and Analysis of New Curriculum Criteria for End-to-End ASR` Interspeech paper. Implements multiple curriculum learning methods with speechbrain.
hass-gshell-extension
A gnome-shell extension meant to work as an interface for some basic functionalities of Home Assistant.
GraphemeToPhoneme-Greek
A grapheme-to-phoneme (G2P) conversion toolkit for Greek. Useful for building speech recognition training pipelines.
drug-identifier
Drug Named Entity Recognition (NER) Package, using spaCy and LLM prompting.
Numbers2Words-Greek
A toolkit for converting numbers to words in Greek. Useful for building speech recognition training pipelines.
accent-adaptation-through-tts
Repository for the ''Accent Adaptation Through the Use of Synthesized Speech'' paper, where accent-specific ASR training is assisted with an augmented accented dataset.
em-img-seg
Rust implementation of the Expectation-Maximization (EM) algorithm for image segmentation. Also includes a performance analysis.
latex-cv-template
A minimal, single column latex template for CVs/resumes.
speech-recognition-gr
Contains a list of freely available Greek speech recognition datasets and some pretrained models which can be used as a basis for further training.