Implement real-time phone speech recognition in project with Asterisk PBX and Kaldi/Vosk -- 2

$30-250 USD

In corso

Pubblicato

10 mesi fa

$30-250 USD

Pagato al completamento

In my programming project I build a system around an Asterisk VoIP server. My purpose is to enable streaming speech recognition once inbound call occurs, i.e. I want to run automatic voice recognition since starting of conversation two people are involved into. The ASR (automatic speech recognition) engine I have chosen to implement that is Kaldi powered by Vosk server ([login to view URL]). As it needs some integration into Asterisk software, I use Asterisk-specific module ([login to view URL]) to carry out ASR operations without compatibility issues. So far if anybody speaks anything while calling, it gives very clear text output. The problem I'm struggling is how to enable streaming ASR immediately during the conversation, i.e. since Dial() application of Asterisk dialplan gets executed. That's a subject of this job - create script (most likely, with some Asterisk REST Interface components) which works as follows: 1) since Dial() application starts running, real-time audio stream gets processed via ASR engine that is waiting for inputs inside of docker container (because I deploy Kaldi as a software built in Vosk server which is compatible with Asterisk, here is the out-of-box program implementation released on Github: [login to view URL]) 2) once conversation begins and voice streaming is detected, audial data flow heads the ASR powered by Vosk server (within the docker container); 3) while the data flow continues because of the ongoing conversation between people, the ASR generates transcribed outputs (files) that must be forwarded to an HTTP server to evaluate the contents of them (don't worry about this part, it's beyond this specific work , Certainly); And it should also be possible to hang up the call according to the transcript. 4) since conversation gets wrapped up, last phrases get processed via ASR to pass final outputs to the HTTP server mentioned above; 5) whenever inbound call occurs, same steps to be carried out: audial data capture - speech recognition within the docker container - text file through to the HTTP server. That all to be compliant with real time requirements, so data flow needs fast and seamless throughput before and after ASR processing, as a matter of course. While searching for any helpful content on the Internet, I encountered this Stack Overflow question [login to view URL] It makes clear the same purpose, just in other words than in my description. However, I demand implementation of the system design with Kaldi/Vosk rather then Google Speech. As for language to be used for development, I would leave some options. So, Python/Java/JS are acceptable to do that. The job will be considered as complete and worth full payment only if there is a provable functionality of the program which enables all listed steps without implementation errors. Certainly, it must be compatible with all aforementioned software products too.

Rif. progetto: 36888941

Info sul progetto

11 proposte

Progetto a distanza

Attivo 10 mesi fa

Hai voglia di guadagnare un po'?

Indirizzo Email

I vantaggi delle offerte su Freelancer

Imposta il tuo budget e le scadenze

Fatti pagare per il lavoro svolto

Delinea la tua proposta

La registrazione e le offerte sui lavori sono gratuite

Assegnato a:

@alaarabie117

I thrive in machine learning environments through team collaboration and take any opportunity to learn from others and prove my own expertise. I am a self motivated problem solver and hope to find meaningful work in a field that constantly challenges me. I have a lot of challenges developing my skills Please review my Freelancer profile [https://www.freelancer.com/u/alaarabie117] for further information on my qualifications and client feedback. I am available for chatting me to discuss the project in more detail and address any questions or concerns you may have. Thank you for considering my proposal. I look forward to the opportunity to work with you and contribute to the success of this project. Sincerely, Alaa Rabie Khalifa

$140 USD in 7 giorni

5,0

(1 valutazione)

2,3

11 freelance hanno fatto un'offerta media di $116 USD

@serazummunirz

Hi There, I'm interested in this project and can provide a solution which will enable all steps required with ASR and necessary integration. I can also provide a provable functionality as required. Please feel free to send me a message with any queries. Thanks in advance for giving me the opportunity to work with you. Best Regards, Sirajum Munir.

$200 USD in 1 giorno

5,0

(65 valutazioni)

5,9

@rashidamjad

Hi there, I am a full stack developer with 4+ years of experience in both front-end and back-end development. I have read your Implement real-time phone speech recognition in project with Asterisk PBX and Kaldi/Vosk -- 2 description very carefully and would like to have a detailed chat about this project as to resolve some queries I have regarding this project that needs to be cleared to get things started. We always look for a long term relationship. It would be my pleasure to build long term relationship with you. All my skills are related to this particular project. Looking forward from your response, Thanks Rashid Amjad.

$250 USD in 8 giorni

5,0

(7 valutazioni)

4,8

@DataScinceFizer

I developed an intuitive feeling about python programming. I can write clean validated python code and make a device-supported py. File. https://www.freelancer.com/projects/python/Project-for32411503/reviews https://www.freelancer.com.bd/projects/python/need-Python-data-science-expert-29599523/reviews I have confidence and say I'm excellent for your project. Let's remark on your project within the message box. Regards@DataScinceFizer

$100 USD in 3 giorni

5,0

(2 valutazioni)

2,1

@QinKebin

Dear sir, I am a senior software engineer who has vast experience with tech stacks below for 8 years. I have rich experience in Python, Linux, CentOs, VoIP and Asterisk PBX, so I can deliver the best result. - Python, C, Java - Machine Learning - Deep Learning - Neural Networks - Algorithm - Artificial Intelligence - Data Structures - Competitive Programming - NLP - Computer Vision(OpenCV) - PyTorch - Tensorflow - Reinforcement Learning I'm familiar with agile project management tools including Slack, JIRA, Trello, Bitbucket, Github, etc. I ensure the highest quality of product and 100% satisfaction through my work. I am innovative and strategic thinking professional with a proven track record of consistently going above and beyond in meeting customer needs and providing more value to the product than what the customer is paying for. For this very reason, they always get back to us again and again with promising ideas and projects. I hope we can discuss more details in chat. I'll look forward to hearing from you soon. Thanks so much. Kind Regards.

$30 USD in 6 giorni

0,0

(0 valutazioni)

0,0

@sofiia85

Hi, I've gone through the job posting. It seems that you're looking for a software engineer who have rich experiences on CentOs, Python, VoIP, Linux and Asterisk PBX. I've worked with another client in this space, I think you might find it very interesting to have chat. Regards, Sofya

$50 USD in 2 giorni

0,0

(0 valutazioni)

0,0

@rajeevnewnetlink

Hi, Implement real-time phone speech recognition in project with Asterisk PBX and Kaldi/Vosk -- 2guilhermepbxPython, Linux, Asterisk PBX, VoIP, CentOsPython, VoIP, Asterisk PBX, LinuxBrazil We are an expert team which have many years of experience on Python, Linux, Asterisk PBX, VoIP, CentOs Lets connect in chat so that We discuss further. Regards

$200 USD in 7 giorni