Dnext

#speechtotext

October 19, 2024 10:42pm

Police are using AI to write police reports. I pictured police officers cutting & pasting from ChatGPT, but that's not what this is. The system is called Draft One, from a company called Axon, and the big selling point, is, obviously, saving police officers' time. But apparently the main thing it does is transcribe audio from bodycams. It's not a "police-report-generating LLM".

Even so, it is not clear these AI-generated reports will be acceptable as evidence in court.

Internal memo: Don't use AI for police reports, prosecutor tells Seattle-area law enforcement

#solidstatelife #ai #speechtotext

Internal memo: Don’t use AI for police reports, prosecutor tells Seattle-area law enforcement

The King County Courthouse in downtown Seattle. (GeekWire Photo / Taylor Soper) Citing the potential for AI hallucinations and other unintended errors,

Hackaday (unofficial)

August 16, 2024 9:00am

Robust Speech-to-Text, Running Locally on Quest VR Headset

#softwaredevelopment #virtualreality #speechtotext #transcription #vr #whisper #hackaday
posted by pod_feeder_v2

Robust Speech-to-Text, Running Locally On Quest VR Headset

[saurabhchalke] recently released whisper.unity, a Unity package that implements whisper locally on the Meta Quest 3 VR headset, bringing nearly real-time transcription of natural speech to the dev…

Wayne Radinsky

December 31, 2023 2:52am

Persian added to Speechmatics. Speechmatics is an automatic speech recognition software company in Cambridge, England.

"The key to understanding spoken Persian is variety. We want a mix of clean audio, such as audiobooks, and messier audio, like someone shouting next to a loud washing machine. The speech needs to include a range of vocabulary too, including technical language, informal vernacular, and regional-specific words. We try to create a bank of diverse voices that reflect how Persian is heard in the real world -- different contexts, quality of recordings, and accents."

"Capturing such a wide range of voices is significantly helped by our self-supervised approach. When building our bank of speech, we're not only looking for labeled data (i.e. audio recordings that come accompanied by a human-written transcript) but also unlabeled audio, of which there is much more. This opens the pool of audio to be learned from since we're not restricted to perfect datasets of recorded and transcribed Persian -- we can potentially use any spoken Persian."

110 million more voices are now understood

#solidstatelife #ai #speechtotext

Understanding Persian - bringing a new language to Speechmatics

Lifting the lid on how Speechmatics' speech recognition learnt the Persian language.

Wayne Radinsky

December 13, 2023 4:32am

Dubbing and subtitling still requires humans, says Tom Scott. He says he tried several AI translation tools, as well as just asking a large language model to translate his videos and subtitles, and as of right now, as of 2023, capturing the nuance and meaning of his speech definitely still requires humans. The video has numerous examples of tricky translations and even the ads are part of it. And the video is is subtitled (by humans, not auto-translated by YouTube) in English, French, Hindi, Japanese, Brazilian Portuguese, and (Latin American) Spanish.

Why don't subtitles match dubbing? - Tom Scott

#solidstatelife #ai #speechtotext #machinetranslation

Hackaday (unofficial)

April 7, 2023 3:00am

ChatGPT Powers a Different Kind of Logic Analyzer

#machinelearning #ai #chatbot #chatgpt #logic #logicalfallacy #speechtotext #spp #whisper #hackaday
posted by pod_feeder_v2

ChatGPT Powers A Different Kind Of Logic Analyzer

If you’re hoping that this AI-powered logic analyzer will help you quickly debug that wonky digital circuit on your bench with the magic of AI, we’re sorry to disappoint you. But if you…

philetmon

April 10, 2022 7:40am

Depuis 15 ans que j'attendais ça, j'ai pu tester le speech to text sous linux. C'est au JDLL que j'ai entendu parler de Vosk sur le stand de OO. Suivi les instructions de ce site: https://www.suramya.com/blog/2022/01/nerd-dictation-a-fantastic-open-source-speech-to-text-software-for-linux/

Attention, marche uniquement sous X11, et chez moi n'est pas très rapide. Mais c'est une super étape. Est-ce que quelqu'un ici utilise ça de manière courante ?

#JDLL #SpeechtoText #libre

Hackaday (unofficial)

July 7, 2021 7:00pm

EMOJO Chatbot Will Be There for You

#raspberrypi #thehackadayprize #2021hackadayprize #speechtotext #texttospeech #tft #hackaday
posted by pod_feeder_v2

EMOJO Chatbot Will Be There For You

We all need someone to talk to sometimes, and the pandemic has only made matters worse when it comes to the number of people living with anxiety and depression. Exchanging the simplest of pleasantr…

biagio_

November 27, 2019 10:50am

#surveillance #speechtotext #china
The real news here is that the great firewall failed to stop her .
https://www.businessinsider.com/china-uighur-protest-tiktok-suspend-feroza-aziz-2019-11?IR=T

TikTok suspended a teen who posted a viral takedown of China disguised as a makeup tutorial, but it claims it's because she posted a video of Osama bin Laden

Feroza Aziz's suspension from TikTok came as the Chinese-owned app faces increasing criticism over its treatment of politically sensitive content.

0 Persons are tagged with #speechtotext

#speechtotext

Robust Speech-to-Text, Running Locally on Quest VR Headset

ChatGPT Powers a Different Kind of Logic Analyzer

EMOJO Chatbot Will Be There for You