scripty

scripty is a collection of AI-powered speech-to-text prototypes based on whisper and faster-whisper. For each of the two backends, there is one example that transcribes an audio file and one that transcribes real-time microphone input. Inference runs locally.
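As a rough illustration of the file-based case, the sketch below transcribes an audio file with either backend and prints timestamped segments. It is not taken from the scripts in this repository: the function names, the `base` model size, and the CPU/int8 settings are assumptions chosen for the example. Both backends are imported lazily, so only the one you call needs to be installed.

```python
from typing import Iterable, List, Tuple

# (start in seconds, end in seconds, text) -- a hypothetical common shape
# used here so that both backends can share the same formatting code.
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"


def format_segments(segments: Iterable[Segment]) -> List[str]:
    """Turn (start, end, text) tuples into printable transcript lines."""
    return [
        f"[{format_timestamp(start)} -> {format_timestamp(end)}] {text.strip()}"
        for start, end, text in segments
    ]


def transcribe_with_faster_whisper(path: str, model_size: str = "base") -> List[Segment]:
    """Transcribe a file with faster-whisper (assumed settings: CPU, int8)."""
    from faster_whisper import WhisperModel  # imported lazily

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path)
    # `segments` is a generator; transcription runs while it is consumed.
    return [(seg.start, seg.end, seg.text) for seg in segments]


def transcribe_with_whisper(path: str, model_size: str = "base") -> List[Segment]:
    """Transcribe a file with openai-whisper (requires ffmpeg on the PATH)."""
    import whisper  # imported lazily

    model = whisper.load_model(model_size)
    result = model.transcribe(path)
    return [(seg["start"], seg["end"], seg["text"]) for seg in result["segments"]]
```

A call such as `print("\n".join(format_segments(transcribe_with_faster_whisper("audio.wav"))))` would print one line per segment, where `audio.wav` is a placeholder for your own file.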

Installation

Windows, without CUDA

# Prerequisite: install ffmpeg and make sure it is on your PATH
pip install -U openai-whisper
pip install torch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0
pip install numpy==1.23.5
pip install faster-whisper
pip install SpeechRecognition pyaudio

Windows, with CUDA

# Prerequisite: install ffmpeg and make sure it is on your PATH
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1+cu117 --index-url https://download.pytorch.org/whl/cu117
pip install numpy==1.23.5
pip install openai-whisper
pip install faster-whisper
pip install SpeechRecognition pyaudio
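After running the install commands above, a quick check like the following can confirm that ffmpeg is reachable and that the packages import under their usual module names. This is a minimal sketch, not part of the repository: `check_environment` and `pick_device` are hypothetical helper names. The CUDA flag only reflects what PyTorch reports; faster-whisper runs on CTranslate2, which may need its own CUDA libraries, so treat the result as a hint rather than a guarantee.

```python
import importlib.util
import shutil
from typing import Dict

# Import names that correspond to the pip packages listed above
# (openai-whisper -> whisper, SpeechRecognition -> speech_recognition, ...).
REQUIRED_MODULES = [
    "torch",
    "numpy",
    "whisper",
    "faster_whisper",
    "speech_recognition",
    "pyaudio",
]


def check_environment() -> Dict[str, bool]:
    """Report which prerequisites are available, without failing on missing ones."""
    report = {"ffmpeg": shutil.which("ffmpeg") is not None}
    for name in REQUIRED_MODULES:
        report[name] = importlib.util.find_spec(name) is not None

    # Ask PyTorch about CUDA only if it is installed and importable.
    report["cuda"] = False
    if report["torch"]:
        try:
            import torch

            report["cuda"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken torch install should not crash the check itself.
            report["cuda"] = False
    return report


def pick_device(report: Dict[str, bool]) -> str:
    """Choose a device string based on the report ('cuda' or 'cpu')."""
    return "cuda" if report.get("cuda") else "cpu"
```

Printing the result, for example with `for name, ok in check_environment().items(): print(name, "OK" if ok else "missing")`, gives a one-line status per prerequisite.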

Repository contents

rsc
samples
README.md
fw_realtime_scripty.py
fw_scripty.py
realtime_scripty.py
sripty.py