- Al-Bari Technologies
- Pakistan
- @abranasays
- in/abdulbasitrana
Lists (24)
Angular Packages
Courses Material
CSS framework
Database
Design Patterns
Dev Resources
Dev Tools
Email Tools
Flutter Packages
Interview
Javascript Packages
Laravel Packages
MacOS App
PHP Packages
PHP Tools
Portfolio
Public API
Python Packages
React Packages
Software Architecture
SQL Tools
Svelte Packages
UI Component
Vue Packages
Stars
Open source website builder and Webflow alternative. Webstudio is an advanced visual builder that connects to any headless CMS, supports all CSS properties, and can be hosted anywhere, including wi…
WebView OAuth flows for desktop flutter apps
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
AI-Powered Podcast Generator: A Python-based tool that converts text scripts into realistic audio podcasts using Google's Generative AI API. This project leverages advanced text-to-speech technolog…
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
An extremely fast implementation of Whisper optimized for Apple Silicon using MLX.
Unofficial API Wrapper for Deepseek (chat.deepseek.com)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Open Source framework for voice and multimodal conversational AI
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Instant voice cloning by MIT and MyShell. Audio foundation model.
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
One minute of voice data can be enough to train a good TTS model (few-shot voice cloning).
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS…
Inference and training library for high-quality TTS models.
Effortlessly run LLM backends, APIs, frontends, and services with one command.
Robust Speech Recognition via Large-Scale Weak Supervision
Part guillotine, part graveyard for Google's doomed apps, services, and hardware.
Port of OpenAI's Whisper model in C/C++