Showing 6662 open source projects for "audio linux"

  • 1
    MLX-Audio

    A text-to-speech, speech-to-text and speech-to-speech library

    MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI...
    Downloads: 13 This Week
  • 2
    Step-Audio

    Open-source framework for intelligent speech interaction

    Step-Audio is a unified, open-source framework aimed at building intelligent speech systems that combine both comprehension and generation: it integrates large language models (LLMs) with speech input/output to handle not only semantic understanding but also rich vocal characteristics like tone, style, dialect, emotion, and prosody. The design moves beyond traditional separate-component pipelines (ASR → text model → TTS), instead offering a multimodal model that ingests speech or audio and...
    Downloads: 3 This Week
  • 3
    Kimi-Audio

    Audio foundation model excelling in audio understanding

    Kimi-Audio is an ambitious open-source audio foundation model designed to unify a wide array of audio processing tasks — from speech recognition and audio understanding to generative conversation and sound event classification — within a single cohesive architecture. Instead of fragmenting work across specialized models, Kimi-Audio handles automatic speech recognition (ASR), audio question answering, automatic audio captioning, speech emotion recognition, and audio-to-text chat in one...
    Downloads: 1 This Week
  • 4
    Qwen2-Audio

    Repo of Qwen2-Audio chat & pretrained large audio language model

    Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound...
    Downloads: 0 This Week
  • 5
    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    Qwen-Audio is a large audio-language model developed by Alibaba Cloud, built to accept various types of audio input (speech, natural sounds, music, singing) along with text input, and output text. There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio. It uses multi-task training over many different audio tasks (30+), and achieves strong multi-benchmarks performance...
    Downloads: 1 This Week
  • 6
    Step-Audio-EditX

    LLM-based Reinforcement Learning audio edit model

    Step-Audio-EditX is an open-source, 3 billion-parameter audio model from StepFun AI designed to make expressive and precise editing of speech and audio as easy as text editing. Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) — combining a linguistic token stream and a semantic (prosody/emotion/style) token stream — thereby abstracting audio editing into high-level...
    Downloads: 2 This Week
  • 7
    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. The system...
    Downloads: 0 This Week
  • 8
    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It...
    Downloads: 0 This Week
  • 9
    OBS Studio

    Open source software for live streaming and recording

    OBS Studio, also known as Open Broadcaster Software, is a free and open source software program for live streaming and video recording. Features of the software include device/source capture, recording, encoding and broadcasting. Stream on Windows, Mac or Linux. This software is commonly used by video game streamers on the popular streaming platform Twitch.
    Downloads: 265 This Week
  • 10
    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC; no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system, such as CPU, GPU, and NPU.
    Downloads: 158 This Week
  • 11
    Ultimate Vocal Remover (UVR5)

    GUI for a Vocal Remover that uses Deep Neural Networks

    This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
    Downloads: 767 This Week
  • 12
    LosslessCut

    The swiss army knife of lossless video/audio editing

    LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby...
    Downloads: 151 This Week
  • 13
    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. NeuralNote supports polyphonic...
    Downloads: 86 This Week
  • 14
    Cider App

    A new cross-platform Apple Music experience based on Electron and Vue

    An open-source, community-oriented Apple Music client for Windows, Linux, macOS, and more. Whether it be Discord, LastFM, or even equalizers we've got you covered. Discord & Last.fm Integration. Quickly share and show others what you're listening to; right out of the box. Audio Enhancements. Audio Spatialization, Adrenaline Processor™, and Equalizers are all available and actively engineered by our Audio Engineer, Maikiwi.
    Downloads: 94 This Week
  • 15
    Strawberry Music Player

    Strawberry is a cross-platform music player and music collection organizer. It is aimed at music collectors and audiophiles. With Strawberry you can play and manage your digital music collection, or stream your favorite radio stations. Strawberry is free software released under GPL. The...
    Downloads: 114 This Week
  • 16
    LTX-2.3

    Official Python inference and LoRA trainer package

    LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while...
    Downloads: 167 This Week
  • 17
    Snapcast

    Synchronous multiroom audio player

    Snapcast is a multiroom client-server audio player, where all clients are time synchronized with the server to play perfectly synced audio. It's not a standalone player, but an extension that turns your existing audio player into a Sonos-like multiroom solution. Audio is captured by the server and routed to the connected clients. Several players can feed audio to the server in parallel and clients can be grouped to play the same audio stream. One of the most generic ways to use Snapcast is...
    Downloads: 48 This Week
  • 18
    Audio Share

    Audio Share can share Windows/Linux computer's audio to Android phone

    Audio Share can share a Windows/Linux computer's audio to an Android phone over the network, so your phone becomes the computer's speaker. (You needn't buy a new speaker😄.) https://github.com/mkckr0/audio-share
    Downloads: 264 This Week
  • 19
    spotDL

    Download your Spotify playlists and songs along with album art

    spotDL is a command-line tool that allows users to download songs and playlists from Spotify by sourcing the audio from YouTube. Built in Python, it automatically matches Spotify tracks with corresponding videos on YouTube and downloads them with embedded metadata. The tool retrieves important information such as album art, song titles, artist names, and lyrics to organize downloaded files. spotDL is designed to be fast, accurate, and easy to use through a simple command-line interface. It...
    Downloads: 105 This Week
  • 20
    CAVA

    Cross-platform Audio Visualizer

    Turn your phone or tablet into a fantastic audio viewer. Easy to use and customize. Choose from several preset settings of incredible colors or create your own. CAVA is a bar spectrum audio viewer based on my own open source project with the same name. Take the audio from the device's microphone and visualize the amplitude of the different frequencies as bars on the screen. Each bar represents a certain bandwidth of low to high frequencies. The leftmost bar starts at 50 Hz and the rightmost...
    Downloads: 37 This Week
  • 21
    Spotube

    Open source Spotify client that doesn't require Premium

    An open source, cross-platform Spotify client that uses Spotify's data API and YouTube, Piped video, or JioSaavn as an audio source, eliminating the need for Spotify Premium. It is still recommended to support creators by engaging with their YouTube channels/Spotify tracks (or preferably by buying their merch/concert tickets/physical media).
    Downloads: 66 This Week
  • 22
    Navidrome

    Your Personal Streaming Service

    Navidrome is an open-source, web-based personal music server that lets you stream and manage your entire music collection from any browser or compatible mobile app, effectively turning your own files into a cloud-accessible music service. It supports large libraries and handles a wide variety of audio formats while maintaining very low resource usage, so it runs well even on small servers, Raspberry Pi devices, and other constrained hardware. Users can browse, play, and organize tracks and...
    Downloads: 49 This Week
  • 23
    Shairport Sync

    AirPlay audio player

    ...In this way, synchronized multi-room audio is possible for players that support it, such as iTunes and the macOS Music app. Shairport Sync runs on Linux, FreeBSD and OpenBSD. It does not support AirPlay video or photo streaming. Shairport Sync offers full audio synchronization, a feature of AirPlay that previous implementations do not provide. Full audio synchronization means that audio is played on the output device at exactly the time specified by the audio source.
    Downloads: 7 This Week
  • 24
    HandBrake

    An open source video transcoder to convert video from any format to modern codecs

    HandBrake is an open-source, GPL-licensed, multiplatform, multithreaded video transcoder, available for MacOS X, Linux and Windows.
    Downloads: 229 This Week
  • 25
    YesPlayMusic

    Play Music for Windows / macOS / Linux

    High-quality third-party NetEase Cloud Music player that supports Windows, macOS, and Linux. Overseas users can play directly (need to log in to a NetEase Cloud account). Supports UnblockNeteaseMusic, automatically replacing grayed-out song links with various audio sources (web version not supported). "Various audio sources" refers to audio sources that are enabled by default. The YouTube audio source requires you to install yt-dlp yourself.
    Downloads: 22 This Week