Audio Capture - Per-Process Audio Recording

A Windows application for capturing audio from individual processes using WASAPI (Windows Audio Session API). Records audio to WAV, MP3, Opus, or FLAC formats with an accessible Win32 interface. Fully self-contained with no external DLL dependencies.

Full Disclosure

This project was written with Claude Code. It has been tested and is fully functional.

Features

Per-Process Audio Capture: Record audio from specific applications independently (Windows 10 2004+ / build 19041+)
System-Wide Audio Capture: Option to capture all system audio simultaneously (works on all supported Windows versions)
Pause/Resume Recording: Pause and resume all active recordings with dedicated UI buttons
Microphone Input Capture: Record from microphone/line-in devices with seamless integration
Multi-Process Recording Modes:
- Separate Files: Each process records to its own file
- Combined File: All processes mixed into a single output file
- Both: Create individual files AND a combined mixed file
Real-time Audio Mixing: Combine multiple audio streams into a single file
Audio Monitoring/Passthrough: Send captured audio to another output device in real-time for monitoring
Monitor-Only Mode: Listen to audio without recording it to disk
Multiple Format Support: Save recordings as WAV, MP3, Opus, or FLAC files
- WAV: Uncompressed PCM audio (highest quality, largest size)
- MP3: Compressed with configurable bitrate (128-320 kbps)
- Opus: Modern codec with configurable bitrate (64-256 kbps)
- FLAC: Lossless compression with configurable levels (0-8)
Silence Detection: Optional skip silence feature to save disk space
Process Filtering: Show only processes with active audio output
Window Title Display: See window titles to easily identify processes
Accessible Win32 UI: Standard Windows controls with full keyboard navigation and screen reader support
- Adaptive UI automatically adjusts based on OS capabilities
- Simplified interface on older Windows versions
Real-time Monitoring: View active recording sessions and data statistics
Settings Persistence: Automatically saves and restores your preferences
Fully Static Build: No external DLL dependencies - single executable deployment

Requirements

System Requirements

Windows 7 or later - Fully compatible with static C runtime
- Windows 7/8/8.1: System audio capture only (simplified UI)
- Windows 10 1909 and earlier (build < 19041): System audio capture only
- Windows 10 2004+ (build 19041+): Full per-process audio capture with process list
- Windows 11: Full per-process audio capture with process list
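
The version gating above can be sketched as a small decision function. This is only an illustration, not the application's actual code: the names captureSupportFor, CaptureSupport, and kPerProcessMinBuild are hypothetical, and the sketch assumes the caller has already obtained the OS major version and build number.

```cpp
#include <cstdint>

// Capture capabilities the UI could adapt to.
enum class CaptureSupport { SystemOnly, PerProcess };

// Build threshold taken from the list above (Windows 10 version 2004).
constexpr std::uint32_t kPerProcessMinBuild = 19041;

// Hypothetical helper: choose the capture mode from the OS version.
// Windows 11 also reports major version 10 (with build 22000 or higher),
// so a single build comparison covers both Windows 10 2004+ and Windows 11.
constexpr CaptureSupport captureSupportFor(std::uint32_t major, std::uint32_t build) {
    return (major >= 10 && build >= kPerProcessMinBuild)
               ? CaptureSupport::PerProcess
               : CaptureSupport::SystemOnly;
}
```

For example, captureSupportFor(10, 18363) (Windows 10 1909) yields SystemOnly, while captureSupportFor(10, 19041) yields PerProcess.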

Build Requirements

Visual Studio 2019 or later (with C++ Desktop Development workload)
Windows SDK 10.0.19041.0 or later
C++17 compatible compiler
CMake 3.15 or later
vcpkg (for managing dependencies)

Dependencies (via vcpkg)

libflac (statically linked)
opus (statically linked)
libogg (statically linked)
nlohmann-json (header-only)

Building the Project

Prerequisites

Install vcpkg dependencies with static runtime:

vcpkg install libflac:x64-windows-static-mt opus:x64-windows-static-mt libogg:x64-windows-static-mt nlohmann-json:x64-windows-static-mt

Note: The x64-windows-static-mt triplet ensures static linking of both the libraries and the C/C++ runtime, resulting in a fully self-contained executable with no DLL dependencies.

Using the Build Script

The easiest way to build is using the provided batch script:

build.bat

This will:

Configure CMake with vcpkg integration (static libraries)
Build the Release configuration with static C/C++ runtime
Copy the standalone executable to the package folder (no DLLs needed)

Manual Build with Visual Studio

Open a Developer Command Prompt for Visual Studio
Navigate to the project directory and create a build directory:

mkdir build
cd build

Generate Visual Studio project files with vcpkg and static runtime:

cmake .. -G "Visual Studio 17 2022" -A x64 -DCMAKE_TOOLCHAIN_FILE=C:/vcpkg/scripts/buildsystems/vcpkg.cmake -DVCPKG_TARGET_TRIPLET=x64-windows-static-mt

Build the project:

cmake --build . --config Release

The executable will be in build\bin\Release\AudioCapture.exe

Using CMake and Ninja

Open a Developer Command Prompt for Visual Studio
Navigate to the project directory and create a build directory:

mkdir build
cd build

Configure and build:

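cmake .. -G Ninja -DCMAKE_BUILD_TYPE=Release -DCMAKE_TOOLCHAIN_FILE=C:/vcpkg/scripts/buildsystems/vcpkg.cmake -DVCPKG_TARGET_TRIPLET=x64-windows-static-mt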
ninja

The executable will be in build\bin\AudioCapture.exe

Usage

Starting the Application

Run AudioCapture.exe (no installation or additional DLLs required)
The main window will display:
- Windows 10 2004+: A list of running processes with their window titles for per-process capture
- Older Windows: Simplified interface for system audio capture only

Basic Capture (Single Process)

Navigate to a Process: Use arrow keys or click on a process in the list
Choose Output Format: Select WAV, MP3, Opus, or FLAC from the dropdown
- For MP3/Opus: Choose bitrate from the dropdown that appears
- For FLAC: Choose compression level (0=fast, 8=best compression)
Optional Settings:
- Check "Skip silence" to avoid recording silent audio
- Check "Show only processes with active audio" to filter the list
Set Output Location: Use the Browse button to choose where recordings are saved (default: Documents\AudioCaptures)
Start Recording: Press Enter or click "Start Capture"
Monitor Progress: View active recordings in the "Active Recordings" list
Stop Recording: Select a recording and click "Stop Capture" (focus returns to process list)
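
As a rough idea of what the "Skip silence" option might do internally, the sketch below drops buffers whose peak level stays under a threshold. The threshold value (0.001, about -60 dBFS) and the function names are assumptions for illustration; the application may use a different metric or level.

```cpp
#include <cmath>
#include <vector>

// Assumed threshold: 0.001 of full scale, roughly -60 dBFS.
constexpr float kSilenceThreshold = 0.001f;

// Returns true when every sample's magnitude is below the threshold.
// An empty buffer counts as silent.
bool isSilent(const std::vector<float>& samples, float threshold = kSilenceThreshold) {
    for (float s : samples) {
        if (std::fabs(s) >= threshold) return false;
    }
    return true;
}

// Appends the buffer to the output only if it contains audible audio.
// Returns true when the buffer was written.
bool appendIfAudible(const std::vector<float>& buffer, std::vector<float>& out) {
    if (isSilent(buffer)) return false;
    out.insert(out.end(), buffer.begin(), buffer.end());
    return true;
}
```

A peak check like this is cheap enough to run on every buffer; an RMS-based check would be less sensitive to isolated clicks, at the cost of a little more arithmetic.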

System-Wide Audio Capture

To capture all system audio at once:

Select "[System Audio - All Processes]": This special entry at the top of the process list (PID 0) captures all system audio
Start Capture: Records everything your system is playing
Perfect for recording multiple applications together

Audio Monitoring (Passthrough to Output Device)

You can send captured audio to another audio device in real-time:

Enable "Monitor audio": Check the checkbox to activate passthrough
Select Monitor Device: Choose which output device to send audio to from the dropdown
Start Capture: Audio will play through both the original device and your selected monitor device
Monitor-Only Mode: Check "Monitor only - no recording" to listen without saving to disk
- Recording format and output path controls are disabled in this mode
- Useful for testing or temporary audio routing

Multiple Simultaneous Captures with Recording Modes

You can record multiple processes at once with flexible output options (Windows 10 2004+ only):

Recording Mode Selection

Choose from the Multi-process recording dropdown:

Separate files: Each process records to its own individual file
Combined file: All processes are mixed into a single output file
Both: Creates individual files AND a combined mixed file

Starting Multi-Process Capture

Check Multiple Processes: Click the checkbox next to each process you want to record
Select Recording Mode: Choose how you want the audio saved (Separate files / Combined file / Both)
Click "Start Capture": All checked processes will start recording simultaneously
Monitor Progress: View all active recordings in the "Active Recordings" list
Stop Individual Recordings: Select a recording and click "Stop Capture"
Stop All: Click "Stop All" to end all active recordings at once

Pause and Resume Recording

When multiple recordings are active, you can pause and resume them all at once:

Pause All: Pauses all active recordings - audio is not captured while paused
Resume All: Resumes all paused recordings from where they left off
The Pause All and Resume All buttons are enabled or disabled according to the current pause state
Individual recordings cannot be paused; pause and resume always apply to all recordings together

Recording Mode Examples

Separate files: Recording Discord, Spotify, and Chrome creates three files:
- Discord-2025_10_12-14_30_45.opus
- Spotify-2025_10_12-14_30_45.opus
- chrome-2025_10_12-14_30_45.opus
Combined file: All three applications are mixed together into:
- Combined-2025_10_12-14_30_45.opus
Both: Creates all four files (three individual + one combined)
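
The mapping from recording mode to output files can be sketched as follows. The names RecordingMode and plannedOutputFiles are hypothetical and only mirror the examples above; the timestamp is passed in as an already formatted string.

```cpp
#include <string>
#include <vector>

enum class RecordingMode { SeparateFiles, CombinedFile, Both };

// Hypothetical helper: list the files a capture would create for the
// given mode, following the naming shown in the examples above.
std::vector<std::string> plannedOutputFiles(RecordingMode mode,
                                            const std::vector<std::string>& sources,
                                            const std::string& timestamp,
                                            const std::string& extension) {
    std::vector<std::string> files;
    // Separate files and Both: one file per captured source.
    if (mode != RecordingMode::CombinedFile) {
        for (const auto& name : sources) {
            files.push_back(name + "-" + timestamp + "." + extension);
        }
    }
    // Combined file and Both: one mixed file.
    if (mode != RecordingMode::SeparateFiles) {
        files.push_back("Combined-" + timestamp + "." + extension);
    }
    return files;
}
```

With the sources Discord, Spotify, and chrome, the Both mode returns the three individual names followed by the combined one, matching the four files listed above.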

Microphone Input Capture

You can capture microphone or line-in audio along with application audio:

Setting Up Microphone Capture

Enable Microphone: Check the "Capture microphone" checkbox
Select Device: Choose your microphone from the dropdown that appears
Start Capture: The microphone will be captured according to your recording mode

Microphone with Recording Modes

The microphone integrates seamlessly with multi-process recording modes:

Separate files mode: Microphone records to its own file
- Creates: Microphone-2025_10_12-14_30_45.opus
- Each application also gets its own file
Combined file mode: Microphone audio is mixed with application audio
- Microphone audio is included in: Combined-2025_10_12-14_30_45.opus
- No separate microphone file is created (appears as "Monitor Only" in the list)
- Perfect for recording commentary over gameplay or music
Both mode: Microphone creates its own file AND is included in the combined file
- Creates: Microphone-2025_10_12-14_30_45.opus (separate mic file)
- Microphone also mixed into: Combined-2025_10_12-14_30_45.opus
- Best for maximum flexibility

Use Cases

Gaming Commentary: Capture game audio + microphone in one file (Combined mode)
Music Recording: Record DAW output + microphone vocals separately (Separate files mode)
Podcast Recording: Capture multiple apps + microphone with both individual tracks and mixed output (Both mode)

Filtering Processes

Show only processes with active audio: Check this box to see only applications currently playing sound
Window titles: The "Window Title" column helps identify processes (e.g., "YouTube - Chrome" vs "Gmail - Chrome")

Output Files

Files are automatically named using the pattern:

{ProcessName}-YYYY_MM_DD-HH_MM_SS.{extension}

For example: chrome-2025_10_12-14_30_45.flac
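
A minimal sketch of how such a name could be assembled from a process name, a timestamp, and an extension. The function name makeCaptureFileName is hypothetical; only the pattern itself comes from this README.

```cpp
#include <cstdio>
#include <ctime>
#include <string>

// Hypothetical helper: build "{ProcessName}-YYYY_MM_DD-HH_MM_SS.{extension}".
std::string makeCaptureFileName(const std::string& processName,
                                const std::tm& when,
                                const std::string& extension) {
    char stamp[32];
    // std::tm stores years since 1900 and months as 0-11.
    std::snprintf(stamp, sizeof(stamp), "%04d_%02d_%02d-%02d_%02d_%02d",
                  when.tm_year + 1900, when.tm_mon + 1, when.tm_mday,
                  when.tm_hour, when.tm_min, when.tm_sec);
    return processName + "-" + stamp + "." + extension;
}
```

Passing a std::tm for 12 October 2025, 14:30:45 with the name chrome and the extension flac reproduces the example above.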

Technical Details

Audio Capture Method

This application uses WASAPI (Windows Audio Session API) to capture audio from multiple sources:

Loopback Capture (Application Audio)

Uses IAudioClient with AUDCLNT_STREAMFLAGS_LOOPBACK for capturing application output
Supports true per-process capture on Windows 10 Build 19041+ (version 2004) using AUDIOCLIENT_ACTIVATION_TYPE_PROCESS_LOOPBACK
The per-process API is loaded dynamically for compatibility, so the application falls back gracefully to system-wide capture on Windows 7, 8, 8.1, and Windows 10 1909 and earlier
Adaptive UI automatically hides process list on unsupported OS versions

Input Device Capture (Microphone)

Uses IAudioClient with capture mode (eCapture data flow direction) for microphone input
Supports any WASAPI-compatible input device (microphones, line-in, etc.)
Automatically handles device enumeration and format negotiation

Real-time Mixing

Multiple audio streams are mixed in real-time using a dedicated mixer thread
Automatic sample rate conversion and format matching
32-bit float PCM mixing for maximum quality and dynamic range
Each stream can simultaneously record to its own file AND contribute to the mixed output
Default audio volume set to 100% (1.0x multiplier) for full recording level
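
To illustrate the float mixing step, here is a simplified sketch that sums streams sample by sample. It assumes all streams already share one sample rate and channel layout (resampling is left out), and the clamp to [-1, 1] is an assumption about how overflow might be handled; mixStreams is a hypothetical name, not the AudioMixer interface.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Sums all streams in 32-bit float, applies a gain multiplier
// (1.0 = 100%), and clamps the result to the valid range [-1, 1].
std::vector<float> mixStreams(const std::vector<std::vector<float>>& streams,
                              float gain = 1.0f) {
    // The output is as long as the longest input; shorter streams
    // simply contribute nothing past their end.
    std::size_t longest = 0;
    for (const auto& s : streams) longest = std::max(longest, s.size());

    std::vector<float> mixed(longest, 0.0f);
    for (const auto& s : streams) {
        for (std::size_t i = 0; i < s.size(); ++i) mixed[i] += s[i];
    }
    for (float& v : mixed) v = std::max(-1.0f, std::min(1.0f, v * gain));
    return mixed;
}
```

Mixing in float keeps headroom while summing, so clipping only has to be dealt with once, at the point where the mix is converted or written out.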

Supported Audio Formats

WAV (PCM)

Uncompressed audio (32-bit float or 16-bit PCM)
Highest quality, no loss
Largest file size
No additional codecs required
Best for further editing
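
When 16-bit PCM is chosen instead of 32-bit float, each sample has to be converted. One common way to do this is sketched below; the scaling factor (32767) and the rounding are assumptions, and WavWriter may handle the conversion differently.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Converts one float sample in [-1, 1] to signed 16-bit PCM.
// Out-of-range input is clamped first to avoid integer wrap-around.
std::int16_t floatToPcm16(float sample) {
    float clamped = std::max(-1.0f, std::min(1.0f, sample));
    return static_cast<std::int16_t>(std::lround(clamped * 32767.0f));
}

// Converts a whole buffer of float samples.
std::vector<std::int16_t> convertToPcm16(const std::vector<float>& samples) {
    std::vector<std::int16_t> out;
    out.reserve(samples.size());
    for (float s : samples) out.push_back(floatToPcm16(s));
    return out;
}
```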

MP3

Lossy compressed audio using Media Foundation
Good quality at smaller file sizes
Configurable bitrate: 128, 192, 256, or 320 kbps
Default: 192 kbps
Native Windows support, widely compatible

Opus

Modern lossy codec optimized for internet streaming
Excellent quality at low bitrates
Configurable bitrate: 64, 96, 128, 192, or 256 kbps
Default: 128 kbps
Stored in OGG container format

FLAC

Lossless compression (no quality loss)
Typically 40-60% of WAV size
Configurable compression levels 0-8
- Level 0: Fastest encoding, larger files
- Level 5: Default, good balance
- Level 8: Slowest encoding, smallest files
Ideal for archival and high-quality playback
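
To compare the formats above, a back-of-the-envelope size estimate can help. The sketch assumes 48 kHz stereo input and a constant bitrate, and it ignores header and container overhead; actual encoder output will vary. The function names are hypothetical.

```cpp
#include <cstdint>

// Raw PCM data size in bytes (WAV header not included).
constexpr std::uint64_t pcmBytes(std::uint32_t sampleRate, std::uint32_t channels,
                                 std::uint32_t bitsPerSample, std::uint32_t seconds) {
    return static_cast<std::uint64_t>(sampleRate) * channels * (bitsPerSample / 8) * seconds;
}

// Approximate payload size of a constant-bitrate stream,
// where 1 kbps = 1000 bits per second.
constexpr std::uint64_t encodedBytes(std::uint32_t kbps, std::uint32_t seconds) {
    return static_cast<std::uint64_t>(kbps) * 1000 / 8 * seconds;
}
```

Under these assumptions, one minute of 16-bit stereo WAV at 48 kHz is pcmBytes(48000, 2, 16, 60) = 11,520,000 bytes, while one minute of Opus at the default 128 kbps is encodedBytes(128, 60) = 960,000 bytes.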

Architecture

The application is structured into several components:

AudioCapture: WASAPI audio capture engine with support for both loopback (application audio) and input device (microphone) capture, plus real-time passthrough
AudioDeviceEnumerator: Enumerates available audio output devices (for monitoring) and input devices (microphones/line-in)
AudioMixer: Real-time audio mixer that combines multiple audio streams with automatic resampling and format conversion
ProcessEnumerator: Enumerates running processes, window titles, and audio sessions
CaptureManager: Manages multiple simultaneous capture sessions with silence detection and coordinated mixing
WavWriter: Writes uncompressed WAV files
Mp3Encoder: Encodes audio to MP3 using Media Foundation
OpusEncoder: Encodes audio to Opus in OGG container
FlacEncoder: Encodes audio to FLAC with configurable compression

Limitations and Known Issues

Current Limitations

Per-Process Capture OS Requirement: True per-process audio capture requires Windows 10 Build 19041+ (version 2004 or later). On older Windows versions (7/8/8.1/10 1909 and earlier), only system-wide audio capture is available. The UI automatically adapts to show only supported features.
Audio Monitoring Latency: Real-time audio passthrough operates with approximately 100ms latency. This value balances low delay against stable, glitch-free playback.
Elevated Processes: Cannot capture audio from processes running with higher privileges unless the application also runs elevated.
First Refresh May Be Slow (Windows 10 2004+ only): When you first click Refresh, the application fetches window titles for all processes, which can take a moment. Subsequent refreshes use cached data and are faster.
Audio Session Detection (Windows 10 2004+ only): The "Show only processes with active audio" filter checks for active audio sessions, which requires querying Windows Audio Session API and may add a slight delay.
Pause/Resume Granularity: Pause and resume work on all recordings at once, not individual recordings.
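
To put the monitoring latency figure above in concrete terms, the sketch below converts a latency in milliseconds to audio frames and to the 100-nanosecond units WASAPI uses for buffer durations (REFERENCE_TIME). The function names and the 48 kHz sample rate in the example are assumptions; how the application actually sizes its buffers is not documented here.

```cpp
#include <cstdint>

// Number of audio frames that fit in the given latency.
constexpr std::uint32_t framesForLatency(std::uint32_t sampleRate, std::uint32_t latencyMs) {
    return static_cast<std::uint32_t>(
        static_cast<std::uint64_t>(sampleRate) * latencyMs / 1000);
}

// Latency expressed in 100-nanosecond units (1 ms = 10,000 units),
// the unit WASAPI uses for REFERENCE_TIME buffer durations.
constexpr std::int64_t toReferenceTime(std::uint32_t latencyMs) {
    return static_cast<std::int64_t>(latencyMs) * 10000;
}
```

At 48 kHz, 100 ms corresponds to framesForLatency(48000, 100) = 4800 frames, or 1,000,000 REFERENCE_TIME units.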

Potential Improvements

Audio level (VU) meters and visualization for active recordings
Per-recording pause/resume (currently only all recordings at once)
Support for recording to multiple formats simultaneously
Audio processing filters (volume normalization, noise reduction, etc.)
Background worker thread for window title enumeration
Lower latency passthrough options (experimental sub-50ms modes)
Volume control per recording

Accessibility

The application uses standard Win32 controls for full accessibility:

List views with keyboard navigation and labeled columns
Checkboxes for multi-selection with keyboard support
Standard buttons with keyboard shortcuts (Tab/Enter navigation)
Proper tab order for all controls
Accessible names for all UI elements, exposed to assistive technology through the standard Windows accessibility APIs
Adaptive labels based on OS capabilities (prevents screen reader confusion on older Windows)
Screen reader compatible (tested with NVDA/JAWS on Windows 7 and Windows 10)
Smart focus management (adapts based on visible controls)

Troubleshooting

No Audio Captured

Ensure the target application is actually playing audio
Check that your system audio is not muted
Verify the application has permission to access audio devices
Try running as Administrator if capturing from elevated processes

Build Errors

Ensure Windows SDK is properly installed
Verify CMake version is 3.15 or later
Check that you're using a C++17 compatible compiler
Make sure all required Windows libraries are available

Application Crashes

The application should run on Windows 7 or later with no DLL errors
If you experience crashes on Windows 7, ensure you have the latest Windows updates installed
Check that Media Foundation is available (required for MP3 encoding)
Verify COM is properly initialized (automatic in this application)
Static runtime build eliminates MSVCP140.dll / VCRUNTIME140.dll dependency errors

License

This project is provided as-is for educational and personal use.

Credits

Built using:

Windows Audio Session API (WASAPI)
Media Foundation for MP3 encoding
libFLAC for FLAC encoding
libopus and libogg for Opus encoding
nlohmann-json for settings persistence
Win32 API for user interface
CMake for build system
vcpkg for dependency management

