Skip to content

mratanusarkar/audim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Audim ✨

Documentation PyPI version Python versions GitHub Actions Workflow Status
License: Apache 2.0 Author: Atanu Sarkar Citation
Visitors PyPI Total Downloads PyPI Monthly Downloads

Audio Podcast Animation Engine

An animation and video rendering engine for audio-based and voice-based podcast videos.

| Documentation | Features | Getting Started | Quick Links |

πŸš€ Demo

podcast.mp4

A sample podcast video generated with Audim

Note

For this example, we have transformed a conversation between Grant Sanderson (from 3Blue1Brown) and Sal Khan (from Khan Academy) from YouTube into a visually engaging podcast video using Audim.

See docs/devblog/v0.0.7 for more details on how this video was generated.

πŸ”— Quick Links

  1. Getting Started
    • See Setup and ensure you have setup correctly before usage.
    • For developers and contributors, see Development.
  2. API Documentation
    • See API Docs for the audim API documentation.
  3. Usage and Examples
    • See Usage for usage examples.
  4. Dev Blog
    • See Dev Blog for the development blog of the project to gain more insights into the project.
    • See Changelog for the changelog of the project.

🎯 Introduction

Audim is an engine for precise programmatic animation and rendering of podcast videos from audio-based and voice-based file recordings.

✨ Features

  • πŸ’» Precise programmatic animations.
  • 🎬 Rendering of videos with layout based scenes.
  • πŸ“ Generate subtitles and transcripts from audio/video files.
  • 🎀 From subtitle and scene elements to podcast video generation.

πŸš€ Getting Started

Prerequisites

  • 🐍 Python β‰₯ 3.10
  • πŸ–₯️ Conda or venv
  • πŸŽ₯ FFmpeg (recommended, for faster video encoding)

Installation

1. Install Audim

It is recommended to install Audim in a virtual environment from PyPI or Conda in a Python=3.10 environment.

Install audim package from PyPI:

pip install audim
Install from source

By installing audim from source, you can explore the latest features and enhancements that have not yet been officially released. Please note that the latest changes may be still in development and may not be stable and may contain bugs.

Install from source

pip install git+https://github.com/mratanusarkar/audim.git

OR, you can also clone the repository and install the package from source:

Clone the repository

git clone https://github.com/mratanusarkar/audim.git

2. Install FFmpeg locally (recommended)

Using local FFmpeg is optional but recommended for speeding up the video encoding process.

On Ubuntu, install FFmpeg using:

sudo apt install ffmpeg libx264-dev

On Windows and other platforms, download and install FFmpeg from the official website:

Virtual environment and project setup for development with uv

Install uv and setup project environment:

IMPORTANT

If you are using conda base environment as the default base environment for your python projects, run the below command to activate the base environment. If not, skip this step and continue with the next step.

conda activate base
# Install uv
pip install uv

# Setup project environment
uv venv

source .venv/bin/activate   # on Linux
# .venv\Scripts\activate    # on Windows

uv pip install -e ".[dev,docs]"

Build and deploy documentation

You can build and serve the documentation by running:

uv pip install -e .[docs]
mkdocs serve

Code Quality

Before committing, please ensure that the code is formatted and styled correctly. Run the following commands to check and fix code style issues:

# Check and fix code style issues
ruff format .
ruff check --fix .

See Development for more details on how to setup the project environment and contribute to the project.

βš–οΈ License & Attribution

Audim is licensed under Apache 2.0. You can use it freely for personal and commercial projects.

Attribution is encouraged. If you use Audim, please:

  • Keep the default watermark in videos, OR
  • Add "Made with Audim" to video descriptions, OR
  • Link to this repo in your project documentation

See NOTICE file for complete attribution guidelines.

πŸ“„ Citation

If you use Audim in your project or research, please cite it as follows:

@software{audim,
  title = {Audim: Audio Podcast Animation Engine},
  author = {Sarkar, Atanu},
  year = {2025},
  url = {https://github.com/mratanusarkar/audim},
  version = {0.0.7}
}

You can also click the "Cite this repository" button on GitHub for other citation formats.

⚠️ Disclaimer

Warning

Early Development Stage

  • This project is actively under development and may contain bugs or limitations.
  • While stable for basic use cases, the rendering engine requires further development and testing across diverse scenarios.
  • The API is subject to change, so keep an eye at the documentation for the latest updates.

Tip

We encourage you to:

  • Try Audim for your projects and podcast videos.
  • Report issues when encountered.
  • Feel free to raise a PR to contribute and improve the project.

Your feedback and contributions help make Audim better for everyone!

About

An animation and video rendering engine for audio-based and voice-based podcast videos.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages