π½οΈ Audio Video-based AI Synthesis Guide
π‘ Audio & Video Synthesis Software
Resource | Category | Link |
---|---|---|
Models & LoRA's on Civitai | π Download Image Diffusion Models | Link (opens in a new tab) |
AUTOMATIC1111's stable-diffusion-webui | πΌοΈ Image-based AI Diffusion Software | Link (opens in a new tab) |
lllyasviel's ControlNet | πΌοΈ Image-based AI Diffusion Control Software | Link (opens in a new tab) |
comfyanonymous's ComfyUI | πΌοΈ Image-based AI Diffusion Control Software | Link (opens in a new tab) |
CiaraStrawberry's TemporalKit | π½οΈ Video-based AI Diffusion & Synthesis Software | Link (opens in a new tab) |
EbSynth | π½οΈ Video-based AI Diffusion & Synthesis Software | Link (opens in a new tab) |
Runway | π½οΈ Video-based AI Diffusion & Synthesis Software | Link (opens in a new tab) |
Elevenlabs | π Audio-based AI Synthesis Software | Link (opens in a new tab) |
π Compute Resources
Do you have a GPU and CPU? Try running or self-hosting this at home! If you're short power, you can find more below.
β‘ Power
Resource | Category | Link |
---|---|---|
db0's AI Horde | Free AI Horde Compute | Link (opens in a new tab) |
Petals | Free Distributed AI Compute | Link (opens in a new tab) |
Runpod.io | Rent-a-Server / GPU | Link (opens in a new tab) |
Vast.ai | Rent-a-Server / GPU | Link (opens in a new tab) |
π Download
Resource | Category | Link |
---|---|---|
Models & LoRA's on Civitai | π Download Image Diffusion Models | Link (opens in a new tab) |
Elevenlabs | π Access Synthesized Audio Models | Link (opens in a new tab) |
β³ Install
Resource | Category | Link |
---|---|---|
How-to-Install | AUTOMATIC1111's stable-diffusion-webui | Link (opens in a new tab) |
How-to-Install | EbSynth & CiaraStrawberry's TemporalKit | Link (opens in a new tab) |
π½οΈ A/V Synthesis Resources
These Free and Open-Source Software AI platforms provide the tools to run comprehensive image diffusion models on your individual servers, desktops, or laptops.
Each platform shares different advantages and stylizations. If you're unsure where to start, Stable Diffusion is a popular choice. Once you're acquainted with Stable Diffusion, you might want to venture into other platforms to broaden your exposure to video mediums and other image diffusion models and software.
Please note, the features and user experiences will vary across different platforms.
-
Stable Diffusion + ControlNet + TemporalKit + EbSynth Workflow (opens in a new tab)
-
Stable Diffusion + Warp Fusion Workflow (opens in a new tab)
StableDiffusion (opens in a new tab)
Stable Diffusion is a text-to-image diffusion model capable of generating photo-realistic and stylized images. This is the free alternative to MidJourney. It is rumored that MidJourney originates from a version of Stable Diffusion that is highly modified, tuned, then made proprietary.
SDXL (opens in a new tab)
With Stable Diffusion XL (opens in a new tab), you can create descriptive images with shorter prompts and generate words within images. The model is a significant advancement in image generation capabilities, offering enhanced image composition and face generation that results in stunning visuals and realistic aesthetics.
ComfyUI (opens in a new tab)
A powerful and modular stable diffusion GUI and backend. This new and powerful ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface.
ControlNet (opens in a new tab)
ControlNet is a neural network structure to control diffusion models by adding extra conditions. This is a very popular and powerful extension to add to AUTOMATIC111's stable-diffusion-webui.
TemporalKit (opens in a new tab)
An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension. You must install FFMPEG to path before running this.
EbSynth (opens in a new tab)
Bring your paintings to animated life. This software can be used in conjunction with StableDiffusion + ControlNet + TemporalKit workflows.
WarpFusion (opens in a new tab)
A TemporalKit alternative to produce video effects and animation styling.
Elevenlabs (opens in a new tab)
Create lifelike voiceovers for your content or use our AI voice generator as an easy-to-use text reader.
π©βπ AI Communities π¨βπ
Fediverse
HYPERION (Coming Soon!) |
---|
π Enroll |
- π« HyperTech Academy |
βοΈ Apply |
- π¬ Hyperion Technologies |
πΉοΈ Play |
- βοΈ HYPERION |
βοΈ Contribute to FOSAI β² XYZ
First, clone the repo (opens in a new tab) to your device and then run pnpm i
in your terminal of choice to install the dependencies.
Then, run pnpm dev
to start the development server and visit localhost:3000.
From here, you should be able to see the 'pages' folder, which contains all of the webpage content you see here (editable in simple markdown).