📽️ Audio Video-based AI Synthesis Guide

💡 Audio & Video Synthesis Software

Resource	Category	Link
Models & LoRA's on Civitai	🌐 Download Image Diffusion Models	Link (opens in a new tab)
AUTOMATIC1111's stable-diffusion-webui	🖼️ Image-based AI Diffusion Software	Link (opens in a new tab)
lllyasviel's ControlNet	🖼️ Image-based AI Diffusion Control Software	Link (opens in a new tab)
comfyanonymous's ComfyUI	🖼️ Image-based AI Diffusion Control Software	Link (opens in a new tab)
CiaraStrawberry's TemporalKit	📽️ Video-based AI Diffusion & Synthesis Software	Link (opens in a new tab)
EbSynth	📽️ Video-based AI Diffusion & Synthesis Software	Link (opens in a new tab)
Runway	📽️ Video-based AI Diffusion & Synthesis Software	Link (opens in a new tab)
Elevenlabs	🔊 Audio-based AI Synthesis Software	Link (opens in a new tab)

🔋 Compute Resources

Do you have a GPU and CPU? Try running or self-hosting this at home! If you're short power, you can find more below.

⚡ Power

Resource	Category	Link
db0's AI Horde	Free AI Horde Compute	Link (opens in a new tab)
Petals	Free Distributed AI Compute	Link (opens in a new tab)
Runpod.io	Rent-a-Server / GPU	Link (opens in a new tab)
Vast.ai	Rent-a-Server / GPU	Link (opens in a new tab)

🌐 Download

Resource	Category	Link
Models & LoRA's on Civitai	🌐 Download Image Diffusion Models	Link (opens in a new tab)
Elevenlabs	🔊 Access Synthesized Audio Models	Link (opens in a new tab)

⏳ Install

Resource	Category	Link
How-to-Install	AUTOMATIC1111's stable-diffusion-webui	Link (opens in a new tab)
How-to-Install	EbSynth & CiaraStrawberry's TemporalKit	Link (opens in a new tab)

📽️ A/V Synthesis Resources

These Free and Open-Source Software AI platforms provide the tools to run comprehensive image diffusion models on your individual servers, desktops, or laptops.

Each platform shares different advantages and stylizations. If you're unsure where to start, Stable Diffusion is a popular choice. Once you're acquainted with Stable Diffusion, you might want to venture into other platforms to broaden your exposure to video mediums and other image diffusion models and software.

Please note, the features and user experiences will vary across different platforms.

StableDiffusion (opens in a new tab)

Stable Diffusion is a text-to-image diffusion model capable of generating photo-realistic and stylized images. This is the free alternative to MidJourney. It is rumored that MidJourney originates from a version of Stable Diffusion that is highly modified, tuned, then made proprietary.

SDXL (opens in a new tab)

With Stable Diffusion XL (opens in a new tab), you can create descriptive images with shorter prompts and generate words within images. The model is a significant advancement in image generation capabilities, offering enhanced image composition and face generation that results in stunning visuals and realistic aesthetics.

👩‍🚀 AI Communities 👨‍🚀

Reddit

Fediverse

!fosai@lemmy.world (opens in a new tab)

HYPERION (Coming Soon!)
🎓 Enroll
- 💫 HyperTech Academy
☄️ Apply
- 🔬 Hyperion Technologies
🕹️ Play
- ☄️ HYPERION

✍️ Contribute to FOSAI ▲ XYZ

First, clone the repo (opens in a new tab) to your device and then run pnpm i in your terminal of choice to install the dependencies.

Then, run pnpm dev to start the development server and visit localhost:3000.

From here, you should be able to see the 'pages' folder, which contains all of the webpage content you see here (editable in simple markdown).

🖼️ y Image-based AI Diffusion Hardware Requirements

📽️ Audio Video-based AI Synthesis Guide

💡 Audio & Video Synthesis Software

🔋 Compute Resources

⚡ Power

🌐 Download

⏳ Install

📽️ A/V Synthesis Resources

StableDiffusion (opens in a new tab)

SDXL (opens in a new tab)

ComfyUI (opens in a new tab)

ControlNet (opens in a new tab)

TemporalKit (opens in a new tab)

EbSynth (opens in a new tab)

WarpFusion (opens in a new tab)

Elevenlabs (opens in a new tab)

👩‍🚀 AI Communities 👨‍🚀

✍️ Contribute to FOSAI ▲ XYZ