Diese Linkliste ist eine lebende Liste, ich werde je nach Zeit eine Datenbank einrichten um die Navigation zu erleichtern.
AI für Audio-Video-Text Darstellungen
Name | URL | Classification | Use case(s) | Additional Notes |
Chat GPT | LLM/ Text Generative | |||
Gemini | LLM/ Text Generative | |||
Fadr | Music / Steming Stereospuren splitten | |||
Musicfy | Music / Custom Voice, Instrument Model training | |||
Suno/Bark | Audio Generative | |||
Suno/Chirp | MultiStem Generative | |||
Midjourney | Image Generative | |||
KreaAi | Image Generative/Upscaler/ LoRa Training | |||
Ideogram | Image Generative | |||
StableDiffusion 1.5 (Local) | Image Generative | |||
StableDiffusion XL | Image Generative | |||
Adobe Firefly | Image Generative | |||
Vectorizer AI | Image to Vector | |||
Wondershare AI | Editing/BeatMatch/ Video | |||
RunwayML | Video Generative | |||
Colossyan | Video Generative | |||
Elevenlabs | Text to Speech, Speech to Speech Generative | |||
FL Studio Cloud AI | Music/Mastering/Steming | |||
iZotope | Mix/Mastering | |||
RipX AI | Music - Audio Restauration | |||
SynthGPT | Music / Instrument | |||
sonible Smart | Signal Processing | |||
RX 10 | Audio Restauration | |||
PlaceIt AI | Video/Image Generative with Mockup placing | |||
GitHub Codepilot | Text Generative | |||
Notion AI | Text Generative |
Music AI Models
Diese Linkliste hat Walter Werzowa in der GMPU bei "KI in der Musikproduktion" zu Verfügung gestellt. Ich habe nun vor diese auf meiner Website weiter zu pflegen mit meinen eigenen Erkenntnissen.
Name | URL | Paper URL | Parent company/owner (if applicable) | Use case(s) | Training Data Details | Additional notes | Last Modified |
[Unnamed Apple Diffusion - December 2023] | Apple | Audio synthesis,Style transfer | 1/17/2024 | ||||
AudioLDM | (academia - various authors) | Audio synthesis | "The datasets we used in this paper includes AudioSet (AS) (Gemmeke et al., 2017), AudioCaps (AC) (Kim et al., 2019), Freesound (FS)1 , and BBC Sound Effect library (SFX)2 . AS is currently the largest audio dataset, with 527 labels and over 5, 000 hours of audio data. AC is a much smaller dataset with around 49, 000 audio clips and text descriptions. Most of the data in AudioSet and AudioCaps are in-the-wild audio from YouTube, so the quality of the audio is not guaranteed. To expand the dataset, especially with high-quality audio data, we crawl the data from the FreeSound and BBC SFX datasets, which have a wide range of categories such as music, speech, and sound effects." | framed as open version of AudioLM from google | 1/16/2024 | ||
AudioSR | Academia + ByteDance | Upscaler | "The datasets used in this paper include MUSDB18-HQ [19], MoisesDB [20], MedleyDB [21], FreeSound [22] 2 , and the speech dataset from OpenSLR3 , which are downloaded by following the link provided by VoiceFixer [1]. All the audio data used are resampled at 48kHz sampling rate. The total duration of the training data is approximately 7000 hours. We utilize all these datasets to optimize VAE, LDM, and HiFi-GAN" | 1/16/2024 | |||
CLaMP | Microsoft | Audio Analysis / Metadata | "To facilitate the learning of relationships between natural language and symbolic music, we developed a dataset named WebMusicText (WebMT) by crawling an extensive collection of music-text pairs from the web. Our dataset comprises 1,448,750 pairs of music-text data, where all music files are in score-oriented formats (e.g., MusicXML, LilyPond, and ABC notation)." | 1/16/2024 | |||
Dance Diffusion | Harmonai/Stability AI | Audio synthesis | 11/28/2023 | ||||
Demucs Music Source Separation | Facebook Research | Source separation | "We curated an internal dataset composed of 3500 songs with the stems from 200 artists with diverse music genres. Each stem is assigned to one of the 4 sources according to the name given by the music producer (for instance ”vocals2”, ”fx”, ”sub” etc...). This labeling is noisy because these names are subjective and sometime ambiguous. For 150 of those tracks, we manually verified that the automated labeling was correct, and discarded ambiguous stems. We trained a first Hybrid Demucs model on MUSDB and those 150 tracks. " | 1/16/2024 | |||
Essentia | Universitat Pompeu Fabra | Audio synthesis,Sound/sample search,Source separation | 11/28/2023 | ||||
JEN | Futureverse | Audio synthesis,Text-to-Audio | " We use total 5k hours of high-quality private music data to train JEN-1. All music data consist of full-length music sampled at 48kHz with metadata composed of a rich textual description and additional tags information, e.g. , genre, instrument, mood/theme tags, etc." | 1/16/2024 | |||
LLark | Spotify | Audio Analysis / Metadata | "To construct our instruction-tuning datasets, we use a set of only publicly-available, open source, permissively-licensed music datasets. The datasets we use for training are summarized in Table 1. For each dataset, we collect the audio and any accompanying annotations available for that dataset. The audio from these sources consist of a variety of styles, ranging from classical to electronic music, rock, and experimental, and comprise approximately 164, 000 distinct tracks from which we ultimately construct approximately 1.2M instruction pairs over three task families. Since our audio encoder is limited to 25-second clips of audio, we crop the audio, selecting a random 25-second clip from each track (one clip per track is used)." | it's not a music generation dataset. its specifically for interacting with music with language. limited to 25 second snippets. but can talk to those snippets in some interesting ways. like ask specific descriptions on it like tempo or key | 1/16/2024 | ||
Lyria | Google | Audio synthesis | 11/28/2023 | ||||
MAEST | Music Technology Group (Universitat Pompeu Fabra) | Audio Analysis / Metadata | "We train our models using an in-house dataset with 3.3 M tracks mapped to the Discogs’ public metadata dump. 2 The training task consists of a multi-label classification of the top 400 music styles from Discogs’ taxonomy" | using discogs data to label sound | 1/16/2024 | ||
Make-An-Audio | ByteDance | Audio synthesis | "We train on a combination of several datasets: AudioSet, BBC sound effects, Audiostock, AudioCaps-train, ESC-50, FSD50K, Free To Use Sounds, Sonniss Game Effects, WeSoundEffects, MACS, Epidemic Sound, UrbanSound8K, WavText5Ks, LibriSpeech, and Medley-solos-DB. For audios without natural language annotation, we apply the pseudo prompt enhancement to construct captions aligned well with the audio. Overall we have ∼3k hours with 1M audio-text pairs for training data" | 1/16/2024 | |||
MuseNet | OpenAI | Music composition/songwriting | "We collected training data for MuseNet from many different sources. ClassicalArchives and BitMidi donated their large collections of MIDI files for this project, and we also found several collections online, including jazz, pop, African, Indian, and Arabic styles. Additionally, we used the MAESTRO dataset." | 1/16/2024 | |||
Music Agent | 11/28/2023 | ||||||
Music ControlNet | Adobe | Audio synthesis | "We train our models on a dataset of ≈1800 hours of licensed instrumental music with genre and mood tags. Our dataset does not have free-form text description, so we use classconditional text control of global musical style, as done in JukeBox" | 1/17/2024 | |||
MusicFM | Academia + ByteDance | Audio Analysis / Metadata | "We utilize two distinct datasets to train our foundation models. The first dataset consists of 160k hours of in-house music data, designed to align with the size of the data used to train MERT. The second dataset is the Free Music Archive (FMA) dataset [22], which comprises 8k hours of Creative Commons-licensed music audio" | 1/17/2024 | |||
MusicGen | Meta | Audio synthesis | "We use 20K hours of licensed music to train MUSICGEN. Specifically, we rely on an internal dataset of 10K high-quality music tracks, and on the ShutterStock and Pond5 music data collections2 with respectively 25K and 365K instrument-only music tracks. All datasets consist of full-length music sampled at 32 kHz with metadata composed of a textual description and information such as the genre, BPM, and tags" | 1/17/2024 | |||
MusicLM | Google | Audio synthesis | "By relying on pretrained and frozen MuLan, we need audioonly data for training the other components of MusicLM. We train SoundStream and w2v-BERT on the Free Music Archive (FMA) dataset (Defferrard et al., 2017), whereas the tokenizers and the autoregressive models for the semantic and acoustic modeling stages are trained on a dataset containing five million audio clips, amounting to 280k hours of music at 24 kHz." NOTE: MuLan itself is trained on: "a collection of 50 million internet music videos. From the soundtrack of each video, we extract a 30-second clip starting at the 30 second mark. We then apply a preexisting music audio detector and discard any clip that is less than half music content. After this filtering, we are left with approximately 44 million 30-second clips, which amounts to nearly 370K hours of audio." | 1/17/2024 | |||
Noise2Music | Google | Audio synthesis | "We employ a data mining pipeline to construct a large-scale training dataset of diverse music audio clips, each paired with multiple descriptive text labels. The text labels for the audio are generated by employing a pair of pretrained deep models: first, we use a large language model to generate a large set of generic music descriptive sentences as caption candidates; we then use a pre-trained music-text joint embedding model to score each unlabeled music clip against all the caption candidates and select the captions with the highest similarity score as pseudo labels for the audio clip. We are able to annotate O(150K) hours of audio sources this way to constitute our training data" | 1/17/2024 | |||
PESTO | Sony | Audio Analysis / Metadata | 11/28/2023 | ||||
RAVE | IRCAM | Audio synthesis | "Since our main target is the modelling of musical audio signals, we use an internal dataset composed of approximately 30 hours of raw recordings of strings in various configurations (monophonic solos and polyphonic group performances, with different styles and recording configurations)" | 1/17/2024 | |||
SingSong | Google | Audio synthesis | "The training set for SingSong is comprised of 1 million audio-only sources resulting in 46k hours of music." | 1/17/2024 | |||
SoundStream | Google | Production (other) | "We train SoundStream on three types of audio content: clean speech, noisy speech and music, all at 24 kHz sampling rate. For clean speech, we use the LibriTTS dataset [50]. For noisy speech, we synthesize samples by mixing speech 7 from LibriTTS with noise from Freesound [51]. We apply peak normalization to randomly selected crops of 3 seconds and adjust the mixing gain of the noise component sampling uniformly in the interval [−30 dB, 0 dB]. For music, we use the MagnaTagATune dataset [52]. We evaluate our models on disjoint test splits of the datasets above. In addition, we collected a real-world dataset, which contains both near-field and far-field (reverberant) speech, with background noise in some of the examples. Unless stated otherwise, objective and subjective metrics are computed on a set of 200 audio clips 2-4 seconds long, with 50 samples from each of the four datasets listed above (i.e., clean speech, noisy speech, music, noisy/reverberant speech)." | ML based neural codec that works for music aswell as speech. (think opus codec for compression) | 1/17/2024 | ||
Spleeter | Deezer | Source separation | "The models were trained on Deezer internal datasets " | reportedly incorporated into several different tools including those offered by iZotope, VirtualDJ, Algoriddim (creators of the djay app), SpectralLayers, and Acon Digital | 1/17/2024 | ||
Stable Audio (training+inference code) | Stability AI | Audio synthesis | "To train our flagship Stable Audio model, we used a dataset consisting of over 800,000 audio files containing music, sound effects, and single-instrument stems, as well as corresponding text metadata, provided through a deal with stock music provider AudioSparx. This dataset adds up to over 19,500 hours of audio." | 1/17/2024 | |||
MAGNeT | Meta | Audio synthesis | "We follow the same setup as in Copet et al. (2023) [editor note: this is MusicGen] and use 20K hours of licensed music to train MAGNET. Specifically, we rely on the same 10K high-quality music tracks, the ShutterStock, and Pond5 music data collections as used in Copet et al. (2023) 8 with respectively 25K and 365K instrument-only music tracks. All datasets consist of full-length music sampled at 32 kHz with metadata composed of a textual description and additional information such as the genre, BPM, and tags." | 1/19/2024 |
Music DataSets
Name | Type | Description | URL | Hours | License | Associated models | Use case(s) (from MODELS (RAW)) | Last Modified |
MAESTRO | MIDI | "MAESTRO (MIDI and Audio Edited for Synchronous TRacks and Organization) is a dataset composed of about 200 hours of virtuosic piano performances captured with fine alignment (~3 ms) between note labels and audio waveforms." | 200:00 | CC BY-NC-SA 4.0 DEED | MuseNet | Music composition/songwriting | 1/6/2024 | |
MusicCaps | Evaluation,Music + Text | Curated Evaluation dataset for MusicLM, taken from the AudioSet dataset and expanded with "an English aspect list and a free text caption written by musicians" | 15:20 | CC BY-SA 4.0 DEED | MusicLM,LLark | Audio synthesis,Audio Analysis / Metadata | 1/8/2024 | |
AudioSet | Audio + Text | "AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds." | 5800:00 | CC BY 4.0 DEED | Make-An-Audio,Noise2Music,AudioLDM | Audio synthesis,Audio synthesis,Audio synthesis | 1/8/2024 | |
BBC Sound Effects | Audio + Text | "The BBC Sound Effects Archive is available for personal, educational or research purposes. There are over 33,000 clips from across the world from the past 100 years. These include clips made by the BBC Radiophonic workshop, recordings from the Blitz in London, special effects made for BBC TV and Radio productions, as well as 15,000 recordings from the Natural History Unit archive. You can explore sounds from every continent - from the college bells ringing in Oxford to a Patagonian waterfall - or listen to a submarine klaxon or the sound of a 1969 Ford Cortina door slamming shut." | Make-An-Audio,AudioLDM | Audio synthesis,Audio synthesis | 1/8/2024 | |||
AudioCaps | Audio + Text | "A large-scale dataset of about 46K audio clips to human-written text pairs collected via crowdsourcing on the AudioSet dataset. The collected captions of AudioCaps are indeed faithful for audio inputs." | Make-An-Audio,AudioLDM | Audio synthesis,Audio synthesis | 1/8/2024 | |||
ESC-50 | Audio + Text | "The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification. The dataset consists of 5-second-long recordings organized into 50 semantical classes (with 40 examples per class) loosely arranged into 5 major categories" | 2:42 | CC BY-NC 3.0 DEED | Make-An-Audio | Audio synthesis | 1/7/2024 | |
FSD50K | Audio + Text | "Freesound Dataset 50k (or FSD50K for short) is an open dataset of human-labeled sound events containing 51,197 Freesound clips unequally distributed in 200 classes drawn from the AudioSet Ontology [1]. FSD50K has been created at the Music Technology Group of Universitat Pompeu Fabra." | 108:18 | CC0 ... CC-BY CC-BY-NC CC Sampling+ | Make-An-Audio,AudioLDM,AudioSR | Audio synthesis,Audio synthesis,Upscaler | 1/8/2024 | |
Free To Use Sounds | Audio + Text | "our exclusive collection from 29+ global destinations, spanning mono, stereo, VR, DMS & 5.1 recordings. Unlock 2TB of premium soundscapes & sound effects with over 12,000 recordings. " \ | 175:43 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
Sonniss Game Effects | Audio + Text | "Each year we give away thousands of dollars worth of sounds for free in celebration of the Game Developers Conference. This is our achieve. All of the sound effects are royalty free and commercially usable. No attribution is required and you can use them on an unlimited number of projects for the rest of your lifetime. If you would like more options and design choices to work with, please consider purchasing the corresponding collection. All of the files we send out are just a small sample of our suppliers complete collection. We’re giving you a taste of what we have to offer with a selection of sounds from each library added over the years. These files are straight from the source and haven’t been tampered with in any way, they are exactly the same files as we sell." | 84:36 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
MACS | Audio + Text | "This is a dataset containing audio captions and corresponding audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool. Each file is annotated by multiple annotators that provided tags and a one-sentence description of the audio content." | 10:54 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
Epidemic Sound | Audio + Text | "Discover our catalog of over 90,000 SFX. Whoosh, cartoon or ghost sound effects - with over 300 different sound effects categories you are sure to find what you are looking for" | 220:43 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
UrbanSound8K | Audio + Text | "This dataset contains 8732 labeled sound excerpts (<=4s) of urban sounds from 10 classes: air_conditioner, car_horn, children_playing, dog_bark, drilling, enginge_idling, gun_shot, jackhammer, siren, and street_music. The classes are drawn from the urban sound taxonomy. For a detailed description of the dataset and how it was compiled please refer to our paper. All excerpts are taken from field recordings uploaded to www.freesound.org. The files are pre-sorted into ten folds (folders named fold1-fold10) to help in the reproduction of and comparison with the automatic classification results reported in the article above." | 8:45 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
WavText5Ks | Audio + Text | "[a] collection consisting of 4525 audios, 4348 descriptions, 4525 audio titles and 2058 tags... sourced from two main website: BigSoundBank and SoundBible...While collecting the audio, we encountered empty audio files, incorrect download links, and empty metadata. We removed those entries from the final collection." | 25:28 | Make-An-Audio | Audio synthesis | 1/8/2024 | ||
LibriSpeech | Speech + Text | "LibriSpeech is a corpus of approximately 1000 hours of 16kHz read English speech, prepared by Vassil Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and has been carefully segmented and aligned." | 1000:00 | CC BY 4.0 | Make-An-Audio | Audio synthesis | 1/8/2024 | |
Medley-solos-DB | Music + Text,Evaluation | "Medley-solos-DB is a cross-collection dataset for automatic musical instrument recognition in solo recordings. It consists of a training set of 3-second audio clips, which are extracted from the MedleyDB dataset of Bittner et al. (ISMIR 2014) as well as a test set set of 3-second clips, which are extracted from the solosDB dataset of Essid et al. (IEEE TASLP 2009). Each of these clips contains a single instrument among a taxonomy of eight: clarinet, distorted electric guitar, female singer, flute, piano, tenor saxophone, trumpet, and violin." | 17:48 | CC BY 4.0 | Make-An-Audio | Audio synthesis | 1/8/2024 | |
YouTube8M-MusicTextClips | Music + Text | "The YouTube8M-MusicTextClips dataset consists of over 4k high-quality human text descriptions of music found in video clips from the YouTube8M dataset. For each selected YouTube music video, we extracted 10 second clips at the middle of the video for annotation. We provided annotators with only the audio corresponding to this clip. Thus, text annotations describe audio alone, not the visual content of the clip." | 11:34 | Adobe Research License | LLark | Audio Analysis / Metadata | 1/8/2024 | |
MusicNet | Music + Text | "MusicNet is a collection of 330 freely-licensed classical music recordings, together with over 1 million annotated labels indicating the precise time of each note in every recording, the instrument that plays each note, and the note's position in the metrical structure of the composition. The labels are acquired from musical scores aligned to recordings by dynamic time warping. The labels are verified by trained musicians; we estimate a labeling error rate of 4%. We offer the MusicNet labels to the machine learning and music communities as a resource for training models and a common benchmark for comparing results." | 34:04 | CC BY 4.0 | LLark | Audio Analysis / Metadata | 1/8/2024 | |
FMA (Free Music Archive) | Music + Text | 8232:00 | Metadata: CC BY 4.0 Underlying audio: Various types of CC | LLark,[Unnamed Apple Diffusion - December 2023],MusicFM | Audio Analysis / Metadata,Audio synthesis,Style transfer,Audio Analysis / Metadata | 1/8/2024 | ||
MTG-Jamendo | Music + Text | "built using music available at Jamendo under Creative Commons licenses and tags provided by content uploaders. The dataset contains over 55,000 full audio tracks with 195 tags from genre, instrument, and mood/theme categories. We provide elaborated data splits for researchers and report the performance of a simple baseline approach on five different sets of tags: genre, instrument, mood/theme, top-50, and overall." | Metadata: CC BY-NC-SA 4.0 DEED Underlying audio: Various levels of CC as defined by the Artist on Jamendo | LLark | Audio Analysis / Metadata | 1/8/2024 | ||
MagnaTagATune | Music + Text | 208:20 | CC BY-NC-SA 3.0 DEED | LLark,SoundStream | Audio Analysis / Metadata,Production (other) | 1/8/2024 | ||
MUSDB18 | Music + Stems | "The sigsep musdb18 data set consists of a total of 150 full-track songs of different styles and includes both the stereo mixtures and the original sources, divided between a training subset and a test subset. Its purpose is to serve as a reference database for the design and the evaluation of source separation algorithms. The objective of such signal processing methods is to estimate one or more sources from a set of mixtures, e.g. for karaoke applications. It has been used as the official dataset in the professionally-produced music recordings task for SiSEC 2018, which is the international campaign for the evaluation of source separation algorithms." | 10:00 | Various Licenses | AudioSR | Upscaler | 1/8/2024 | |
MoisesDB | Music + Stems | "This comprehensive dataset comprises 240 previously unreleased songs created by 47 artists that span twelve high-level genres. The total duration of the dataset is 14 hours, 24 minutes and 46 seconds, where the average recording is 3:36 seconds, with a standard deviation of 66 seconds. The organizational structure of the dataset follows a taxonomy that reflects the needs of source separation systems. The large number of songs, the diverse types of stems and tracks, and their organization in a source-separation-focused taxonomy will allow researchers to build their own stems according to their own requirements, and thus develop more granular source separation systems." | 14:24 | NC-RCL | AudioSR | Upscaler | 1/8/2024 | |
MedleyDB | Music + Stems | "MedleyDB is a dataset of annotated, royalty-free multitrack recordings for noncommercial and academic research. MedleyDB was curated primarily to support research on melody extraction, addressing important shortcomings of existing collections. For each song we provide melody f0 annotations as well as instrument activations for evaluating automatic instrument recognition. The dataset is also useful for research on tasks that require access to the individual tracks of a song such as source separation and automatic mixing." | 7:17 | CC BY-NC-SA 4.0 DEED | AudioSR | Upscaler | 1/8/2024 |
Music AIs
Tool name | Tool URL | Parent company/owner (if applicable) | Use case(s) | How to access? | Monetized | Pricing URL | Target users | MAU (public) | Instagram URL | TikTok URL | Facebook URL | Youtube URL | Twitter URL | Discord | Soundcloud | Legal Links | Last Modified |
AIbtract | MIDI triggered audio,Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians,Solo content creators | NA | NA | NA | NA | NA | 12/21/2023 | |||||||
Aimi | Production (other),Music composition/songwriting,MIDI triggered audio,Audio synthesis | Web-based app,Mobile app | Beta | NA | Professional musicians,Hobbyist/casual musicians,Music service providers,Lean-back consumers,Software developers | NA | NA | NA | 1/19/2024 | ||||||||
AIMS API | Audio Analysis / Metadata | Web-based app | B2B | NA | Catalog | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
AIVA | Music composition/songwriting,MIDI triggered audio | Web-based app | Tier Subscription | Solo content creators,Professional content teams | NA | NA | NA | NA | NA | 12/20/2023 | |||||||
Amadeus Code | Music composition/songwriting,MIDI triggered audio | Mobile app | Per Use,Tier Subscription | Hobbyist/casual musicians,Professional musicians | NA | NA | NA | NA | NA | NA | NA | 1/4/2024 | |||||
Amper (shuttered) | Shutterstock | Music composition/songwriting | Web-based app | NA | Solo content creators,Professional content teams | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
AmpliTube | IK Multimedia | Production (other) | DAW plugin | Free | NA | Professional musicians | NA | 12/20/2023 | |||||||||
Anthemscore | Audio transcription | Desktop app | Per Product | NA | Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Archives | Sound/sample search | Web-based app | Beta | NA | Lean-in consumers | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | ||||
Audiocipher | Music composition/songwriting | Desktop app,DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | NA | 12/20/2023 | |||||||
Audionamix | Source separation | Web-based app,Model APIs/code libraries (open-source) | Tier Subscription | Professional musicians,Software developers | NA | NA | NA | 12/20/2023 | |||||||||
AudioSep: Separate Anything You Describe | Source separation | Web-based app | Free | NA | Researchers (non-commercial) | NA | NA | NA | NA | NA | NA | NA | MIT License | 12/20/2023 | |||
AudioShake | Source separation | Web-based app,Model APIs/code libraries (closed) | Tier Subscription,B2B | Professional musicians,Professional content teams | NA | NA | 1/15/2024 | ||||||||||
AudioStellar | Music composition/songwriting,Sound/sample search | DAW plugin,Desktop app | Free | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | 12/21/2023 | ||||||||
AUX | Audio synthesis | Desktop app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | N/A | NA | NA | 1/5/2024 | |||||||||
BandLab SongStarter | BandLab | Music composition/songwriting,Source separation | DAW plugin,Web-based app | Tier Subscription | Hobbyist/casual musicians | NA | 1/4/2024 | ||||||||||
Banger: AI Cover Songs & Music | 42 Digital | Source separation,Timbre transfer | Mobile app | Per Product | NA | Lean-back consumers | N/A | N/A | N/A | N/A | N/A | N/A | N/A | 12/20/2023 | |||
Basic Pitch | Spotify | Audio transcription | Web-based app | Free | NA | Professional musicians,Hobbyist/casual musicians | N/A | N/A | N/A | N/A | N/A | N/A | 12/20/2023 | ||||
Beatoven | Music composition/songwriting,MIDI triggered audio | Web-based app | Per Use,Tier Subscription | Solo content creators,Professional content teams | 60 | NA | NA | 1/25/2024 | |||||||||
Boomy | Music composition/songwriting,Voice/speech synthesis | Web-based app | Tier Subscription | Hobbyist/casual musicians,Solo content creators | NA | NA | https://boomy.com/privacy https://boomy.com/terms https://support.boomy.com/hc/en-us/sections/14581038360845-Rights-Management https://support.boomy.com/hc/en-us/articles/14639031498381-Can-I-add-my-Boomy-music-to-an-NFT- https://support.boomy.com/hc/en-us/articles/14581471021581-Can-I-release-my-Boomy-music-with-other-distributors- https://support.boomy.com/hc/en-us/articles/14638984496397-Can-I-upload-my-own-music-to-Boomy-for-distribution- https://support.boomy.com/hc/en-us/articles/14639606025869-Can-I-use-my-Boomy-song-in-a-video-game- https://support.boomy.com/hc/en-us/articles/16238363656845-Does-the-license-for-my-songs-end-if-I-cancel-my-membership- https://support.boomy.com/hc/en-us/articles/16237103661581-What-licensing-is-available-for-the-different-Boomy-Memberships- https://support.boomy.com/hc/en-us/articles/14581348824973-Who-owns-the-copyright-to-Boomy-songs- https://support.boomy.com/hc/en-us/articles/15261808044301-Who-owns-the-rights-to-Boomy-songs- | 1/25/2024 | |||||||||
Botnik | Lyrics/text generation,Music composition/songwriting | Web-based app | B2B | NA | Professional musicians,Solo content creators | NA | NA | NA | 12/20/2023 | ||||||||
BrainRap | VerseBooks | Lyrics/text generation | Web-based app | NA | NA | Professional musicians | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Bronze.ai | Production (other) | Model APIs/code libraries (closed) | B2B | NA | Professional musicians | N/A | N/A | N/A | N/A | N/A | 1/25/2024 | ||||||
Captain Chords | Music composition/songwriting | DAW plugin | Per Product | Professional musicians | N/A | N/A | N/A | N/A | N/A | 12/21/2023 | |||||||
Cassette | Music composition/songwriting,Audio synthesis | Web-based app,Model APIs/code libraries (closed) | Tier Subscription | Hobbyist/casual musicians | 15 | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | ||||
Chameleon 2 | Accentize | Production (other) | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | 1/5/2024 | |||||||
Timbre transfer | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | 1/5/2024 | ||||||||||
Coqui🐸 XTTS | Voice/speech synthesis,Timbre transfer | Web-based app | Free,B2B | NA | Researchers (non-commercial) | NA | NA | NA | NA | NA | NA | NA | 1/5/2024 | ||||
CoSo | Splice | Music composition/songwriting | Mobile app | Free | NA | Hobbyist/casual musicians,Professional musicians | NA | NA | NA | NA | 12/21/2023 | ||||||
Cyanite | Audio Analysis / Metadata | Web-based app | Per Use,Tier Subscription | Professional musicians,Catalog | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||||
DAACI | Music composition/songwriting,MIDI triggered audio | DAW plugin,Desktop app | NA | NA | Professional musicians,Professional content teams | NA | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Databass.ai | Text-to-Audio,Music composition/songwriting,Audio synthesis,Source separation | Web-based app | Tier Subscription | Professional musicians | NA | NA | 12/21/2023 | ||||||||||
Deep Flow | Beatopia | Lyrics/text generation | Web-based app | Tier Subscription | Professional musicians,Professional content teams | NA | NA | NA | NA | NA | 12/20/2023 | ||||||
Delphos | MIDI triggered audio,Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians | NA | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Dream Track | Google | Timbre transfer,Music composition/songwriting,Audio synthesis | Unreleased | Beta | NA | Professional musicians | NA | NA | NA | NA | NA | 12/21/2023 | |||||
DynaScore | Wonder | Audio synthesis | Desktop app,Mobile app | Tier Subscription | Professional content teams,Software developers | NA | NA | 12/20/2023 | |||||||||
Emergent Drums | Audio synthesis,Production (other) | DAW plugin | Per Product | Professional musicians | NA | NA | NA | 12/20/2023 | |||||||||
Emvoice | Voice/speech synthesis | DAW plugin | Per Product | Professional musicians,Hobbyist/casual musicians | NA | 12/20/2023 | |||||||||||
Endel | Production (other) | Web-based app,Mobile app | Tier Subscription | Lean-back consumers,Music service providers | 1,000,000 + | NA | NA | NA | 1/25/2024 | ||||||||
Essentia | Universitat Pompeu Fabra | Audio synthesis,Sound/sample search,Source separation | Model APIs/code libraries (open-source) | Free | NA | Software developers | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Fadr | Music composition/songwriting,Source separation,Timbre transfer,Audio synthesis | Web-based app | Tier Subscription | Professional musicians | NA | NA | NA | 1/26/2024 | |||||||||
FAST Limiter | Focusrite | Mixing/mastering | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
FL Studio | Image-Line | Source separation | Desktop app | Per Product,Tier Subscription | Professional musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Flowmachines | SonyCSL | Music composition/songwriting | Desktop app | Free | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Futureverse | Audio synthesis | Unreleased | NA | NA | Professional musicians,Solo content creators | NA | NA | NA | 12/21/2023 | ||||||||
Getsound.ai | Music composition/songwriting | Mobile app,Desktop app | Tier Subscription | Lean-back consumers | NA | NA | NA | 12/21/2023 | |||||||||
Hooky | Timbre transfer | Unreleased | B2B | Professional musicians | NA | NA | NA | NA | NA | NA | NA | 1/5/2024 | |||||
Hydra | Rightsify | Music composition/songwriting,Text-to-Audio,Audio synthesis | 1/26/2024 | ||||||||||||||
Infinite Album | Music composition/songwriting,Audio synthesis | Web-based app | Tier Subscription | Solo content creators | NA | NA | NA | 12/20/2023 | |||||||||
Infinite Drum Machine | Google | UNCLEAR | Web-based app | Free | NA | Researchers (non-commercial),Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Kits.ai | Arpeggi Labs | Timbre transfer | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | 1/5/2024 | |||||||||
KORUS | Pixlynx | Music composition/songwriting | Web-based app | Free | NA | Professional musicians,Hobbyist/casual musicians | 25 | NA | NA | NA | NA | 1/4/2024 | |||||
LALAL.AI | OmniSale GMBH | Production (other) | Web-based app | Per Use | Professional musicians | NA | NA | 12/21/2023 | |||||||||
LANDR Mastering Plugin | LANDR | Mixing/mastering | DAW plugin | Tier Subscription | Professional musicians | NA | NA | https://www.landr.com/terms-of-service/ https://www.landr.com/terms-of-service/ https://www.landr.com/network-terms-of-service/ https://www.landr.com/sessions-terms-of-service/ https://www.landr.com/projects-terms-of-services/ https://www.landr.com/privacy/ https://www.landr.com/acceptable-use/ https://www.landr.com/copyrights/ | 12/21/2023 | ||||||||
Lemonaide Seeds | Lemonaide Music | Music composition/songwriting | Desktop app,Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 12/21/2023 | ||||||
LifeScore | Music composition/songwriting | Web-based app | B2B | NA | Music service providers,Software developers | NA | NA | NA | NA | NA | NA | NA | NA | 1/25/2024 | |||
Loudly AI Studio | Loudly | Music composition/songwriting | Mobile app | Tier Subscription | Solo content creators,Professional content teams | NA | NA | NA | NA | 12/21/2023 | |||||||
Lyric Studio | WAVE AI | Lyrics/text generation | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | look again | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Lyric Studio - Rap Rhyme Maker | Lyrics/text generation | Mobile app | Tier Subscription | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Magenta Studio | Google | Music composition/songwriting | DAW plugin,Model APIs/code libraries (open-source) | Open Source | NA | Professional musicians,Researchers (non-commercial),Software developers,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Masterchannel.ai | Masterchannel | Mixing/mastering | Web-based app | Tier Subscription | Professional musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Mawf | ByteDance | Audio synthesis | DAW plugin | Beta | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | |||
Timbre transfer,Source separation | Mobile app | Per Product | NA | Hobbyist/casual musicians | 1,000,000 + | NA | 12/20/2023 | ||||||||||
Melodia | Search | Web-based app | Free | NA | Solo content creators,Professional content teams | NA | NA | NA | NA | 1/4/2024 | |||||||
Melody Sauce | Music composition/songwriting | DAW plugin | Per Product | Professional musicians,Hobbyist/casual musicians | NA | NA | 12/20/2023 | ||||||||||
Melody Studio | WAVE AI | Lyrics/text generation,Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Micro Music | Production (other) | Web-based app,DAW plugin | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 12/21/2023 | ||||||||
MIDIjourney: groove and pitch | Korus Labs/Pixlynx | Music composition/songwriting | DAW plugin | Free | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 12/20/2023 | |||||
Mix.Audio | Neutune | Text-to-Audio,Music composition/songwriting,Audio synthesis | Web-based app | Tier Subscription | Professional musicians,Solo content creators,Professional content teams | NA | NA | NA | NA | NA | 1/4/2024 | ||||||
MixaudioDJ: Text2Music | Neutune | Text-to-Audio,Music composition/songwriting,Audio synthesis | Mobile app | Free | NA | Lean-in consumers,Lean-back consumers | NA | NA | NA | NA | NA | 1/4/2024 | |||||
Mixit: Sing & Create Covers | Sphereo Sound ltd | Lyrics/text generation | Mobile app | Free | NA | Lean-back consumers | NA | NA | NA | NA | 12/21/2023 | ||||||
Moises | Source separation,Timbre transfer,Lyrics/text generation | Desktop app,Mobile app | Tier Subscription | Professional musicians,Hobbyist/casual musicians,Software developers | NA | NA | 1/5/2024 | ||||||||||
Mubert | Mubert | Music composition/songwriting,Audio synthesis | Web-based app | Tier Subscription | Professional musicians,Solo content creators,Professional content teams | NA | NA | NA | NA | 12/20/2023 | |||||||
MubertAI | Mubert | Audio synthesis | Model APIs/code libraries (open-source) | NA | NA | Solo content creators,Professional content teams,Professional musicians | NA | NA | NA | NA | 12/21/2023 | ||||||
MuseNet | OpenAI | Music composition/songwriting | Model APIs/code libraries (closed) | Free | NA | Researchers (non-commercial) | NA | NA | NA | NA | 12/21/2023 | ||||||
Musicfy | Music composition/songwriting,Timbre transfer,Text-to-Audio | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | 500 | NA | NA | NA | NA | NA | NA | NA | 1/5/2024 | ||||
Musiio | SoundCloud | Sound/sample search | Model APIs/code libraries (open-source) | Per Use,Tier Subscription | Catalog | NA | NA | NA | NA | NA | 12/21/2023 | ||||||
Musika | Audio synthesis | Model APIs/code libraries (open-source) | Free | NA | Researchers (non-commercial) | NA | NA | NA | 12/21/2023 | ||||||||
Nectar (iZotope) | Soundwide | Mixing | DAW plugin | Per Product | Professional musicians | NA | NA | NA | 1/5/2024 | ||||||||
Neutone | Qosmo | Production (other) | DAW plugin | Free | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 12/20/2023 | |||||
ORB Producer Suite | ORB Plugins | Music composition/songwriting | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | 12/20/2023 | |||||||
Output Co-Producer | Sound/sample search,Audio synthesis,Text-to-Audio | Web-based app | Beta | NA | Professional musicians | NA | NA | NA | 12/21/2023 | ||||||||
Ozone (iZotope) | Soundwide | Mixing/mastering | DAW plugin | Per Product | Professional musicians | NA | NA | NA | 12/21/2023 | ||||||||
Pitch Studio | Pitch Inc. | Audio synthesis,Voice/speech synthesis | Mobile app | Per Product | NA | Lean-back consumers | NA | NA | 12/20/2023 | ||||||||
Plus music.ai | Audio synthesis | Desktop app | Tier Subscription,Per Use | Professional musicians,Professional content teams,Gaming developers | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||||
PreTube | Accentize | Production (other) | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | 1/5/2024 | |||||||
Reactional | Music composition/songwriting | Web-based app,Desktop app | Beta | Professional musicians,Gaming developers | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||||
Revocalize | Timbre transfer | Web-based app,DAW plugin,Model APIs/code libraries (closed) | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 1/5/2024 | |||||||
RIFFIT Reader | RIFFIT | Music composition/songwriting,Voice/speech synthesis,MIDI triggered audio | Web-based app | Free | Lean-in consumers | NA | NA | NA | NA | 12/21/2023 | |||||||
Riffusion | Audio synthesis | Model APIs/code libraries (open-source),Web-based app | Free | NA | Hobbyist/casual musicians,Professional musicians,Solo content creators | NA | NA | NA | NA | NA | 12/20/2023 | ||||||
Ripple | ByteDance | Music composition/songwriting | Mobile app | Free | NA | Hobbyist/casual musicians,Solo content creators | NA | NA | NA | NA | NA | NA | NA | 1/4/2024 | |||
RipX | Hit’n’Mix | Source separation | Desktop app | Per Product | Professional musicians | NA | NA | NA | 1/4/2024 | ||||||||
RoEx | Mixing | Web-based app | Tier Subscription,B2B | Professional musicians | NA | NA | NA | 1/5/2024 | |||||||||
SALMONN: Speech Audio Language Music Open Neural Network | Audio-to-text | Desktop app | Per Use | NA | Researchers (non-commercial) | NA | NA | NA | NA | NA | NA | NA | NA | 1/4/2024 | |||
SampleBrain | Aphex Twin | Audio synthesis | Desktop app | Free | NA | Professional musicians,Hobbyist/casual musicians,Software developers | NA | NA | NA | NA | NA | NA | NA | 1/4/2024 | |||
Serato Sample | Serato | Source separation | Desktop app | Per Product,Tier Subscription | Professional musicians | 1/4/2024 | |||||||||||
Singify | Timbre transfer | Web-based app | Tier Subscription | NA | Hobbyist/casual musicians | NA | NA | NA | 1/5/2024 | ||||||||
SOLARIA | Eclipsed Sounds | Voice/speech synthesis | DAW plugin | Per Product | Professional musicians | NA | NA | 1/4/2024 | |||||||||
Songmastr | Mixing/mastering | Web-based app | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 1/4/2024 | ||||||
Soundful | Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians,Solo content creators,Professional content teams | NA | 1/26/2024 | |||||||||||
SoundRaw | Music composition/songwriting | Web-based app | Tier Subscription | Solo content creators,Hobbyist/casual musicians,Professional content teams | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | |||||
Sounds Studio | Never Before Heard Sounds | Audio synthesis | Model APIs/code libraries (closed),Web-based app | Free | Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | 12/20/2023 | |||||
Soundverse | Music composition/songwriting,Lyrics/text generation,Audio synthesis | Web-based app | Tier Subscription | Professional musicians | 1305 | NA | NA | 12/20/2023 | |||||||||
Spindrop | Sound/sample search,Production (other) | Mobile app | Free | Lean-in consumers | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||||
Splash | Popgun Labs | Music composition/songwriting,Voice/speech synthesis,Lyrics/text generation | Mobile app,Web-based app | Tier Subscription | Professional musicians,Professional content teams | NA | NA | NA | 1/25/2024 | ||||||||
Spotify Voice Translation | Spotify | Voice/speech synthesis,Timbre transfer | Unreleased | NA | NA | Podcasters | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Stable Audio | Stability AI | Audio synthesis | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians,Solo content creators,Researchers (non-commercial) | NA | NA | NA | 12/20/2023 | ||||||||
Staccatto | Lyrics/text generation,Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | 12/20/2023 | |||||||||||
Stemroller | Source separation | Web-based app | Free | NA | Hobbyist/casual musicians,Lean-in consumers | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Stemz | MWM | Source separation | Mobile app | Tier Subscription | NA | Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||
Suno | Music composition/songwriting,Audio synthesis | Discord | Tier Subscription | Professional musicians,Hobbyist/casual musicians,Professional content teams,Solo content creators | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | |||||
Supertone | HYBE | Voice/speech synthesis,Timbre transfer | Model APIs/code libraries (closed) | Per Product | Software developers | NA | NA | NA | NA | NA | NA | NA | 1/5/2024 | ||||
Supertone Clear (fka GOYO) | Supertone | Source separation,Production (other) | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||||
Synplant2 | Sonic Charge | Production (other) | DAW plugin | Per Product | Professional musicians | NA | NA | 12/21/2023 | |||||||||
Synthesizer V | Dreamtonics | Voice/speech synthesis | Desktop app | Per Product | Professional musicians,Hobbyist/casual musicians | NA | NA | 12/21/2023 | |||||||||
TAIP | BABY Audio | Production (other) | DAW plugin | Per Product | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | 1/5/2024 | ||||||||
TextFX | Google | Lyrics/text generation | Web-based app | Free | NA | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/20/2023 | |||
The Strip | Phil Speiser | Mixing | DAW plugin | Per Product | Professional musicians | NA | NA | NA | NA | NA | 1/5/2024 | ||||||
These Lyrics Do Not Exist | Lyrics/text generation | Web-based app | Free | Hobbyist/casual musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | |||||
Triniti | CreateSafe | Music composition/songwriting,Audio synthesis | Discord,Web-based app | Free | NA | Professional musicians,Solo content creators,Hobbyist/casual musicians | 7,264 | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||
Tuney | Music composition/songwriting | Web-based app | Tier Subscription | Professional musicians,Solo content creators,Professional content teams | 8 | NA | NA | NA | NA | 1/25/2024 | |||||||
Uberduck | Voice/speech synthesis,Timbre transfer | Model APIs/code libraries (closed) | Tier Subscription | Professional musicians,Hobbyist/casual musicians,Professional content teams,Solo content creators | NA | NA | NA | 1/5/2024 | |||||||||
UniAudio: An Audio Foundation Model Toward Universal Audio Generation | Text-to-Audio | Model APIs/code libraries (open-source) | Free | NA | Researchers (non-commercial) | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Usample | UMG | Source separation | Web-based app | Per Product | Professional content teams,Professional musicians | NA | NA | NA | NA | NA | NA | NA | 12/21/2023 | ||||
Utopia | Audio Analysis / Metadata | Web-based app | B2B | Catalog,Professional musicians,Music service providers | NA | NA | NA | NA | NA | 12/21/2023 | |||||||
VOCALOID6 | Vocaloid | Voice/speech synthesis,Timbre transfer | DAW plugin | Per Product | Professional musicians | NA | NA | 1/5/2024 | |||||||||
Voice-Swap | Timbre transfer | Desktop app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | NA | NA | 1/26/2024 | |||||||
Voicemod | Music composition/songwriting,Voice/speech synthesis | Desktop app | Per Product,Tier Subscription | NA | Professional musicians,Professional content teams,Solo content creators | NA | NA | NA | 12/21/2023 | ||||||||
Voicestars | Timbre transfer | Web-based app | Tier Subscription | Hobbyist/casual musicians | NA | NA | NA | NA | NA | 12/21/2023 | |||||||
Voicify | Timbre transfer | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | 1/5/2024 | |||||||||
WarpSound | Music composition/songwriting,MIDI triggered audio | Web-based app | B2B | Lean-back consumers,Hobbyist/casual musicians,Software developers | NA | NA | 12/21/2023 | ||||||||||
Wavtool | Music composition/songwriting,MIDI triggered audio,Text-to-Audio | Web-based app | Tier Subscription | Professional musicians,Hobbyist/casual musicians | 5500 | NA | NA | NA | 12/20/2023 | ||||||||
Xiaoice / X Studio | Microsoft China | Voice/speech synthesis,Timbre transfer | Desktop app | Professional musicians | 1/5/2024 | ||||||||||||
XLN XO | XLN Audio | Production (other) | DAW plugin | Per Product | Professional musicians,Hobbyist/casual musicians | NA | NA | NA | 12/21/2023 |