AI Models List
November 27, 2023
Building Detection (Geoscience)
- Datasets for satellite building segmentation
- Model for satellite city segmentation
- Segmenting Buildings in Satellite Images
- Mask2former swin large mapillary vistas panoptic
- Maskformer Satellite Trees
Voice Cloning
- Model for Kanye voice cloning
- Real Time Voice Cloning
- https://huggingface.co/anton-l/wav2vec2-xls-r-common_voice-tr-ft-stream
Image Super Resolution
- Search "super-resolution" on Hugging Face
- magnific
- Image Resolution Enhancer (IBM)
- keras-io/super-resolution
- ldm-super-resolution-4x-openimages
Image clear
Speech2Text
- Speech to Text
- Speech to Text 2
- turboscribe.ai
- wbbbbb/wav2vec2-large-chinese-zh-cn Important
- Google speech to text supported languages
- openai/whisper-large-v3
Digital Human
- CiroN2022/digital-human
- Kedreamix/Digital-Human-Weights
- Efficient 3D Articulated Human Generation with Layered Surface Volumes
- Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models
- Creating digital humans: Capture, Modeling, and Synthesis
- Digital-human on Sourceforge
- The World of Digital Humans: Where AI Meets Realism
- awesome-digital-human
- Digital Human Software for Blender
Image to Video
- motexture/VSeq2VSeq
- animate anything
- stabilityai/stable-video-diffusion-img2vid-xt
Text to Video
- hotshotco/Hotshot-XL
- cerspense/zeroscope_v2_576w
- damo-vilab/text-to-video-ms-1.7b
- damo-vilab/modelscope-damo-text-to-video-synthesis
- guoyww/animatediff-motion-adapter-v1-5-2
- camenduru/potat1
- guoyww/animatediff-motion-lora-zoom-in
Text to Speech
- coqui/XTTS-v2: supports Chinese, voice cloning, emotion, etc.
- speechbrain/tts-tacotron2-ljspeech
- facebook/fastspeech2-en-ljspeech: English only.
- suno/bark-small: needs more research; not yet good enough in the Colab example.
- microsoft/speecht5_tts: quality is good; need to find models for other languages.
- SpeechT5 on GitHub
- facebook/mms-tts-eng: the example's results are good; need to explore other languages.
- suno/bark: example on Google Colab.
Text to Image
Sketch
- TencentARC/t2i-adapter-sketch-sdxl-1.0 **
- TencentARC/t2iadapter_sketch_sd15v2 **
- cosc/sketchstyle-cutesexyrobutts
- microsoft/beit-base-patch16-224-pt22k-ft22k
- Linaqruf/sketch-style-xl-lora