Skip to content

yzihan/Generative-AI

Repository files navigation

Generative AI

Frontend

painter: https://github.com/aml2610/react-painter#readme

replit: https://replit.com/~

Stable Diffusion related

Stable Diffusion web UI: https://github.com/AUTOMATIC1111/stable-diffusion-webui

txt2mask: https://github.com/ThereforeGames/txt2mask

Img2img Video: https://github.com/memes-forever/Stable-diffusion-webui-video

Vid2Vid: https://github.com/Filarius/stable-diffusion-webui/blob/master/scripts/vid2vid.py

Image segmentation and recognition

semantic-segmentation: https://huggingface.co/nvidia/segformer-b0-finetuned-ade-512-512

plant recognition: https://web.plant.id/plant-identification-api/

texture recognition (need to apply): https://www.clarifai.com/models/texture-recognition

doodle-recognition: https://github.com/zhangchaodesign/doodle-recognition

Image style transfer

Arbitrary Neural Style Transfer: https://replicate.com/collections/style-transfer

Image search

image search: https://www.microsoft.com/en-us/bing/apis/bing-image-search-api

Graph-based

Spacy:https://spacy.io/api

Stanford OpenIE:https://nlp.stanford.edu/software/openie.html

Text generation

Demo-InferKit: https://app.inferkit.com/demo

Sassbook AI Story Generator: https://sassbook.com/ai-story-writer

Rytr-an AI writing assistant: https://rytr.me/

Title generation

OpenBMB: https://live.openbmb.org/ant

Text classification

Cohere: https://os.cohere.ai/playground/large/classify

Text analysis

Convert Unstructured Text Data Into Actionable Insights With Advanced Text Analysis: https://kpibees.com/

Semantic Role Labeling: https://demo.allennlp.org/semantic-role-labeling/semantic-role-labeling

Event extraction: https://huggingface.co/veronica320/QA-for-Event-Extraction

Sentiment analysis: https://huggingface.co/models?other=sentiment-analysis

Named entity recognition: https://huggingface.co/dslim/bert-base-NER

Continue images generation

StoryDall-E: https://github.com/adymaharana/storydalle?continueFlag=aecf3cf42991a37d09397fc61687c405 https://huggingface.co/spaces/ECCV2022/storydalle

Storygan: https://arxiv.org/pdf/1812.02784.pdf https://github.com/yitong91/StoryGAN

stable-diffusion-video: stable-diffusion-videos: https://github.com/nateraw/stable-diffusion-videos

StyleCLIP: https://github.com/orpatashnik/StyleCLIP

Generating image with specified character: https://github.com/XavierXiao/Dreambooth-Stable-Diffusion

Text-to-Video Generation

Phenaki: Variable Length Video Generation from Open Domain Textual Descriptions: https://phenaki.video/index.html

Make-A-Video: https://makeavideo.studio/

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model: https://mingyuan-zhang.github.io/projects/MotionDiffuse.html

CogVideo: https://github.com/THUDM/CogVideo

text-image-video: https://huggingface.co/spaces/Kameswara/TextToVideo

Imagen Video: https://imagen.research.google/video/

Magicvideo: Efficient video generation with latent diffusion models

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Video Diffusion Models

Image-to-Video Generation

Make it move: Controllable image-to-video generation with text descriptions

3D Generation

DreamFusion: Text-to-3D using 2D Diffusion: https://dreamfusion3d.github.io/; https://github.com/ashawkey/stable-dreamfusion

dreamfields-3D: https://github.com/shengyu-meng/dreamfields-3D

ZoeDepth:https://github.com/isl-org/ZoeDepth

Text to 3D Scene

https://github.com/oaishi/3DScene_from_text

https://nlp.stanford.edu/projects/text2scene.shtml

Representing Scenes Generation

https://www.matthewtancik.com/nerf

Depth Analysis

https://github.com/EPFL-VILAB/omnidata

Compositional Visual Generation

https://energy-based-model.github.io/Compositional-Visual-Generation-with-Composable-Diffusion-Models/

https://people.csail.mit.edu/lishuang/#Home

Dynamic Human

https://developer.nvidia.com/blog/human-like-character-animation-system-uses-ai-to-navigate-terrains/

Switch Human Body

https://github.com/NVIDIA/vid2vid

Image to Text

img2prompt: https://replicate.com/methexis-inc/img2prompt

Text to Image

Deforum Stable Diffusion: https://colab.research.google.com/github/deforum/stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb

stable diffusion demo: https://demo.rowy.io/table/imageGeneration

Disco Diffusion: https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb

Latent Diffusion: https://huggingface.co/spaces/multimodalart/latentdiffusion

Dreamstudio: https://beta.dreamstudio.ai/dream

clipdraw: https://deepai.org/publication/clipdraw-exploring-text-to-drawing-synthesis-through-language-image-encoders

styleclipdraw: https://github.com/pschaldenbrand/StyleCLIPDraw

Midjourney: https://www.midjourney.com/home/

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Image Editing with Text

Prompt-to-Prompt Image Editing with Cross-Attention Control: https://prompt-to-prompt.github.io/

Imagic: Text-Based Real Image Editing with Diffusion Models

Text augmentation

Bloom: https://huggingface.co/bigscience/bloom

OPT : Open Pre-trained Transformer Language Models: https://huggingface.co/facebook/opt-125m

seq2seq

Bart: https://huggingface.co/facebook/bart-base

Outpainting

stablediffusion-infinity: https://github.com/lkwq007/stablediffusion-infinity

Pytorch implementation

video-diffusion-pytorch: https://github.com/lucidrains/video-diffusion-pytorch

phenaki-pytorch: https://github.com/lucidrains/phenaki-pytorch

make-a-video-pytorch: https://github.com/lucidrains/make-a-video-pytorch

imagen-pytorch: https://github.com/lucidrains/imagen-pytorch

DALLE2-pytorch: https://github.com/lucidrains/DALLE2-pytorch

Access/Share State of Art Models

Hugging Face: https://huggingface.co/

Replicate: https://replicate.com/

Rapid API: https://rapidapi.com/hub

ml5.js: https://ml5js.org/

Pollinations: https://pollinations.ai/c/Anything

Some development issue

Access-Control-Allow-Origin: https://chrome.google.com/webstore/detail/allow-cors-access-control/lhobafahddgcelffkeicbaginigeejlf/related?hl=en

Some Human-AI Systems

StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement

StoryDrawer: A Child–AI Collaborative Drawing System to Support Children's Creative Visual Storytelling

I Lead, You Help but Only with Enough Details: Understanding User Experience of Co-Creation with Artificial Intelligence

FashionQ: An AI-Driven Creativity Support Tool for Facilitating Ideation in Fashion Design

StreamSketch: Exploring Multi-Modal Interactions in Creative Live Streams

CheXplain: Enabling Physicians to Explore and Understand Data-Driven, AI-Enabled Medical Imaging Analysis

Lessons Learned from Designing an AI-Enabled Diagnosis Tool for Pathologists

Improving Workflow Integration with xPath: Design and Evaluation of a Human-AI Diagnosis System in Pathology

Augmenting Pathologists with NaviPath: Design and Evaluation of a Human-AI Collaborative Navigation System

Human-Centered Tools for Coping with Imperfect Algorithms During Medical Decision-Making

Human–computer collaboration for skin cancer recognition

A Human-AI Collaborative Approach for Clinical Decision Making on Rehabilitation Assessment

Principles of mixed-initiative user interfaces

Believe it or not: Designing a human-AI partnership for mixed-initiative fact-checking

Datatone: Managing ambiguity in natural language interfaces for data visualization

Marvista: A Human-AI Collaborative Reading Tool

CrossA11y: Identifying Video Accessibility Issues via Cross-Modal Grounding

Revamp: Enhancing Accessible Information Seeking Experience of Online Shopping for Blind or Low Vision Users

SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers

DreamSketch: Early Stage 3D Design Explorations with Sketching and Generative Design

SmartManikin: Virtual Humans with Agency for Design Tools

Guided exploration of physically valid shapes for furniture design

Forte: User-driven generative design

A suggestive interface for 3D drawing

MAPGEN: Mixed-Initiative Planning and Scheduling for the Mars Exploration Rover Mission

Wordcraft: A Human-AI collaborative editor for story writing

STORIUM: A collaborative Story Generation Platform

Deep learning in a computational model for conceptual shifts in a co-creative design system

Collabdraw: An environment for collaborative sketching with an artificial agent

About

generative AI test

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published