_ | text2concept | _ |
---|---|---|
text2concept | : Concept Activation Vectors Directly from Text |
_ | text2im | _ |
---|---|---|
Im2Text and | text2im | : Associating Images and Texts for Cross-Modal Retrieval |
_ | text2live | _ |
---|---|---|
text2live | : Text-Driven Layered Image and Video Editing |
_ | text2mesh | _ |
---|---|---|
text2mesh | : Text-Driven Neural Stylization for Meshes |
_ | text2performer | _ |
---|---|---|
text2performer | : Text-Driven Human Video Generation |
_ | text2po | _ |
---|---|---|
text2po | s: Text-to-Point-Cloud Cross-Modal Localization |
_ | text2room | _ |
---|---|---|
text2room | : Extracting Textured 3D Meshes from 2D Text-to-Image Models |
_ | text2scene | _ |
---|---|---|
text2scene | : Generating Compositional Scenes From Textual Descriptions | |
text2scene | : Text-driven Indoor Scene Stylization with Part-Aware Details |
_ | text2shape | _ |
---|---|---|
text2shape | : Generating Shapes from Natural Language by Learning Joint Embeddings |
_ | text2sign | _ |
---|---|---|
text2sign | : Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks |
_ | text2sketch | _ |
---|---|---|
text2sketch | : Learning Face Sketch from Facial Attribute Text |
_ | text2tex | _ |
---|---|---|
text2tex | : Text-driven Texture Synthesis via Diffusion Models |
_ | text2video | _ |
---|---|---|
Spatial-Temporal Graphs for Cross-Modal | text2video | Retrieval |
text2video | -Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators | |
text2video | : An End-to-end Learning Framework for Expressing Text With Videos |