Research
Bypass Paywalls, Read News Articles for Free
PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.
VisitInstantDrag: Fast, Interactive Drag-Based Image Editing
InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.
VisitHuggingChat macOS: AI Chat App for macOS
HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.
Visit
IOPaint: Open-Source Image Inpainting Tool
IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.
VisitInstaGraph: Knowledge Graph Generator from Text or URL
InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.
VisitTextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder
TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.
VisitOpen-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)
This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.
VisitGVHMR: World-Grounded Human Motion Recovery
This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.
VisitContextual Retrieval for Enhanced Retrieval-Augmented Generation
This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.
VisitExAvatar: Expressive 3D Gaussian Avatar
This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPLX with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.
VisitPDF to Audio Converter with AI
This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.
VisitImage Outpainting with Diffusers
This Hugging Face Space demonstrates image outpainting using Diffusers, a library for text-to-image generation. It provides a user-friendly interface to experiment with different models and parameters.
VisitNVIDIA Deep Learning Institute: AI and Accelerated Computing Training
NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.
VisitLate Chunking: Balancing Precision and Cost in Long Context Retrieval
Late chunking is a new method that aims to improve the precision and cost-effectiveness of long context retrieval systems. It addresses the challenges of traditional approaches by embedding the entire document before chunking, preserving contextual information.
VisitOpen-Source RAG Tool for Document Chat
Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.
VisitAI Research Assistant: NotebookLM
NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.
VisitSeedMusic: AI-Powered Music Generation and Editing
SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.
VisitRetrieval
Bypass Paywalls, Read News Articles for Free
PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.
VisitInstaGraph: Knowledge Graph Generator from Text or URL
InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.
VisitContextual Retrieval for Enhanced Retrieval-Augmented Generation
This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.
VisitNews
Bypass Paywalls, Read News Articles for Free
PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.
VisitMachine learning
InstantDrag: Fast, Interactive Drag-Based Image Editing
InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.
VisitHuggingChat macOS: AI Chat App for macOS
HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.
Visit
IOPaint: Open-Source Image Inpainting Tool
IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.
VisitInstaGraph: Knowledge Graph Generator from Text or URL
InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.
VisitTextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder
TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.
VisitOpen-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)
This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.
VisitContextual Retrieval for Enhanced Retrieval-Augmented Generation
This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.
VisitPDF to Audio Converter with AI
This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.
VisitImage Outpainting with Diffusers
This Hugging Face Space demonstrates image outpainting using Diffusers, a library for text-to-image generation. It provides a user-friendly interface to experiment with different models and parameters.
VisitNVIDIA Deep Learning Institute: AI and Accelerated Computing Training
NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.
VisitLate Chunking: Balancing Precision and Cost in Long Context Retrieval
Late chunking is a new method that aims to improve the precision and cost-effectiveness of long context retrieval systems. It addresses the challenges of traditional approaches by embedding the entire document before chunking, preserving contextual information.
VisitOpen-Source RAG Tool for Document Chat
Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.
VisitAI Research Assistant: NotebookLM
NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.
Visit
DrawingSpinUp: 3D Animation from Single Character Drawings
DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.
VisitSeedMusic: AI-Powered Music Generation and Editing
SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.
VisitAI Product Photo Generator
Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.
VisitComputer graphics
InstantDrag: Fast, Interactive Drag-Based Image Editing
InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.
Visit
IOPaint: Open-Source Image Inpainting Tool
IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.
VisitTextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder
TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.
VisitGVHMR: World-Grounded Human Motion Recovery
This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.
VisitExAvatar: Expressive 3D Gaussian Avatar
This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPLX with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.
VisitImage Outpainting with Diffusers
This Hugging Face Space demonstrates image outpainting using Diffusers, a library for text-to-image generation. It provides a user-friendly interface to experiment with different models and parameters.
VisitNVIDIA Deep Learning Institute: AI and Accelerated Computing Training
NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.
Visit
DrawingSpinUp: 3D Animation from Single Character Drawings
DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.
VisitAI Product Photo Generator
Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.
VisitNote taking
HuggingChat macOS: AI Chat App for macOS
HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.
VisitPDF to Audio Converter with AI
This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.
VisitOpen-Source RAG Tool for Document Chat
Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.
VisitAI Research Assistant: NotebookLM
NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.
VisitMusic
Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)
This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.
VisitSeedMusic: AI-Powered Music Generation and Editing
SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.
Visit3d animation
GVHMR: World-Grounded Human Motion Recovery
This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.
VisitExAvatar: Expressive 3D Gaussian Avatar
This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPLX with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.
Visit
DrawingSpinUp: 3D Animation from Single Character Drawings
DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.
VisitAI Product Photo Generator
Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.
Visit