Research

Bypass Paywalls, Read News Articles for Free

PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.

InstantDrag: Fast, Interactive Drag-Based Image Editing

InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.

HuggingChat macOS: AI Chat App for macOS

HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.

IOPaint: Open-Source Image Inpainting Tool

IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.

InstaGraph: Knowledge Graph Generator from Text or URL

InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.

Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)

This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.

GVHMR: World-Grounded Human Motion Recovery

This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.
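
The idea of adding context to a chunk before embedding can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the context strings are hand-written rather than generated, and the sample documents are invented.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' (hypothetical stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def contextualize(chunk: str, context: str) -> str:
    """Prepend document-level context to a chunk before it is embedded."""
    return f"{context} {chunk}"


# (context, chunk) pairs; both chunks are ambiguous without their context.
chunks = [
    ("acme q2 2023 report", "revenue grew 3% over the previous quarter"),
    ("globex q2 2023 report", "revenue fell 1% over the previous quarter"),
]
query = embed("acme revenue growth")

# Plain chunks: the query cannot tell the two companies apart.
plain_scores = [cosine(query, embed(chunk)) for _, chunk in chunks]

# Contextualized chunks: the company name is now part of the embedded text.
ctx_scores = [cosine(query, embed(contextualize(chunk, ctx))) for ctx, chunk in chunks]
```

In this toy setup the plain chunks score identically against the query, while the contextualized version ranks the matching company's chunk higher.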

ExAvatar: Expressive 3D Gaussian Avatar

This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPL-X with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.

PDF to Audio Converter with AI

This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.

Image Outpainting with Diffusers

This Hugging Face Space demonstrates image outpainting using Diffusers, Hugging Face's library for diffusion models. It provides a user-friendly interface to experiment with different models and parameters.

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.

Late Chunking: Balancing Precision and Cost in Long Context Retrieval

Late chunking is a new method that aims to improve the precision and cost-effectiveness of long context retrieval systems. It addresses the challenges of traditional approaches by embedding the entire document before chunking, preserving contextual information.
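
A rough sketch of the "embed first, chunk afterwards" order described above. The token embeddings are hard-coded placeholders for the output of a long-context embedding model, and mean pooling over each chunk's token span is an assumed pooling choice; `late_chunk` and `mean_pool` are hypothetical helper names.

```python
from typing import List, Tuple

Vector = List[float]


def mean_pool(vectors: List[Vector]) -> Vector:
    """Average a list of equal-length vectors component by component."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def late_chunk(token_embeddings: List[Vector],
               spans: List[Tuple[int, int]]) -> List[Vector]:
    """Pool already-contextualized token embeddings per chunk span.

    token_embeddings: one vector per token, computed over the WHOLE document.
    spans: (start, end) token indices for each chunk, end exclusive.
    """
    return [mean_pool(token_embeddings[start:end]) for start, end in spans]


# Placeholder token embeddings for a 4-token document (assumed values).
token_embeddings = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0]]
spans = [(0, 2), (2, 4)]  # two chunks of two tokens each

chunk_vectors = late_chunk(token_embeddings, spans)
print(chunk_vectors)  # [[2.0, 0.0], [0.0, 3.0]]
```

Because the token vectors are produced before the document is split, each chunk vector can carry information from text outside its own span, which is the property the description refers to.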

Open-Source RAG Tool for Document Chat

Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.

AI Research Assistant: NotebookLM

NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.

SeedMusic: AI-Powered Music Generation and Editing

SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.

Retrieval

Bypass Paywalls, Read News Articles for Free

PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.

InstaGraph: Knowledge Graph Generator from Text or URL

InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.

News

Bypass Paywalls, Read News Articles for Free

PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.

Machine learning

InstantDrag: Fast, Interactive Drag-Based Image Editing

InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.

HuggingChat macOS: AI Chat App for macOS

HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.

IOPaint: Open-Source Image Inpainting Tool

IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.

InstaGraph: Knowledge Graph Generator from Text or URL

InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.

Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)

This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.

PDF to Audio Converter with AI

This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.

Image Outpainting with Diffusers

This Hugging Face Space demonstrates image outpainting using Diffusers, Hugging Face's library for diffusion models. It provides a user-friendly interface to experiment with different models and parameters.

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.

Late Chunking: Balancing Precision and Cost in Long Context Retrieval

Late chunking is a new method that aims to improve the precision and cost-effectiveness of long context retrieval systems. It addresses the challenges of traditional approaches by embedding the entire document before chunking, preserving contextual information.

Open-Source RAG Tool for Document Chat

Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.

AI Research Assistant: NotebookLM

NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.

DrawingSpinUp: 3D Animation from Single Character Drawings

DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.

SeedMusic: AI-Powered Music Generation and Editing

SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.

AI Product Photo Generator

Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.

Computer graphics

InstantDrag: Fast, Interactive Drag-Based Image Editing

InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.

IOPaint: Open-Source Image Inpainting Tool

IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.

GVHMR: World-Grounded Human Motion Recovery

This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.

ExAvatar: Expressive 3D Gaussian Avatar

This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPL-X with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.

Image Outpainting with Diffusers

This Hugging Face Space demonstrates image outpainting using Diffusers, Hugging Face's library for diffusion models. It provides a user-friendly interface to experiment with different models and parameters.

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.

DrawingSpinUp: 3D Animation from Single Character Drawings

DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.

AI Product Photo Generator

Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.

Note taking

HuggingChat macOS: AI Chat App for macOS

HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.

PDF to Audio Converter with AI

This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.

Open-Source RAG Tool for Document Chat

Kotaemon is an open-source tool for building RAG (Retrieval Augmented Generation) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.

AI Research Assistant: NotebookLM

NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.

Music

Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)

This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.

SeedMusic: AI-Powered Music Generation and Editing

SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.

3D animation

GVHMR: World-Grounded Human Motion Recovery

This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.

ExAvatar: Expressive 3D Gaussian Avatar

This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPL-X with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.

DrawingSpinUp: 3D Animation from Single Character Drawings

DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.

AI Product Photo Generator

Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.
