Research

Bypass Paywalls, Read News Articles for Free

PaywallBuster helps users access paywalled news articles by redirecting them to third-party tools that bypass paywalls. It emphasizes its commitment to privacy and legality, offering free access to information for all.

InstantDrag: Fast, Interactive Drag-Based Image Editing

InstantDrag is a new method for drag-based image editing that uses optical flow generation and diffusion models to achieve fast, interactive image editing. It requires only an image and a drag instruction as input, eliminating the need for masks or text prompts. InstantDrag has been shown to perform fast photorealistic edits on facial video datasets and general scenes.

HuggingChat macOS: AI Chat App for macOS

HuggingChat macOS is a native chat app for macOS that uses open-source language models for AI conversations. It allows users to access advanced AI conversation capabilities directly on their desktop.

IOPaint: Open-Source Image Inpainting Tool

IOPaint is an open-source image inpainting tool powered by AI models like Stable Diffusion. It allows users to remove unwanted objects, defects, or people from images, or to erase and replace objects with new content.

InstaGraph: Knowledge Graph Generator from Text or URL

InstaGraph is a tool that converts text input or URLs into knowledge graphs using OpenAI's GPT-3.5. It allows users to visualize relationships between entities in a complex topic and provides features like color-coded graph nodes and edges, responsive design, and API endpoints for retrieving graph data.

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

TextBoost is a technique that personalizes text-to-image models with a single reference image. It fine-tunes the text encoder to prevent overfitting and enables control over generated images using text prompts.

Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)

This repository offers a PyTorch implementation of QAMDT, a music generation model integrating state-of-the-art models. It includes training and inference scripts, instructions for dataset preparation, and links to relevant resources.

GVHMR: World-Grounded Human Motion Recovery

This GitHub repository provides code for the research paper "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates" presented at SIGGRAPH Asia 2024. The code enables recovery of human motion using gravity-view coordinates, a novel approach for 3D human motion reconstruction.

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

This article introduces Contextual Retrieval, a method that improves retrieval accuracy in Retrieval-Augmented Generation (RAG) by adding contextual information to chunks before embedding. Experiments show significant performance improvements.
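
The idea of adding context to chunks before embedding can be sketched roughly as follows. This is a minimal illustration, not the article's implementation: `contextualize_chunks`, `first_sentence_context`, and `overlap_score` are hypothetical names, the sample document is invented, and the word-overlap score stands in for a real embedding model.

```python
from typing import Callable, List


def contextualize_chunks(
    document: str,
    chunks: List[str],
    make_context: Callable[[str, str], str],
) -> List[str]:
    """Prepend a short document-level context to each chunk before indexing."""
    return [f"{make_context(document, chunk)}\n{chunk}" for chunk in chunks]


def first_sentence_context(document: str, chunk: str) -> str:
    # Hypothetical stand-in: a real system would more likely ask a language
    # model to describe where `chunk` sits within `document`.
    return document.split(".")[0].strip() + "."


def overlap_score(query: str, text: str) -> int:
    # Toy relevance score (an assumption, not an embedding): the number of
    # distinct query words that also appear in the text.
    words = {w.strip(".,").lower() for w in text.split()}
    return sum(1 for w in set(query.lower().split()) if w in words)


# Invented sample data for illustration only.
document = (
    "ACME Corp quarterly report for Q2 2023. "
    "Revenue grew 3% over the previous quarter."
)
chunks = ["Revenue grew 3% over the previous quarter."]
contextual = contextualize_chunks(document, chunks, first_sentence_context)

plain_score = overlap_score("ACME revenue", chunks[0])
contextual_score = overlap_score("ACME revenue", contextual[0])
```

In this toy case the bare chunk never mentions the company, so a query naming it matches the contextualized chunk on more words than the plain one.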

ExAvatar: Expressive 3D Gaussian Avatar

This GitHub repository provides the official PyTorch implementation of ExAvatar, a novel 3D Gaussian avatar model presented at ECCV 2024. ExAvatar combines the expressiveness of SMPL-X with the strong appearance modeling capabilities of 3D Gaussian models, resulting in a highly expressive and realistic avatar.

PDF to Audio Converter with AI

This Python code uses OpenAI's GPT models to convert PDF documents into audio. Features include uploading multiple PDFs, customizable instructions, text generation, and voice selection. It offers draft editing and iteration for improved results.

Image Outpainting with Diffusers

This Hugging Face Space demonstrates image outpainting using Diffusers, Hugging Face's library for diffusion models. It provides a user-friendly interface to experiment with different models and parameters.

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

NVIDIA Deep Learning Institute (DLI) provides resources for learning AI, accelerated computing, data science, graphics, and simulation. It offers self-paced courses, instructor-led workshops, and educator programs. DLI helps individuals, teams, organizations, educators, and students advance their knowledge in AI.

Late Chunking: Balancing Precision and Cost in Long Context Retrieval

Late chunking is a new method that aims to improve the precision and cost-effectiveness of long context retrieval systems. It addresses the challenges of traditional approaches by embedding the entire document before chunking, preserving contextual information.
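
A rough sketch of the "embed first, chunk afterwards" step might look like the following. It assumes the whole document has already been run through a long-context encoder that yields one vector per token; `late_chunk`, the token vectors, and the span boundaries are all hypothetical, and mean pooling is one plausible way to combine token vectors, not necessarily the one the method uses.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def late_chunk(
    token_embeddings: Sequence[Vector],
    spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Mean-pool token vectors within each (start, end) span, end exclusive."""
    pooled: List[Vector] = []
    for start, end in spans:
        window = token_embeddings[start:end]
        if not window:
            raise ValueError(f"empty span: ({start}, {end})")
        dim = len(window[0])
        pooled.append(
            [sum(vec[i] for vec in window) / len(window) for i in range(dim)]
        )
    return pooled


# Hypothetical per-token vectors, as if produced by encoding the full
# document in a single pass so every token has seen the whole context.
token_embeddings = [[1.0, 0.0], [3.0, 2.0], [0.0, 4.0], [2.0, 0.0]]

# Chunk boundaries are applied only after encoding.
chunk_vectors = late_chunk(token_embeddings, [(0, 2), (2, 4)])
```

Because the chunk boundaries are applied after encoding, each pooled vector is built from token vectors that were computed with the full document in view.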

Open-Source RAG Tool for Document Chat

Kotaemon is an open-source tool for building Retrieval-Augmented Generation (RAG) pipelines. It offers a user-friendly interface for querying documents and supports both local and cloud-based LLMs. Kotaemon is designed for both end users and developers, enabling customizable RAG workflows and document QA.

AI Research Assistant: NotebookLM

NotebookLM is an AI-powered research assistant that helps users analyze and synthesize information from uploaded documents. It provides personalized insights, grounded in source material with inline citations, and keeps user data private.

SeedMusic: AI-Powered Music Generation and Editing

SeedMusic is a suite of AI systems for music generation and editing. It features vocal music generation, fine-grained note-level editing, and zero-shot singing voice conversion. SeedMusic offers high-quality music with style control and adapts to musician workflows.

Retrieval

Bypass Paywalls, Read News Articles for Free

InstaGraph: Knowledge Graph Generator from Text or URL

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

News

Bypass Paywalls, Read News Articles for Free

Machine learning

InstantDrag: Fast, Interactive Drag-Based Image Editing

HuggingChat macOS: AI Chat App for macOS

IOPaint: Open-Source Image Inpainting Tool

InstaGraph: Knowledge Graph Generator from Text or URL

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

Open-Source Music Generation with Quality-Aware Masked Diffusion Transformer (QAMDT)

Contextual Retrieval for Enhanced Retrieval-Augmented Generation

PDF to Audio Converter with AI

Image Outpainting with Diffusers

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

Late Chunking: Balancing Precision and Cost in Long Context Retrieval

Open-Source RAG Tool for Document Chat

AI Research Assistant: NotebookLM

DrawingSpinUp: 3D Animation from Single Character Drawings

DrawingSpinUp is a system that generates 3D animations from single character drawings. It addresses challenges in 3D model reconstruction from amateur drawings by using a contour removal and restoration strategy and a skeleton-based thinning deformation algorithm for geometry refinement.

SeedMusic: AI-Powered Music Generation and Editing

AI Product Photo Generator

Simplicity AI uses machine learning to create realistic product photoshoots. Users upload images and the AI generates various poses, angles, and environments. Different plans offer varying features and image credits.

Computer graphics

InstantDrag: Fast, Interactive Drag-Based Image Editing

IOPaint: Open-Source Image Inpainting Tool

TextBoost: One-Shot Image Personalization with Fine-Tuned Text Encoder

GVHMR: World-Grounded Human Motion Recovery

ExAvatar: Expressive 3D Gaussian Avatar

Image Outpainting with Diffusers

NVIDIA Deep Learning Institute: AI and Accelerated Computing Training

DrawingSpinUp: 3D Animation from Single Character Drawings

AI Product Photo Generator

Note taking

HuggingChat macOS: AI Chat App for macOS
