2338 projects for "video" with 1 filter applied:

  • Earn up to 15% annual interest with Nexo. Icon
    Earn up to 15% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Earn up to 15% annual interest with Nexo. Icon
    Earn up to 15% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Nunflix

    Nunflix

    Nunflix is a frontend Netflix clone but Warrior Nun-ified

    Nunflix is an open-source streaming application designed to provide a centralized interface for browsing and consuming video content from multiple online sources in a clean and user-friendly environment. It focuses on aggregating content into a unified experience, allowing users to search for shows or movies and stream them without navigating multiple websites. The application typically includes a modern interface optimized for ease of use, with features such as content categorization, search functionality, and metadata display. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    Depth Anything 3

    Depth Anything 3

    Recovering the Visual Space from Any Views

    Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Memories

    Memories

    Build a full stack MERN application

    ...The app is called “Memories” and allows users to post interesting events from their lives (memories), providing a simple social media–style interface and back-end API to support create, read, update, delete of posts. The repository is tied to a video tutorial series that covers the project from start to finish, ideal for developers learning full-stack workflows. The codebase is split into client and server folders for separation of concerns, and uses common patterns like REST APIs, React state management, and MongoDB models. While fairly standard as a “starter” full-stack project, it provides a clean, end-to-end example of building and deploying a modern web application with MERN.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Dozzle

    Dozzle

    Realtime log viewer for containers. Supports Docker, Swarm and K8s

    Dozzle is a lightweight, self-hosted web application for real-time viewing and monitoring of container logs, focused on speed and simplicity rather than building a full log storage pipeline. Instead of indexing or storing logs, it connects to your container runtime and streams live output so you can diagnose issues as they happen. The interface includes practical quality-of-life features like fuzzy searching for containers, regex log search, split-screen viewing for multiple logs, and live...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Wiz: #1 Cloud Security Software for Modern Cloud Protection Icon
    Wiz: #1 Cloud Security Software for Modern Cloud Protection

    Protect Everything You Build and Run in the Cloud

    Use the Wiz Cloud Security Platform to build faster in the cloud, enabling security, dev and devops to work together in a self-service model built for the scale and speed of your cloud development.
    Learn More
  • 5
    HunyuanVideo-Avatar

    HunyuanVideo-Avatar

    Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

    ...Character image injection module for better consistency between training and inference conditioning. Emotion control by extracting emotion reference images and transferring emotional style into video sequences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    SimpleTuner

    SimpleTuner

    A general fine-tuning kit geared toward image/video/audio diffusion

    SimpleTuner is an open-source toolkit designed to simplify the fine-tuning of modern diffusion models for generating images, video, and audio. The project focuses on providing a clear and understandable training environment for researchers, developers, and artists who want to customize generative AI models without navigating complex machine learning pipelines. It supports fine-tuning workflows for models such as Stable Diffusion variants and other diffusion architectures, enabling users to adapt pretrained models to specialized datasets or creative tasks. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Rackula

    Rackula

    Drag and drop rack visualizer

    Rackula is a browser-based rack layout designer aimed at homelabbers, audio/video technicians, and equipment organizers who want a visual way to plan and document physical device racks. It runs entirely client-side with no backend server required, making it lightweight, fast, and easy to self-host or run locally without external dependencies. Users can drag and drop devices into customizable rack spaces, annotate equipment, set unit sizes, and manage complex layouts as their setup evolves. ...
    Downloads: 99 This Week
    Last Update:
    See Project
  • 8
    vJEPA-2

    vJEPA-2

    PyTorch code and models for VJEPA2 self-supervised learning from video

    VJEPA2 is a next-generation self-supervised learning framework for video that extends the “predict in representation space” idea from i-JEPA to the temporal domain. Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Kaset

    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Simplify Time-consuming and Overly Complicated Financial Processes. Icon
    Simplify Time-consuming and Overly Complicated Financial Processes.

    Cloud Purchase Requisition, Purchase Order & Invoice Approval Software

    Zahara's cloud based platform automates budget management, suppliers, purchase requisitions, multi-level purchase approvals, deliveries and invoice reconciliation and approvals. Zahara integrates with most leading accounting software such as QuickBooks Online and Xero to give expanding SME's real time visibility and centralized control of their purchasing. Zahara can be used to control spend in an organization. We take the initial request to buy something and automate the approval process and sending of the PO to the Vendor. Deliveries can be receipted, vendors invoices matched and processed and then exported to finance. Zahara adds control yet speeds up processing.
    Try it for FREE
  • 10
    SteadyDancer

    SteadyDancer

    Harmonized and Coherent Human Image Animation

    ...It employs computer vision and motion modeling to estimate and reduce unwanted jitters, shakes, or camera wobbles — particularly in dance or movement sequences where traditional smoothing would distort intentional motion. By differentiating between intentional rhythmic motion and unintentional instability, SteadyDancer applies adaptive filtering that enhances video quality without flattening the core movement dynamics. The system can be used both in preprocessing pipelines for content creators and in live feedback loops for performers, giving dancers and videographers a tool to refine their visual outputs. It supports integration with standard video formats and includes customizable parameters so users can tune stabilization aggressiveness.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    StoryMem

    StoryMem

    Official code for StoryMem: Multi-shot Long Video Storytelling

    StoryMem is a narrative-focused memory accumulation system that lets users build, store, and reference past conversational context or story elements with an AI, effectively enabling the AI to maintain and recall personalized story memories or character arcs over time. Instead of treating each interaction as stateless, it tracks user-defined memory nodes, tags, and story threads so that future interactions can draw on established narrative context like character traits, past events, or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    VERT.sh

    VERT.sh

    The next-generation file converter

    VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. The system supports over 250 file formats and includes customizable conversion settings, allowing users to fine-tune output parameters. For more demanding tasks such as video processing, it offers an optional self-hosted backend component, giving users flexibility between local and distributed workflows.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    100 proyectos de JavaScript

    100 proyectos de JavaScript

    100 JavaScript projects with code and videos for free

    100 proyectos de JavaScript is a large collection of 100 JavaScript projects, each with source code and an accompanying video tutorial, offered entirely for free. The repository is tied to the website javascript100.dev, which presents the projects as a structured learning path so learners can progressively build their skills through practice.Each project lives in its own numbered folder (for example, 01-tinder-swipe), and the README explains how to run them via the VS Code Live Preview extension, making it easy for beginners to get them running without complex tooling. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Frigate

    Frigate

    NVR with realtime local object detection for IP cameras

    Frigate - NVR With Realtime Object Detection for IP Cameras A complete and local NVR designed for Home Assistant with AI object detection. Uses OpenCV and Tensorflow to perform realtime object detection locally for IP cameras. Use of a Google Coral Accelerator is optional, but highly recommended. The Coral will outperform even the best CPUs and can process 100+ FPS with very little overhead.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 16
    SAM 3

    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an...
    Downloads: 51 This Week
    Last Update:
    See Project
  • 17
    NExT-GPT

    NExT-GPT

    Code and models for ICML 2024 paper, NExT-GPT

    NExT-GPT is an open-source research framework that implements an advanced multimodal large language model capable of understanding and generating content across multiple modalities. Unlike traditional models that primarily handle text, NExT-GPT supports input and output combinations involving text, images, video, and audio in a unified architecture. The system connects a large language model with multimodal encoders and diffusion-based decoders so it can interpret information from different sensory formats and generate responses in different media types. This architecture allows the model to convert between modalities, such as generating images from text descriptions or producing audio or video outputs based on textual prompts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Hello Python

    Hello Python

    Comprehensive tutorial repository aimed at teaching the Python program

    Hello-Python is a comprehensive tutorial repository aimed at teaching the Python programming language from scratch for beginners. It includes over 100 classes and about 44 hours of video instruction, combined with code samples, projects, and a chat community for support. The material covers the fundamentals—variables, data types, loops, functions—as well as intermediate topics like date handling, list comprehensions, file IO, regular expressions, modules, and packages. The course is designed to be accessible: no prior programming experience required, and the resources are freely available. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Depth Pro

    Depth Pro

    Sharp Monocular Metric Depth in Less Than a Second

    ...The repo and research page emphasize boundary fidelity and crisp geometry, addressing a common weakness in monocular depth where edges can blur. Community integrations (e.g., inference wrappers and UI nodes) have sprung up around the model, reflecting practical interest in video, AR, and generative pipelines. As a general-purpose monocular depth backbone, Depth Pro slots into 3D reconstruction, relighting, and scene understanding workflows that benefit from metric predictions.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    mediapy

    mediapy

    This Python library makes it easy to display images and videos

    Read/write/show images and videos in an IPython/Jupyter notebook.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    The Arcade Library

    The Arcade Library

    Easy to use Python library for creating 2D arcade games

    Arcade is an easy-to-use Python library for creating 2D video games. It provides a modern and straightforward API, enabling developers to craft engaging games and graphical applications efficiently. Arcade supports rendering shapes, handling user input, and managing game physics, making it suitable for both beginners and experienced developers.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 22
    PowerPoint-ist

    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    ...PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. PPTist supports a wide variety of slide elements such as text, images, shapes, charts, tables, audio, video, and mathematical formulas, enabling users to design rich and complex presentations. It also includes tools for slide organization, element alignment, animation, and theme management to streamline presentation design workflows. In addition to editing capabilities, it provides export options for presentation files and other formats, although compatibility with traditional presentation formats.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    yt-x

    yt-x

    Browse youtube from your terminal

    yt-x is a lightweight tool designed to enhance the YouTube viewing and interaction experience by providing additional functionality beyond the standard platform interface. It typically works by injecting scripts or modifying how YouTube pages behave, enabling users to customize playback, interface elements, or interaction features. The project focuses on improving usability and control, allowing users to tailor their viewing experience according to personal preferences. It may include...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    myGPTReader

    myGPTReader

    AI Slack bot for reading, summarizing, and chatting with content

    myGPTReader is an AI-powered Slack bot designed to help users read, summarize, and interact with various types of digital content through conversational interfaces. It enables users to quickly understand web pages, documents, and even video content by transforming them into interactive discussions rather than static reading experiences. myGPTReader supports a wide range of file formats, including eBooks, PDFs, and text-based documents, making it flexible for both casual and professional use cases. It also integrates voice interaction capabilities, allowing users to communicate with the system verbally and even use it as a language practice assistant. ...
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB