ZASSHA is an AI-powered tool that automatically converts screen recordings into structured documentation with timestamps, screenshots, and AI-generated explanations, exportable to Word, PowerPoint, or Excel.
https://co-r-e.github.io/zassha_lp?ref=producthunt
ZASSHA

Product Information

Updated: Sep 22, 2025

What is ZASSHA

ZASSHA is a manual-creation assistant that turns screen recordings into documentation. It is open source and runs locally on your machine; the only thing it requires beyond the install is a Gemini API key. The tool supports English and Japanese. Videos are processed locally and are not uploaded to or kept in external storage.

Key Features of ZASSHA

ZASSHA analyzes a screen recording of a workflow, detects the operations performed, and generates a step-by-step guide with timestamps, screenshots, and AI-generated explanations. Videos are processed locally, and the finished documentation can be exported to Word, PowerPoint, or Excel.
Automatic Step Extraction: Analyzes workflow videos to detect operations and create a timeline with timestamps, tools used, and AI-generated explanations
Multi-format Export: Generates ready-to-share documentation in Word, PowerPoint, or Excel formats with screenshots, captions, and speaker notes
Local Processing: Processes videos on the user's machine; only Gemini API requests leave the device, which keeps the recording itself private
Thumbnail Generation: Automatically creates visual thumbnails for each detected operation to enhance documentation clarity

Use Cases of ZASSHA

Technical Documentation: Help technical writers create knowledge base articles and documentation faster by automatically generating content from video recordings
Employee Training: Convert training recordings into step-by-step playbooks for new hire onboarding and skill development
IT Support: Document troubleshooting processes and create standardized support guides from recorded sessions
Quality Assurance: Generate detailed bug reports with visual evidence and reproducible steps from test session recordings

Pros

Privacy-focused with local processing
Multiple export formats supported
Automated workflow detection saves time
Supports both English and Japanese languages

Cons

Requires high-quality (1080p+) video recordings
Works best with videos under 5 minutes and 100 MB
Depends on Gemini API for analysis

How to Use ZASSHA

Install ZASSHA: Clone the ZASSHA repository from GitHub (https://github.com/co-r-e/Zassha) and set up your Gemini API key
Record your screen: Create a screen recording at 1080p resolution or higher using macOS or Windows screen capture tools. Keep the video under 5 minutes and 100MB for best results
Upload video: Drop your recorded video file into ZASSHA's interface
Select analysis mode: Choose between Summary or Detail mode depending on how detailed you want the documentation to be
Add goal statement: Provide a brief description of what the process aims to achieve to help the AI focus on relevant details
Review generated content: Check the overview, the business inference, and the operation table (with hoverable thumbnails) that ZASSHA generated
Export documentation: Export the finished manual to your preferred format (DOCX, PPTX, or XLSX). The exported file is named to match your source video
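
The recording guidelines in the steps above (1080p or higher, under 5 minutes, under 100 MB) can be checked before a video is dropped into ZASSHA. The following is a minimal sketch in Python, not part of ZASSHA itself: `RecordingInfo`, `check_recording`, and `export_names` are hypothetical names, the duration and height are assumed to come from your own recorder or media tool, and the export naming rule (video base name plus the format extension) is only a guess at what "named to match your source video" means.

```python
from __future__ import annotations

from dataclasses import dataclass
from pathlib import Path

# Limits taken from the recording guidelines above.
# ZASSHA itself may check or enforce them differently.
MAX_DURATION_SECONDS = 5 * 60
MAX_SIZE_BYTES = 100 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
MIN_HEIGHT_PIXELS = 1080


@dataclass
class RecordingInfo:
    """Hypothetical container; fill it in from your own recorder or media tool."""
    path: Path
    size_bytes: int
    duration_seconds: float
    height_pixels: int


def check_recording(info: RecordingInfo) -> list[str]:
    """Return human-readable warnings; an empty list means every check passed."""
    warnings = []
    if info.height_pixels < MIN_HEIGHT_PIXELS:
        warnings.append(f"height {info.height_pixels}px is below {MIN_HEIGHT_PIXELS}px")
    if info.duration_seconds >= MAX_DURATION_SECONDS:
        warnings.append(f"duration {info.duration_seconds:.0f}s is not under 5 minutes")
    if info.size_bytes >= MAX_SIZE_BYTES:
        warnings.append(f"size {info.size_bytes} bytes is not under 100 MB")
    return warnings


def export_names(video_path: Path) -> dict[str, str]:
    """Guess export file names by reusing the video's base name (an assumption)."""
    return {ext: f"{video_path.stem}.{ext}" for ext in ("docx", "pptx", "xlsx")}


if __name__ == "__main__":
    demo = RecordingInfo(Path("onboarding-demo.mp4"), 40 * 1024 * 1024, 180.0, 1080)
    print(check_recording(demo) or "recording is within the recommended limits")
    print(export_names(demo.path))
```

Returning a list of warnings rather than raising an error reflects that the limits are described as recommendations for best results, not hard requirements.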

ZASSHA FAQs

ZASSHA analyzes videos locally on your machine. Only requests you initiate to the Gemini API leave your device, and you control exactly what's sent.

Latest AI Tools Similar to ZASSHA

Noiz
Noiz is an AI-powered video summarizer tool that transforms YouTube videos into concise summaries, transcripts, and key insights in 41 languages with just one click.
RollSummary
RollSummary is an AI-powered RPG session transcriber and summarizer that helps players and GMs capture, track and prepare their tabletop gaming sessions through automated transcription and smart summaries.
Long Summary
Long Summary is an AI-powered text summarization tool that enables users to generate summaries of any length without input or output limitations, offering custom-length summaries for various content types while maintaining information accuracy.
SceneSnap
SceneSnap is an AI-powered content synthesis platform that transforms lengthy videos into personalized, interactive summaries tailored to users' specific interests and time constraints.