Short-form video is all about momentum! Ideas hit, clips get cut, captions pop on, and you need to publish … fast. videokit is a new collection of Glifs built to capture that creative energy! With just a few building blocks, you can spin up fully captioned TikToks, remix vertical split-screen reactions, or stitch images and audio into hyper-catchy slideshows, all without leaving Glif.


Why we're hyped about videokit

  • End-to-end in one place. Every Glif can be used as a skill inside of Glifchat so you can brainstorm a concept and render the final .webm in the same chat session.
  • Speed over setup. The Glifs handle clipping, looping, and timing for you—no timeline-dragging or spreadsheet-style configuration.
  • Built for vertical. Output is 9 × 16, encoded as webm. Feed it webm when you can to avoid extra transcoding delays.

What is videokit?

videokit is a collection of Glifs to enable short-form video creation. Below is the 30-second tour. A deeper walkthrough with real workflow examples will follow!

LayerWhat it does bestGo-to glifs
High-level “one-clicks”Full TikTok builds in a single promptCreate captioned TikTok video (single) — overlay captions on one clip.
Create TikTok video (split-screen) — reaction/review layout with a second clip stacked below.
Create TikTok video (fast) — turn a string of images + VO into a slideshow-style video, optional background music.
Medium-level mixersRapid prototypes and hybridsCreate video from asset links — mix any combo of images, video, and audio; decide whether to loop or clip each track.
Low-level building blocksFine-grained tweaks & utilitiesAnalyze media asset — get duration, codec, bitrate, etc.
Create speech audio — instant VO with Adam (American male).
Put text on image — meme generator in vertical format.
Combine videos — stitch side-by-side or top-and-bottom.
Mix audio to video — background music, podcast highlights, you name it.
Display video from URL — handy for Glifchat artifact previews.
Pro tip: Chain a low-level Glif (e.g., Create speech audio) into a high-level one (e.g., Create TikTok video (split-screen)) to automate faceless content!

Tips for smooth renders

  1. Stay in .webm when possible. Converting other codecs adds a few extra processing hops and can sometimes throw an error.
  2. Keep resolution reasonable. 480p or below is perfect for most mobile feeds and prevents oversized files from making your Glif workflows slow.

Example 1: Reddit-Storytime Shorts (15 s, faceless)

Below is the first Glif workflow that we wanted to show-off! It chains five Glifs (plus two vanilla blocks) to crank out a TikTok-style video in one go!

StepBlockWhat it does
1TextInput subredditAsk for the subreddit URL (defaults to r/AmItheAsshole).
2WebFetcher reddit_scannerPull the week’s top post via the RSS feed.
3GPT script_makerCompress that post into a 15-second, hook-heavy script.
4Create Speech Audio generated_speechTurn the script into a crisp voice-over (Adam, American male).
5Multipick backgroundVideoRandom-choose one of four 9:16 Minecraft-parkour clips.
6Create Captioned TikTok Video video_makerOverlay auto-generated captions center-screen, loop/clip to sync with VO, output .webm.
0:00
/0:18

What makes it shine

  • Totally hands-free: Supply a subreddit (or keep the default) and hit run, it's all automated!
  • Chronically-online tone: The GPT prompt bakes in “TikTok voice,” so it feels authentic, and you can customize it!
  • Infinite variety: Multipick’s randomizer swaps the parkour clip each run, keeping the feed fresh.
  • Feed-ready output: Final video is vertical .webm, captioned, and trimmed to ~15 s—perfect for TikTok, Reels, Shorts.

Remix ideas

  • Swap the background: Point Multipick at a set of different b-roll footage to turn Reddit drama into subtle brand placement.
  • Add BG music: Pipe the same VO into Mix Audio to Video with a royalty-free track for extra vibe.
  • Try other RSS Feeds/Data Sources with Web Fetcher: Reddit is just the beginning!

Example 2: Script-Driven Shorts with Custom Music

Need a quick reel or a branded sound-bite? This workflow lets you paste a script, pick (or upload) a song, and walk away with a captioned vertical video all in six blocks.

StepBlockPurpose
1TextInput scriptPaste the exact words you want on screen & VO.
2AudioInput audioBring your own background track (MP3, WAV, etc.).
3Create Speech Audio generated_speechTTS voice-over (Adam, American male).
4Multipick backgroundVideoRandom Minecraft-parkour clip for visual energy.
5Create Captioned TikTok Video video_makerMerges VO + captions over the chosen clip.
6Mix Audio to Video audio_overlayBlends your music under the VO (both set to “true” mix factors).
0:00
/0:06

Why it rocks

  • Plug-and-play voice: Drop any announcement or CTA into the script field, no LLM needed!
  • Instant soundtrack: audio_overlay trims & mixes your track to match video length automatically.
  • Hands-off timing: Background clip loops/shortens to sync with VO.

Remix ideas

  • Voice variation: Swap the TTS glif for a different voice model or pass your own narration via AudioInput.
  • Change the B-roll: Replace the Multipick URLs with product footage or office shots for an internal update video.
  • Caption placement: Change the third input in video_maker to "top" or "bottom" for different aesthetic.

Example 3: Top-5 Horror Slideshow Bot (@horror-tok)

Unlike our first two “fire-and-forget” recipes, this bot leans into Glifchat’s super-power: conversational iteration. Instead of spitting out a fully-rendered video in one pass, it delivers five captioned images you can tweak on the fly, then, when you’re happy, you can take the images right to TikTok.

StepBlockWhat happens
1TextInput topicAsk the user: “Pick a sub-genre (haunted hospitals, cursed dolls, etc.).”
2Wikipedia Research (custom skill) wiki_fetchPull facts & citations for the five creepiest entries in that niche.
3GPT script_writerTurn each fact into a punchy one-line caption + VO text.
4Flux-Dev Image Gen image_factory
(LoRA: Saqib’s 2000 Camera)
Generate one 9 × 16 cinematic still per item.
5Put Text on Image captionerBurn each caption onto its matching image.
6Glifchat LoopBot sends the five captioned images back individually so you can:
• ask for a style tweak (“make #3 darker”)
• swap an image (“regen #2 with fog”)
• rewrite a caption—all without rerunning the entire flow.
7(Optional) Create TikTok Video (fast) slideshowOnce you type “looks good,” the bot chains this glif to auto-sync the approved images with the VO and ship a vertical .webm.

0:00
/0:08

Why this flow shows Glifchat’s real magic

  • Draft, discuss, refine: No “redo the whole render” pain. You can chat directly with the bot to update the slideshow!
  • Human-in-the-loop quality: You stay creative director; the bot is your tiktok artist!

Remix pointers

  • Faceless slideshow videos can rack up thousands of views, and you can target any niche you want. So get creative and have fun!
  • Try using other data feeds instead of Wikipedia.
  • Try using other LoRAs for varying artistic style.

With this third workflow we’ve covered:

  1. One-shot video (Reddit Storytime)
  2. Script-plus-soundtrack video (Announcement Video)
  3. Draft-iterate-compile (Horror Slideshows for TikTok)

These Glifs are just a few examples of what's possible. We can't wait to see what you'll create!

Love,

Glif Team

Share this post