The Editor's Field Guide to Vertical Video

Vertical isn't horizontal cropped to 9:16. It has its own safe zones, text rules, and pacing — and getting them wrong quietly tanks your watch time. Here's the field guide for editing video that's built for the phone.

June 7, 20263 min readAli Bahrawy

Vertical video gets more reach than horizontal — but only when it's actually built for the format. The most common mistake is treating 9:16 as a crop of a 16:9 timeline: text drifts under the interface buttons, the subject sits in the wrong third, and watch time quietly drops. Vertical has its own rules. Here's the field guide.

Know your safe zones

A vertical frame is 1080×1920, but you don't get to use all of it — platform UI covers the edges. Keep critical content inside the safe area:

  • Top ~15% is for logos and watermarks, not anything important — it's often partially covered and easy to miss.
  • Bottom ~15% is where captions, the username, and the description sit. Keep key text above it.
  • The right ~150px is reserved by the like, comment, and share icons. Never put essential visuals or text there.

A practical rule: keep anything that must be read or seen comfortably inside the center, roughly 250px from the top and bottom of the frame. Videos with text outside the safe zones can lose a meaningful chunk of average watch time purely because people swipe away when they can't read the caption.

Make text readable on a phone, in sunlight

Mobile text rules are stricter than desktop:

  • Size: large — think 60–80pt equivalent. If it's borderline on your editing monitor, it's too small on a phone.
  • Contrast: white text with a 2–3px dark outline (or the reverse). Never light text on a light background.
  • Brevity: two to three lines max per block. Several short text appearances beat one dense paragraph.
  • Placement: use the rule of thirds — put key words near the intersection points, not dead center every time.

Frame the subject for a tall window

Composition changes when the frame is portrait. The subject's eyes and face want to sit in the upper-middle third, not the center, leaving room for captions below. If you're reframing horizontal footage, track the subject deliberately rather than locking a static center crop — a drifting subject in a vertical frame reads as amateur.

Pace it for the swipe

Vertical audiences decide in a heartbeat and the next video is one thumb-flick away. That means an even tighter open, a hook in the first seconds, and relentless momentum — though the sweet-spot length has stretched toward 60–90 seconds, giving you room for a real idea. Condensing a longer video to its strongest 60–90 seconds is the core vertical skill.

Caption everything, by default

Vertical is watched on mute more than any other format, and captions live in the busy bottom zone — so they have to be both present and well-placed. Generating captions and positioning them inside the safe area isn't polish; it's a ranking factor.

Working faster

Most of the vertical workflow — finding the clip, tightening it, captioning it — is the repetitive part. SmoothyEdit handles that from a transcript: it finds the moments worth cutting vertical, condenses them tight, and generates captions, so your attention goes to framing and safe-zone placement — the part that actually makes a vertical video work. For where the format is heading, see the 2026 editing trends.