# ProPresenter 7 `.pro` File Format Specification **Version:** 1.0 **Target Audience:** AI agents, automated parsers, developers **Proto Source:** greyshirtguy/ProPresenter7-Proto v7.16.2 (MIT License) --- ## 1. Overview ### File Format - **Extension:** `.pro` - **Binary Format:** Protocol Buffers (Google protobuf v3) - **Top-level Message:** `rv.data.Presentation` (defined in `presentation.proto`) - **Proto Definitions:** greyshirtguy/ProPresenter7-Proto v7.16.2 (MIT) ### Known Limitations - **Binary Fidelity:** Round-trip decode→encode fails on all reference files. Proto definitions are incomplete; unknown fields are lost during serialization. - **Workaround:** Preserve original binary data if exact binary reproduction is required. ### File Validity - **Empty files (0 bytes):** Invalid. Throw exception. - **Songs without arrangements:** Valid. 17 out of 169 reference files have no arrangements. - **Non-song presentations:** Files like ANKUENDIGUNGEN, MODERATION, THEMA have groups/slides but may lack text elements. --- ## 2. Song Structure ### Hierarchy Diagram ``` Presentation (rv.data.Presentation) ├── name (string, field 1) ├── uuid (rv.data.UUID, field 5) ├── cue_groups[] (rv.data.Presentation.CueGroup, field 12) ← Groups │ ├── group (rv.data.Group, field 1) │ │ ├── name (string, field 2) │ │ ├── uuid (rv.data.UUID, field 1) │ │ └── color (rv.data.Color, field 3) [optional] │ └── cue_identifiers[] (rv.data.UUID, field 2) ← Slide UUID references ├── cues[] (rv.data.Cue, field 13) ← Slides │ ├── uuid (rv.data.UUID, field 1) │ └── actions[0] (rv.data.Action, field 10) │ └── slide (rv.data.Action.SlideType, field 23) │ └── presentation (rv.data.PresentationSlide, field 2) │ └── base_slide (rv.data.Slide, field 1) │ └── elements[] (rv.data.Slide.Element, field 1) │ └── element (rv.data.Graphics.Element, field 1) │ ├── name (string, field 2) ← Label like "Orginal", "Deutsch" │ └── text (rv.data.Graphics.Text, field 13) │ └── rtf_data (bytes, field 3) ← RTF-encoded text └── arrangements[] (rv.data.Presentation.Arrangement, field 11) ├── name (string, field 2) ├── uuid (rv.data.UUID, field 1) └── group_identifiers[] (rv.data.UUID, field 3) ← Group UUID references ``` ### Navigation Paths **To access slide text:** ``` Presentation → cues[i] → actions[0] → slide → presentation → base_slide → elements[j] → element → text.rtf_data ``` **To access group metadata:** ``` Presentation → cue_groups[i] → group → name, uuid, color ``` **To access arrangement order:** ``` Presentation → arrangements[i] → group_identifiers[] ``` --- ## 3. Fields Reference ### Presentation (rv.data.Presentation) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `name` | `string` | 1 | Song title (e.g., "Amazing Grace") | | `uuid` | `rv.data.UUID` | 5 | Unique identifier for the presentation | | `cues[]` | `rv.data.Cue` | 13 | Array of slides | | `cue_groups[]` | `rv.data.Presentation.CueGroup` | 12 | Array of groups (song parts) | | `arrangements[]` | `rv.data.Presentation.Arrangement` | 11 | Array of arrangements | ### Presentation.CueGroup | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `group` | `rv.data.Group` | 1 | Group metadata (name, uuid, color) | | `cue_identifiers[]` | `rv.data.UUID` | 2 | Array of slide UUIDs in this group | ### Group (rv.data.Group) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `uuid` | `rv.data.UUID` | 1 | Unique identifier for the group | | `name` | `string` | 2 | Display name (e.g., "Verse 1", "Chorus") | | `color` | `rv.data.Color` | 3 | Optional RGBA color (float values 0.0-1.0) | ### Presentation.Arrangement | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `uuid` | `rv.data.UUID` | 1 | Unique identifier for the arrangement | | `name` | `string` | 2 | Arrangement name (e.g., "normal", "test2") | | `group_identifiers[]` | `rv.data.UUID` | 3 | Ordered array of group UUIDs | ### Cue (rv.data.Cue) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `uuid` | `rv.data.UUID` | 1 | Unique identifier for the slide | | `actions[]` | `rv.data.Action` | 10 | Array of actions (slides use `actions[0]`) | ### Action (rv.data.Action) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `slide` | `rv.data.Action.SlideType` | 23 | Slide data (oneof field) | ### Action.SlideType | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `presentation` | `rv.data.PresentationSlide` | 2 | Presentation slide (oneof field) | ### PresentationSlide (rv.data.PresentationSlide) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `base_slide` | `rv.data.Slide` | 1 | Base slide containing elements | ### Slide (rv.data.Slide) | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `elements[]` | `rv.data.Slide.Element` | 1 | Array of slide elements | ### Slide.Element | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `element` | `rv.data.Graphics.Element` | 1 | Graphics element wrapper | ### Graphics.Element | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `uuid` | `rv.data.UUID` | 1 | Unique identifier for the element | | `name` | `string` | 2 | User-defined label (e.g., "Orginal", "Deutsch") | | `text` | `rv.data.Graphics.Text` | 13 | Text data (optional) | ### Graphics.Text | Field Path | Protobuf Type | Field Number | Description | |------------|---------------|--------------|-------------| | `rtf_data` | `bytes` | 3 | RTF-encoded text content | --- ## 4. Groups ### Definition Groups represent song parts (Verse 1, Verse 2, Chorus, Bridge, Ending, etc.). They define logical sections of a song. ### Characteristics - **Names:** User-defined strings. Not standardized. Examples: "Verse 1", "Strophe 1", "Refrain", "Ending". - **Slide References:** Each group contains an ordered array of slide UUIDs (`cue_identifiers`). - **Color:** Optional RGBA color (float values 0.0-1.0 for red, green, blue, alpha). - **Special Groups:** COPYRIGHT, BLANK — treated as regular groups (no special handling required). ### Example (Test.pro) - **Verse 1** → 1 slide - **Verse 2** → 1 slide - **Chorus** → 2 slides - **Ending** → 1 slide ### Access Pattern ```php foreach ($presentation->getCueGroups() as $cueGroup) { $group = $cueGroup->getGroup(); $name = $group->getName(); $uuid = $group->getUuid()->getString(); $slideUuids = []; foreach ($cueGroup->getCueIdentifiers() as $uuid) { $slideUuids[] = $uuid->getString(); } } ``` --- ## 5. Slides ### Definition Slides are individual presentation frames. Each slide can contain multiple elements (text, shapes, media). ### Navigation Path ``` Cue → actions[0] → slide → presentation → base_slide → elements[] ``` ### Text Elements - **Location:** `base_slide.elements[]` contains `Slide.Element` wrappers. - **Graphics Element:** Each `Slide.Element` wraps a `Graphics.Element`. - **Text Data:** `Graphics.Element.text.rtf_data` contains RTF-encoded text. - **Element Name:** `Graphics.Element.name` is a user-defined label (e.g., "Orginal", "Deutsch"). ### Slides Without Text Some slides contain only media (images, videos) or shapes. These slides have `elements[]` with no `text` field set. ### UUID References Groups reference slides by UUID. Use `Cue.uuid` to match slides to group references. ### Example (Test.pro) - **5 slides total** - **Chorus group** → 2 slides (UUIDs referenced in `cue_identifiers`) --- ## 6. Arrangements ### Definition Arrangements define the order and selection of groups for a presentation. They specify which groups appear and in what sequence. ### Characteristics - **Group References:** Ordered array of group UUIDs (`group_identifiers`). - **Repetition:** The same group UUID can appear multiple times (e.g., Chorus repeated 3 times). - **Optional:** Songs may have 0 or more arrangements. - **No Arrangements:** 17 out of 169 reference files have no arrangements. This is valid. ### Example (Test.pro) - **Arrangement "normal":** Verse 1 → Chorus → Verse 2 → Chorus → Ending - **Arrangement "test2":** Verse 1 → Verse 2 → Chorus ### Access Pattern ```php foreach ($presentation->getArrangements() as $arrangement) { $name = $arrangement->getName(); $groupUuids = []; foreach ($arrangement->getGroupIdentifiers() as $uuid) { $groupUuids[] = $uuid->getString(); } } ``` --- ## 7. Translations ### Definition Multiple `elements[]` per slide represent multiple text layers. The first element is the original text; subsequent elements are translations. ### Characteristics - **Element Count:** 1 element = no translation. 2+ elements = translation present. - **Element Names:** User-defined labels (e.g., "Orginal", "Deutsch", "Text", "Text 2"). - **Label Patterns:** 3 known patterns observed: 1. "Orginal" / "Deutsch" 2. "Text" / "Text 2" 3. No specific naming (generic labels) - **Not Standardized:** Element names are arbitrary strings. Do NOT assume fixed labels. ### Detection ```php $textElements = []; foreach ($baseSlide->getElements() as $slideElement) { $graphicsElement = $slideElement->getElement(); if ($graphicsElement !== null && $graphicsElement->hasText()) { $textElements[] = $graphicsElement; } } $hasTranslation = count($textElements) >= 2; $originalText = $textElements[0]->getText()->getRtfData(); $translationText = $textElements[1]->getText()->getRtfData() ?? null; ``` ### Example (Test.pro) - **Slide 1:** 2 text elements → "Orginal" (German), "Deutsch" (English translation) - **Element Names:** User-defined, not standardized --- ## 8. Edge Cases ### Empty Files - **Size:** 0 bytes - **Validity:** Invalid - **Action:** Throw exception ### Songs Without Arrangements - **Frequency:** 17 out of 169 reference files - **Validity:** Valid - **Behavior:** `arrangements[]` is empty. Groups and slides still exist. ### Non-Song Presentations - **Examples:** ANKUENDIGUNGEN, MODERATION, THEMA - **Characteristics:** Have groups and slides but may lack text elements. - **Validity:** Valid ### Slides Without Text - **Characteristics:** `elements[]` contains shapes, media, or other non-text elements. - **Detection:** `Graphics.Element.hasText()` returns false. - **Validity:** Valid ### COPYRIGHT and BLANK Groups - **Treatment:** Regular groups (no special handling required). - **Validity:** Valid --- ## 9. RTF Text Format ### Format Variant - **Type:** Apple CocoaRTF 2761 - **Encoding:** Windows-1252 (ANSI codepage 1252) ### Structure ``` {\rtf1\ansi\ansicpg1252\cocoartf2761 {\fonttbl\f0\fswiss\fcharset0 Helvetica;} {\colortbl;\red255\green255\blue255;} {\*\expandedcolortbl;;} \pard\tx560\tx1120\tx1680\tx2240\tx2800\tx3360\tx3920\tx4480\tx5040\tx5600\tx6160\tx6720\pardirnatural\partightenfactor0 \f0\fs96 \cf1 \CocoaLigature0 TEXT STARTS HERE} ``` ### Text Extraction - **Text Start:** After `\CocoaLigature0 ` (space after 0 is the delimiter). - **Soft Returns:** `\` + newline character = line break within slide. - **Paragraph Breaks:** `\par` = paragraph break. ### Character Encoding #### Windows-1252 Hex Escapes - **Format:** `\'xx` where `xx` is a hex byte value. - **Examples:** - `\'fc` → ü (U+00FC) - `\'f6` → ö (U+00F6) - `\'e4` → ä (U+00E4) - `\'df` → ß (U+00DF) #### Unicode Escapes - **Format:** `\uN?` where `N` is a decimal codepoint, `?` is an ANSI fallback character. - **Examples:** - `\u8364?` → € (U+20AC) - `\u8220?` → " (U+201C) - `\u8221?` → " (U+201D) - **Negative Values:** RTF uses signed 16-bit integers. Negative values are converted: `codepoint + 65536`. ### Control Words - **Format:** `\word[N]` followed by space or non-alpha character. - **Common Words:** - `\par` → paragraph break - `\CocoaLigature0` → text start marker - `\f0`, `\fs96`, `\cf1` → formatting (font, size, color) - **Delimiter:** Space after control word is consumed (not part of text). ### Escaped Characters - `\{` → `{` - `\}` → `}` - `\\` → `\` (or soft return in ProPresenter context) ### Example RTF ```rtf {\rtf1\ansi\ansicpg1252\cocoartf2761 {\fonttbl\f0\fswiss\fcharset0 Helvetica;} {\colortbl;\red255\green255\blue255;} {\*\expandedcolortbl;;} \pard\tx560\pardirnatural\partightenfactor0 \f0\fs96 \cf1 \CocoaLigature0 Gro\'dfe Gnade\ Amazing Grace} ``` **Plain Text Output:** ``` Große Gnade Amazing Grace ``` --- ## 10. PHP Parser Usage ### Installation ```bash composer require propresenter/parser ``` ### Read a Song ```php use ProPresenter\Parser\ProFileReader; $song = ProFileReader::read('path/to/song.pro'); ``` ### Access Song Metadata ```php // Song name $name = $song->getName(); // "Amazing Grace" // Song UUID $uuid = $song->getUuid(); // "A1B2C3D4-..." // Groups $groups = $song->getGroups(); // Group[] // Slides $slides = $song->getSlides(); // Slide[] // Arrangements $arrangements = $song->getArrangements(); // Arrangement[] ``` ### Access Groups ```php foreach ($song->getGroups() as $group) { $name = $group->getName(); // "Verse 1" $uuid = $group->getUuid(); // "E5F6G7H8-..." $color = $group->getColor(); // ['r' => 1.0, 'g' => 0.0, 'b' => 0.0, 'a' => 1.0] or null $slideUuids = $group->getSlideUuids(); // ["uuid1", "uuid2", ...] } ``` ### Access Slides ```php foreach ($song->getSlides() as $slide) { $uuid = $slide->getUuid(); $plainText = $slide->getPlainText(); // Extracted from first text element // Check for translation if ($slide->hasTranslation()) { $translation = $slide->getTranslation(); $translatedText = $translation->getPlainText(); } // Access all text elements foreach ($slide->getTextElements() as $textElement) { $name = $textElement->getName(); // "Orginal", "Deutsch", etc. $rtf = $textElement->getRtfData(); // Raw RTF bytes $plain = $textElement->getPlainText(); // Extracted plain text } } ``` ### Access Arrangements ```php foreach ($song->getArrangements() as $arrangement) { $name = $arrangement->getName(); // "normal" $groupUuids = $arrangement->getGroupUuids(); // ["uuid1", "uuid2", "uuid1", ...] // Resolve groups $groups = $song->getGroupsForArrangement($arrangement); foreach ($groups as $group) { echo $group->getName() . "\n"; } } ``` ### Access Slides for a Group ```php $group = $song->getGroupByName("Chorus"); $slides = $song->getSlidesForGroup($group); foreach ($slides as $slide) { echo $slide->getPlainText() . "\n"; } ``` ### Modify and Write ```php use ProPresenter\Parser\ProFileWriter; // Modify song $song->setName("New Song Title"); // Modify group $group = $song->getGroupByName("Verse 1"); $group->setName("Strophe 1"); // Modify arrangement $arrangement = $song->getArrangementByName("normal"); $arrangement->setName("default"); // Write to file ProFileWriter::write($song, 'output.pro'); ``` ### Error Handling ```php try { $song = ProFileReader::read('song.pro'); } catch (\RuntimeException $e) { // File not found, empty file, or invalid protobuf echo "Error: " . $e->getMessage(); } ``` ### Example: Extract All Text ```php $song = ProFileReader::read('song.pro'); foreach ($song->getGroups() as $group) { echo "Group: " . $group->getName() . "\n"; $slides = $song->getSlidesForGroup($group); foreach ($slides as $slide) { echo " Original: " . $slide->getPlainText() . "\n"; if ($slide->hasTranslation()) { echo " Translation: " . $slide->getTranslation()->getPlainText() . "\n"; } } } ``` ### Example: Create Arrangement ```php $song = ProFileReader::read('song.pro'); // Get group UUIDs $verse1 = $song->getGroupByName("Verse 1"); $chorus = $song->getGroupByName("Chorus"); $verse2 = $song->getGroupByName("Verse 2"); // Create new arrangement $arrangement = new Arrangement(new \Rv\Data\Presentation\Arrangement()); $arrangement->setName("custom"); $arrangement->setGroupUuids([ $verse1->getUuid(), $chorus->getUuid(), $verse2->getUuid(), $chorus->getUuid(), ]); // Add to song (requires direct protobuf access) $song->getPresentation()->getArrangements()[] = $arrangement->getProto(); ProFileWriter::write($song, 'output.pro'); ``` --- ## Appendix: Test.pro Structure ### Groups (4) 1. **Verse 1** → 1 slide 2. **Verse 2** → 1 slide 3. **Chorus** → 2 slides 4. **Ending** → 1 slide ### Slides (5) - Slide 1: Verse 1 text (2 text elements: "Orginal", "Deutsch") - Slide 2: Verse 2 text (2 text elements) - Slide 3: Chorus text part 1 (2 text elements) - Slide 4: Chorus text part 2 (2 text elements) - Slide 5: Ending text (2 text elements) ### Arrangements (2) 1. **normal:** Verse 1 → Chorus → Verse 2 → Chorus → Ending 2. **test2:** Verse 1 → Verse 2 → Chorus --- ## Appendix: Reference Statistics - **Total Files:** 169 - **Parseable Files:** 168 - **Empty Files:** 1 (invalid) - **Files Without Arrangements:** 17 (valid) - **Binary Fidelity:** 0 files pass round-trip decode→encode (proto definitions incomplete) --- ## Appendix: Proto Field Numbers Quick Reference | Message | Field | Number | |---------|-------|--------| | Presentation | name | 1 | | Presentation | uuid | 5 | | Presentation | cues | 13 | | Presentation | cue_groups | 12 | | Presentation | arrangements | 11 | | CueGroup | group | 1 | | CueGroup | cue_identifiers | 2 | | Group | uuid | 1 | | Group | name | 2 | | Group | color | 3 | | Arrangement | uuid | 1 | | Arrangement | name | 2 | | Arrangement | group_identifiers | 3 | | Cue | uuid | 1 | | Cue | actions | 10 | | Action | slide | 23 | | Action.SlideType | presentation | 2 | | PresentationSlide | base_slide | 1 | | Slide | elements | 1 | | Slide.Element | element | 1 | | Graphics.Element | uuid | 1 | | Graphics.Element | name | 2 | | Graphics.Element | text | 13 | | Graphics.Text | rtf_data | 3 | --- **End of Specification**