feat(app): automated Maestro E2E functional tests (#3857) by Iansabia · Pull Request #4733 · BasedHardware/omi

Iansabia · 2026-02-11T02:14:13Z

Summary

Adds 10 Maestro E2E test flows covering all core app functionality: onboarding/sign-in, conversations (list, detail, CRUD), memories, chat, apps/plugins, settings, device connection, and recording
Flows are tagged core (runs on simulator) vs device_required (needs physical Omi hardware)
Includes runner scripts (run_all.sh, run_device.sh) with pass/fail reporting and optional HTML output
Integrates into existing test.sh via --e2e flag

How It Works

Install Maestro: brew install maestro
Build and install the app on a simulator or device
Run core tests: bash app/.maestro/scripts/run_all.sh
Run device tests (with Omi connected): bash app/.maestro/scripts/run_device.sh
Or use bash app/test.sh --e2e to run unit + widget + E2E tests together

After ~1 hour you get a full report covering sign-in, conversation recording/transcription, CRUD operations, chat, and app management.

Test Flows

Flow	What It Tests	Device Required
01_onboarding	Sign-in, name entry, language, permissions	No
02_conversations_list	List rendering, scrolling	No
03_conversation_detail	Opening conversation, transcript view	No
04_conversation_crud	Create, update, delete conversations	No
05_memories	Memory list, creation, interaction	No
06_chat	Chat input, AI responses	No
07_apps	App store, plugin install/manage	No
08_settings	Settings navigation, preferences	No
09_device_connection	BLE scan, pair, connect status	Yes
10_recording	Record, transcribe, verify conversation	Yes

Test plan

Install Maestro CLI
Build app with flutter build ios --flavor dev --simulator
Run bash app/.maestro/scripts/run_all.sh on simulator
Verify all 8 core flows pass
Run bash app/.maestro/scripts/run_device.sh with Omi connected
Verify recording + device flows pass

Closes #3857

Configures Maestro for automated functional testing with core flows and device-required flow separation via tags.

Tests app launch, sign-in, name entry, language selection, permissions, speech profile skip, and home screen landing.

Tests conversation list rendering, scrolling, and list item visibility.

Tests opening a conversation, viewing transcript, and detail screen elements.

Tests creating, updating, and deleting conversations through the UI.

Tests memory list display, creation, and interaction.

Tests chat input, message sending, and AI response display.

Tests app store browsing, plugin installation, and management.

Tests settings screen navigation, preference toggles, and profile access.

Tests BLE device scanning, pairing, and connection status. Requires physical Omi device (tagged device_required).

Tests recording start, transcription indicator, and conversation creation from captured audio. Requires physical Omi device.

Runs all core E2E flows sequentially with pass/fail summary and optional HTML report generation.

Runs Maestro flows that require a physical Omi device connected.

Adds --e2e flag to run Maestro functional tests alongside existing unit/widget tests.

gemini-code-assist

Code Review

This pull request introduces a comprehensive suite of Maestro E2E tests, which is a great addition for ensuring app quality. The tests cover core functionality and are well-structured with tags for simulator vs. device-specific flows. My review focuses on improving the maintainability and robustness of the new test scripts. I've identified a few areas with code duplication in both the YAML flow definitions and the shell runner scripts. Addressing these will make the test suite more resilient and easier to manage as the app evolves.

app/.maestro/flows/01_onboarding.yaml

app/.maestro/scripts/run_all.sh

app/.maestro/scripts/run_device.sh

…g flow Merges duplicate Maybe Later/later runFlow blocks into a single regex-based match per code review feedback.

…_all.sh Uses --exclude-tags=device_required to run all core flows dynamically, keeping config.yaml as the single source of truth for flow categorization.

…_device.sh Uses --include-tags=device_required to run device flows dynamically, keeping config.yaml as the single source of truth.

beastoin · 2026-02-17T07:36:41Z

Hey @Iansabia, closing this for now — thanks for putting it together.

The code and write-up look solid, but there's no real evidence that any of this was actually run and tested end-to-end. The test plan checkboxes are unchecked, and there are no screenshots, terminal output, videos, or logs showing it working on a real device or environment.

This matters more than ever now that AI makes writing code easy — the code itself isn't the hard part anymore. What's valuable is proving it actually works: real test output, real screenshots, real demo. That's what gives reviewers confidence to merge.

Feel free to reopen once you have real end-to-end evidence — run the tests, paste the output, show it working. We'd love to merge it then.

github-actions · 2026-02-17T07:36:51Z

Hey @Iansabia 👋

Thank you so much for taking the time to contribute to Omi! We truly appreciate you putting in the effort to submit this pull request.

After careful review, we've decided not to merge this particular PR. Please don't take this personally — we genuinely try to merge as many contributions as possible, but sometimes we have to make tough calls based on:

Project standards — Ensuring consistency across the codebase
User needs — Making sure changes align with what our users need
Code best practices — Maintaining code quality and maintainability
Project direction — Keeping aligned with our roadmap and vision

Your contribution is still valuable to us, and we'd love to see you contribute again in the future! If you'd like feedback on how to improve this PR or want to discuss alternative approaches, please don't hesitate to reach out.

Thank you for being part of the Omi community! 💜

Iansabia added 14 commits February 10, 2026 21:13

feat(app): add Maestro E2E test configuration

7322d1a

Configures Maestro for automated functional testing with core flows and device-required flow separation via tags.

feat(app): add onboarding E2E flow

e47e4c5

Tests app launch, sign-in, name entry, language selection, permissions, speech profile skip, and home screen landing.

feat(app): add conversations list E2E flow

61df76b

Tests conversation list rendering, scrolling, and list item visibility.

feat(app): add conversation detail E2E flow

2ac8cad

Tests opening a conversation, viewing transcript, and detail screen elements.

feat(app): add conversation CRUD E2E flow

fc1de12

Tests creating, updating, and deleting conversations through the UI.

feat(app): add memories E2E flow

3d9cb5a

Tests memory list display, creation, and interaction.

feat(app): add chat E2E flow

c3a707f

Tests chat input, message sending, and AI response display.

feat(app): add apps/plugins E2E flow

2b92e8b

Tests app store browsing, plugin installation, and management.

feat(app): add settings E2E flow

ebd83ca

Tests settings screen navigation, preference toggles, and profile access.

feat(app): add device connection E2E flow

fa76df6

Tests BLE device scanning, pairing, and connection status. Requires physical Omi device (tagged device_required).

feat(app): add recording E2E flow

c03f23e

Tests recording start, transcription indicator, and conversation creation from captured audio. Requires physical Omi device.

feat(app): add Maestro test runner script

216c21c

Runs all core E2E flows sequentially with pass/fail summary and optional HTML report generation.

feat(app): add device-dependent test runner script

6a10594

Runs Maestro flows that require a physical Omi device connected.

feat(app): integrate Maestro E2E tests into test.sh

1f3ef0d

Adds --e2e flag to run Maestro functional tests alongside existing unit/widget tests.

Iansabia mentioned this pull request Feb 11, 2026

omi mobile app functional tests ($300) #3857

Open

gemini-code-assist bot reviewed Feb 11, 2026

View reviewed changes

app/.maestro/flows/01_onboarding.yaml Show resolved Hide resolved

app/.maestro/scripts/run_all.sh Outdated Show resolved Hide resolved

app/.maestro/scripts/run_device.sh Outdated Show resolved Hide resolved

Iansabia added 12 commits February 10, 2026 21:19

fix(app): use regex for case-insensitive popup dismissal in onboardin…

7cd1eaa

…g flow Merges duplicate Maybe Later/later runFlow blocks into a single regex-based match per code review feedback.

fix(app): use regex for popup dismissal in conversations list flow

012de9e

fix(app): use regex for popup dismissal in conversation detail flow

6e2b37e

fix(app): use regex for popup dismissal in conversation CRUD flow

0b8a0fa

fix(app): use regex for popup dismissal in memories flow

5f2f0ec

fix(app): use regex for popup dismissal in chat flow

a8a3a7b

fix(app): use regex for popup dismissal in apps flow

67a95fe

fix(app): use regex for popup dismissal in settings flow

004f8ba

fix(app): use regex for popup dismissal in device connection flow

616c640

fix(app): use regex for popup dismissal in recording flow

451dd89

refactor(app): use Maestro tags instead of hardcoded flow list in run…

b565600

…_all.sh Uses --exclude-tags=device_required to run all core flows dynamically, keeping config.yaml as the single source of truth for flow categorization.

refactor(app): use Maestro tags instead of hardcoded flow list in run…

9bf38b2

…_device.sh Uses --include-tags=device_required to run device flows dynamically, keeping config.yaml as the single source of truth.

beastoin closed this Feb 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(app): automated Maestro E2E functional tests (#3857)#4733

feat(app): automated Maestro E2E functional tests (#3857)#4733
Iansabia wants to merge 26 commits intoBasedHardware:mainfrom
Iansabia:feat/maestro-functional-tests-v2

Iansabia commented Feb 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

beastoin commented Feb 17, 2026

Uh oh!

github-actions bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

Iansabia commented Feb 11, 2026

Summary

How It Works

Test Flows

Test plan

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

beastoin commented Feb 17, 2026

Uh oh!

github-actions bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments