Overview
Mobile-MCP gives agents a unified API to launch devices, obtain accessibility trees, tap coordinates, run scripted flows, and collect screenshots, all without requiring deep iOS or Android expertise.
Key Capabilities
- Cross-platform device control (simulators, emulators, and physical devices).
- Structured accessibility snapshots for robust element targeting.
- Coordinate-based gestures when semantics are unavailable.
- Parallel session orchestration for large test suites.
- Vision and agent-to-agent features, currently on the roadmap, for future workflows.
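
To illustrate how structured snapshots and coordinate-based gestures can complement each other, here is a minimal sketch of resolving a tap target: look the element up by label in an accessibility snapshot, and fall back to explicit coordinates when no semantic match exists. The `Element` type, its fields, and `resolve_tap` are hypothetical names chosen for illustration; they are not Mobile-MCP's actual API, and a real snapshot would likely carry more attributes.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Element:
    """One node of a hypothetical, flattened accessibility snapshot."""
    label: str
    x: int        # left edge, in screen pixels
    y: int        # top edge, in screen pixels
    width: int
    height: int


def center(el: Element) -> tuple[int, int]:
    """Return the midpoint of the element's bounding box."""
    return (el.x + el.width // 2, el.y + el.height // 2)


def resolve_tap(
    snapshot: list[Element],
    label: str,
    fallback: Optional[tuple[int, int]] = None,
) -> tuple[int, int]:
    """Prefer semantic targeting; use raw coordinates only as a fallback."""
    for el in snapshot:
        if el.label == label:
            return center(el)
    if fallback is not None:
        return fallback
    raise LookupError(f"no element labelled {label!r} and no fallback given")


if __name__ == "__main__":
    snapshot = [Element(label="Login", x=100, y=200, width=80, height=40)]
    print(resolve_tap(snapshot, "Login"))
    print(resolve_tap(snapshot, "Missing", fallback=(10, 20)))
```

Targeting by label first keeps a flow stable across screen sizes and layout changes, while the coordinate fallback covers views that expose no accessibility semantics.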