VisionKey is a computer-vision-based virtual keyboard built with Python and OpenCV that lets you type using only hand gestures. No physical keyboard required!
It uses real-time hand tracking to detect finger gestures and simulate key presses by interacting with a fully functional QWERTY keyboard rendered directly on the webcam feed.
- 🖐️ Gesture-based keypress using index–thumb pinch
- 🔍 Hand tracking via MediaPipe
- 🔡 Full QWERTY layout with numbers, punctuation, and symbols
- 🟩 Transparent key overlays with real-time visual feedback
- 🎯 Fingertip pointers for better accuracy
- 🔄 Works across Windows, macOS, and Linux
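
To illustrate how an on-screen layout and fingertip pointer can work together, here is a minimal sketch of building a centered key grid and finding the key under a point. It is not taken from `main.py`: the `ROWS` layout, the `Key` class, the `build_keys` and `key_at` helpers, and the pixel sizes are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical rows; the real project may order or size its keys differently.
ROWS = ["1234567890", "QWERTYUIOP", "ASDFGHJKL;", "ZXCVBNM,./"]


@dataclass(frozen=True)
class Key:
    label: str
    x: int     # left edge in pixels
    y: int     # top edge in pixels
    size: int  # width and height in pixels

    def contains(self, px: int, py: int) -> bool:
        """Return True if the point (px, py) lies inside this key."""
        return (self.x <= px < self.x + self.size
                and self.y <= py < self.y + self.size)


def build_keys(frame_width: int, top: int = 100,
               size: int = 60, gap: int = 10) -> List[Key]:
    """Lay out each row of keys, horizontally centered in the frame."""
    keys = []
    for r, row in enumerate(ROWS):
        row_width = len(row) * size + (len(row) - 1) * gap
        left = (frame_width - row_width) // 2
        for c, label in enumerate(row):
            keys.append(Key(label, left + c * (size + gap),
                            top + r * (size + gap), size))
    return keys


def key_at(keys: List[Key], px: int, py: int) -> Optional[Key]:
    """Return the key under the fingertip, or None if it is between keys."""
    for key in keys:
        if key.contains(px, py):
            return key
    return None
```

In a real loop, `px` and `py` would be the index fingertip position in frame pixels, and the returned key would be the one to highlight.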
- 🐍 Python 3.7+
- 📸 OpenCV
- ✋ MediaPipe
- 🧰 cvzone
git clone https://github.com/ArchitJ6/VisionKey.git
cd VisionKey
pip install -r requirements.txt
python main.py
- Webcam
- Python 3.7+
- A steady hand ✋🙂
VisionKey/
├── LICENSE
├── main.py # Main script to launch the keyboard
├── README.md # Project documentation
└── requirements.txt # Python dependencies
- Launch the script and allow camera access.
- The virtual keyboard will appear centered in the camera feed.
- Move your index finger to hover over a key.
- Pinch your index finger and thumb together to "press" the key.
- Typed text is shown in a preview window below the keyboard.
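
The hover-and-pinch steps above can be sketched as a small state machine. This is an assumption about one reasonable approach, not the project's actual code: `is_pinching`, `PinchTyper`, and the 40-pixel threshold are hypothetical. The landmark indices 4 (thumb tip) and 8 (index fingertip) follow the MediaPipe Hands landmark model, and the points are assumed to be in pixel coordinates.

```python
import math
from typing import Optional, Sequence

THUMB_TIP = 4  # MediaPipe Hands landmark index for the thumb tip
INDEX_TIP = 8  # MediaPipe Hands landmark index for the index fingertip


def is_pinching(landmarks: Sequence[Sequence[float]],
                threshold: float = 40.0) -> bool:
    """Return True when thumb and index tips are closer than the threshold.

    Each landmark is (x, y) or (x, y, z); only x and y are used.
    The threshold is an illustrative guess and would need tuning.
    """
    tx, ty = landmarks[THUMB_TIP][:2]
    ix, iy = landmarks[INDEX_TIP][:2]
    return math.hypot(ix - tx, iy - ty) < threshold


class PinchTyper:
    """Appends one character per pinch, on the open-to-closed transition."""

    def __init__(self, threshold: float = 40.0) -> None:
        self.threshold = threshold
        self.was_pinching = False
        self.text = ""

    def update(self, landmarks: Sequence[Sequence[float]],
               hovered_label: Optional[str]) -> str:
        """Process one frame and return the text typed so far."""
        pinching = is_pinching(landmarks, self.threshold)
        # Fire only when the pinch starts, so a held pinch does not repeat.
        if pinching and not self.was_pinching and hovered_label is not None:
            self.text += hovered_label
        self.was_pinching = pinching
        return self.text
```

Triggering on the transition rather than on every pinched frame avoids typing the same key dozens of times per second while the fingers stay together.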
This project is licensed under the MIT License. See the LICENSE file for details.
- Add Shift / CapsLock support
- Enable multi-language layouts
- Integrate with text-to-speech
- Support swipe gestures for faster input