
Create Your First Project
Start adding your projects to your portfolio. Click on "Manage Projects" to get started
ROBOT
Project type
Robot
History
2025
Location
Izmir
Robot Project
"A robot that not only hears, but also understands."
This project is an advanced personal assistant robot system supported by artificial intelligence, controlled by voice, that can perceive its environment and react like a human. It not only talks, but also sees, recognizes, learns and responds.
The entire system is designed to work completely offline and has an architecture that minimizes external dependency.
๐ Basic Features:
๐ค Voice Interaction:
Wakes up when you say "Robot".
Naturally perceives voice commands (not predefined).
Speaks with an ultra-natural Turkish voice with ElevenLabs.
Whisper-based voice detection and command understanding system is integrated.
๐ง Artificial Intelligence Supported:
Uses Google Gemini (including offline version).
Answers questions, offers suggestions, analyzes conversations.
Gives personalized answers (for example, different answers to people it knows with facial recognition).
๐ฅ Visual Detection:
Object recognition with YOLOv8
Hand gesture detection with MediaPipe
Face recognition and person-specific response with DeepFace
Real-time analysis and response via camera
๐ต Multimedia Control:
When you say "Play...", it opens music from Spotify (without API, via browser)
When you say "Open...", it plays the first video on YouTube
When you say "Open movie...", it searches directly on Turkish movie sites
Secret music playback: Songs are stored in encrypted ZIP files, only opened with special commands
๐ฑ Interface:
Simple and animated user interface with Kivy GUI
Robot icon in the center with eye image
Push-button control (YouTube, Spotify, Movie, Chat, Personal Music)
Automatic updated command history and debug logs
๐ Security and Customization:
Special settings for each user (personal sounds, commands, response system)
Control panel open only to developers as admin (opened with hidden commands)
The entire system is modular and expandable
๐ฆ Technologies Used:
Python 3.12
Kivy (GUI)
ElevenLabs (voice)
Google Gemini (AI)
Whisper / Vosk (speech-to-text)
DeepFace (face recognition)
YOLOv8 (object detection)
MediaPipe (hand gesture)
ctypes, subprocess, threading (system integrations)
๐ Goal:
To make human-machine interaction natural.
To build not just a command system, but a friend, an assistant, an observer, and an entertainment center.
"This robot is no longer just a program. It's my digital alter ego."