top of page

Create Your First Project

Start adding your projects to your portfolio. Click on "Manage Projects" to get started

ROBOT

Project type

Robot

History

2025

Location

Izmir

Robot Project

"A robot that not only hears, but also understands."

This project is an advanced personal assistant robot system supported by artificial intelligence, controlled by voice, that can perceive its environment and react like a human. It not only talks, but also sees, recognizes, learns and responds.
The entire system is designed to work completely offline and has an architecture that minimizes external dependency.

๐Ÿš€ Basic Features:
๐ŸŽค Voice Interaction:

Wakes up when you say "Robot".

Naturally perceives voice commands (not predefined).

Speaks with an ultra-natural Turkish voice with ElevenLabs.

Whisper-based voice detection and command understanding system is integrated.

๐Ÿง  Artificial Intelligence Supported:

Uses Google Gemini (including offline version).

Answers questions, offers suggestions, analyzes conversations.

Gives personalized answers (for example, different answers to people it knows with facial recognition).

๐ŸŽฅ Visual Detection:

Object recognition with YOLOv8

Hand gesture detection with MediaPipe

Face recognition and person-specific response with DeepFace

Real-time analysis and response via camera

๐ŸŽต Multimedia Control:

When you say "Play...", it opens music from Spotify (without API, via browser)

When you say "Open...", it plays the first video on YouTube

When you say "Open movie...", it searches directly on Turkish movie sites

Secret music playback: Songs are stored in encrypted ZIP files, only opened with special commands

๐Ÿ“ฑ Interface:

Simple and animated user interface with Kivy GUI

Robot icon in the center with eye image

Push-button control (YouTube, Spotify, Movie, Chat, Personal Music)

Automatic updated command history and debug logs

๐Ÿ” Security and Customization:

Special settings for each user (personal sounds, commands, response system)

Control panel open only to developers as admin (opened with hidden commands)

The entire system is modular and expandable

๐Ÿ“ฆ Technologies Used:
Python 3.12

Kivy (GUI)

ElevenLabs (voice)

Google Gemini (AI)

Whisper / Vosk (speech-to-text)

DeepFace (face recognition)

YOLOv8 (object detection)

MediaPipe (hand gesture)

ctypes, subprocess, threading (system integrations)

๐Ÿ“ Goal:
To make human-machine interaction natural.
To build not just a command system, but a friend, an assistant, an observer, and an entertainment center.

"This robot is no longer just a program. It's my digital alter ego."

bottom of page