skip to navigation
skip to content

Not Logged In

py_robot 0.1


What is it?

PyRobot is a lightweight, pure Python Windows automation library loosely modeled after Java's Robot Class. It can be used to drive applications that don't provide an API or any way of hooking into them programatically. Good for when you can't (or don't want to) install the great, but a bit hefty, pywin32 libraries.

You can check out a couple of use cases for PyRobot [here](


PyRobot is a single Python file. So, you can either download it directly from this github page, or clone the repo:

git clone


Similar to Java, everything is controlled via a single class.


# Import the robot class from the pyrobot module
from pyrobot import Robot

# create an instance of the class
robot = Robot()

# Launch a program

# Do cool stuff
location_of_field = (340, 400)
expected_color = (255,255,255)

pixel = robot.get_pixel(location_of_field)
if pixel == (expected_color):
robot.add_to_clipboard('Hello world! How are ya!')

# etc..

# If you need to take screengrabs, and have PIL Installed,
# PyRobot extends the default behavior of PIL by
# allowing the entire virtual screen to be targeted (rather
# than just the primary).

# returns a list of screen rects for all attached devices
monitors_coords = robot.get_display_monitors()

# Takes a screenshot of the desktop and returns a PIL Image object
im = robot.take_screenshot(monitors_coords[-1]) # Coordinates of last monitor

# You can also do arbitrarily sized boxes
left = 100
top = 430
width = 1000
hright = 750

im = robot.take_screenshot((left, top, width, height))

# Save the PIL Image to disk'my_awesome_screenshot.png', 'png')

# and so on


* Added ability to read data from the clipboard.

my_var = robot.get_clipboard_data()

* convenience method `press_and_release`


* Added `Key` class

from pyrobot import Robot, Keys
robot = Robot()




| Method | Summary |
| --------------------------- | --------------------------- |
| get_mouse_pos() | Returns tuple of current mouse coordinates |
| set_mouse_pos(x, y) | Moves mouse pointer to given screen coordinates. |
| click_mouse(button) | Simulates a full mouse click. One down event, one up event.
button can be `'Left'`, `'right'`, or `'middle'` |
| double_click_mouse(button) | Two full mouse clicks. One down event, one up event. |
| move_and_click(x,y,button) | convenience function: Move to coordinates and click mouse |
| mouse_down(button) | Presses one mouse button.
button can be `'Left'`, `'right'`, or `'middle'` |
| mouse_up(button) | Releases mouse button.
button can be `'Left'`, `'right'`, or `'middle'` |
| scroll_mouse_wheel(direction, clicks) | Scrolls the mouse wheel either `'up'` or `'down'` X number of 'clicks'.                                |


| Method | Summary |
| --------------------------- | --------------------------- |
| key_press(key) | Presses a given key.
Input: `string` or `Keys.*` constant |
| key_release(key) | Releases a given key.
Input: `string` or `Keys.*` constant |
| press_and_release() | Simulates pressing a key: One down event, one release event                                                                            |


| Method | Summary |
| --------------------------- | --------------------------- |
| add_to_clipboard(string) | Copy text into clip board for later pasting. |
| clear_clipboard() | Clear everything out of the clipboard |
| get_clipboard_data() | Retrieves text from the Windows clipboard as a String |
| paste() | Pastes the contents of the clipboard                                                                                                                                                                                         |


| Method | Summary |
| --------------------------- | --------------------------- |
| get_display_monitors() | Enumerates and returns a list of virtual screen coordinates for the attached display devices
output = [
    (left, top, right, bottom), # Monitor 1
    (left, top, right, bottom) # Monitor 2
    # etc...
] |
| get_pixel(x, y) | Returns the pixel color `tuple(R,G,B)` of the given screen coordinate |
| take_screenshot(bounds=None) | NOTE: REQUIRES PYTHON IMAGE LIBRARY
Takes a screenshot of the entire desktop and returns it as a PIL `Image` object.

Use with `get_display_monitors` to target a specific screen, or pass in a tuple consisting of (`left`, `top`, `width`, `height`).                                                                                             |


| Method | Summary |
| --------------------------- | --------------------------- |
| sleep(duration) | Pauses the robot for `duration` number of seconds. |
| start_program(full_path) | Launches a windows application.
Input type: `string` |
| type_backwards(input_string, delay=.05) | Types right to left. Because why not! |
| type_string(input_string, delay=.005) | Convenience function for typing out strings.
delay = time between keystrokes                                                                                                                                                                           |

Special take_snapshot Note

Being that Python has no built in Image library (that I know of), it seemed of little use to include a snapshot method which simply saves the picture to disk. For 90% of program automation, I find that querying pixels works fairly well (e.g. `robot.get_pixel`). However, if you need to do more advanced display searching, or want to do template matching, you'll need an external library. So there is a method in there to contruct a PIL Image object. It just, of course, requires that PIL be installed. Other than that, dependency free! :-)


* Allow specific window/program targeting so that relative coordinates can be used while scripting.


Feature request? Bug? Hate it?
Drop me a line at, or on Twitter @thechriskiehl  
  • Downloads (All Versions):
  • 0 downloads in the last day
  • 0 downloads in the last week
  • 5 downloads in the last month