A CraftersMC Guides Product

Data Stripper

Automatically collects bazaar prices from CraftersMC

Why Data Stripper?

CraftersMC doesn't have a public API for bazaar prices, which makes it hard for players to get accurate price data on a website. At first we updated the price of every item by hand, but that quickly became unrealistic given the number of items. So we built an automated data stripper that pulls prices directly from screenshots and refreshes the whole price list with one click. This eliminates manual updates and keeps this website up to date with the latest bazaar prices.

Download

Linux

Primary | v0.0.1 Beta Coming Soon

Windows

Coming Soon

Android

Coming Soon

Mac

Not Planned

Key Features

FAQ

How accurate is the OCR?

Accuracy generally ranges from 85% to 95% on clear screenshots. High-resolution images with good contrast give the best results.

Why are some prices wrong?

Wrong prices usually come from low contrast, small fonts, misread decimal points, or poor-quality screenshots. The application applies fallback pre-processing to improve the results.

What formats are acceptable?

PNG (recommended), JPG, BMP, GIF, TIFF. Very large images may take longer to process.

How do I take screenshots?

Click the screenshot button in the app or press Ctrl + Shift + S.

On Linux, you may need to install gnome-screenshot, scrot, or ImageMagick.

macOS has built-in screenshot tools.

What if the hotkey isn't working?

Check the app's permissions and look for other apps that use the same shortcut.

How does item matching work?

Matching tries four stages in order, moving to the next only when the previous one finds nothing: exact match → fuzzy match → word matching → n-grams.
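
The cascade above can be sketched roughly as follows. This is a minimal illustration and not the application's actual code: the function names, the 0.8 fuzzy cutoff, the trigram size, and the 0.3 n-gram threshold are all assumptions, and fuzzy matching is done here with Python's standard difflib module.

```python
import difflib


def ngrams(text, n=3):
    """Return the set of character n-grams of text, ignoring case and spaces."""
    text = text.lower().replace(" ", "")
    return {text[i:i + n] for i in range(max(len(text) - n + 1, 1))}


def match_item(ocr_text, item_names, fuzzy_cutoff=0.8, ngram_cutoff=0.3):
    """Return (item_name, stage), or (None, None) when nothing matches.

    All thresholds are illustrative assumptions.
    """
    query = ocr_text.strip().lower()
    if not query:
        return None, None
    lowered = {name.lower(): name for name in item_names}

    # Stage 1: exact match (case-insensitive).
    if query in lowered:
        return lowered[query], "exact"

    # Stage 2: fuzzy match on the whole string.
    close = difflib.get_close_matches(query, list(lowered), n=1, cutoff=fuzzy_cutoff)
    if close:
        return lowered[close[0]], "fuzzy"

    # Stage 3: word matching, picking the item that shares the most words.
    query_words = set(query.split())
    best, best_overlap = None, 0
    for low, name in lowered.items():
        overlap = len(query_words & set(low.split()))
        if overlap > best_overlap:
            best, best_overlap = name, overlap
    if best is not None:
        return best, "word"

    # Stage 4: character n-grams compared with Jaccard similarity.
    query_grams = ngrams(query)
    best, best_score = None, 0.0
    for low, name in lowered.items():
        grams = ngrams(low)
        score = len(query_grams & grams) / len(query_grams | grams)
        if score > best_score:
            best, best_score = name, score
    if best_score >= ngram_cutoff:
        return best, "ngram"
    return None, None


if __name__ == "__main__":
    items = ["Enchanted Diamond", "Raw Fish", "Oak Log"]
    for text in ["enchanted diamond", "Enchanted Diam0nd", "Diamond x64"]:
        print(text, "->", match_item(text, items))
```

Ordering the stages from strictest to loosest means a clean OCR read is resolved immediately, and the looser, more error-prone stages only run on text that failed everything before them.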

Is an item missing?

Add it to itemlist.py, then restart the application.

Is an API key needed?

A default key is provided, but frequent users should add their own OCR.space API keys. Multiple keys are rotated automatically.
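
One simple way to rotate several keys is round-robin with a skip list for keys that have hit their limit. The sketch below only illustrates that idea; the class and method names are hypothetical, and the application's real rotation logic may differ.

```python
from itertools import cycle


class KeyRotator:
    """Hand out API keys in round-robin order, skipping exhausted ones.

    Hypothetical helper for illustration only.
    """

    def __init__(self, keys):
        if not keys:
            raise ValueError("at least one API key is required")
        self._keys = list(keys)
        self._cycle = cycle(self._keys)
        self._exhausted = set()

    def next_key(self):
        # Look at each key at most once per call so we never loop forever.
        for _ in range(len(self._keys)):
            key = next(self._cycle)
            if key not in self._exhausted:
                return key
        raise RuntimeError("all API keys have reached their limit")

    def mark_exhausted(self, key):
        """Record that a key hit its rate limit so it is skipped from now on."""
        self._exhausted.add(key)


if __name__ == "__main__":
    rotator = KeyRotator(["key-one", "key-two"])  # placeholder key strings
    print(rotator.next_key())
    print(rotator.next_key())
```

With this approach, adding more keys spreads requests across them, and the "all keys exhausted" error only appears once every key has been marked as rate-limited.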

How many images can I process at once?

There's no strict limit, but large batches will consume more memory and time. Parallel processing can be configured.
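
To show what a configurable worker count means in practice, here is a minimal sketch using Python's standard thread pool. The names process_batch, process_one, and max_workers are assumptions made for this example, and process_one stands in for whatever function handles a single screenshot.

```python
from concurrent.futures import ThreadPoolExecutor


def process_batch(paths, process_one, max_workers=4):
    """Run process_one on every path with at most max_workers threads.

    Returns a dict mapping each path to its result, or to the exception
    it raised, so one bad image does not abort the whole batch.
    Duplicate paths collapse into a single entry.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {path: pool.submit(process_one, path) for path in paths}
        for path, future in futures.items():
            try:
                results[path] = future.result()
            except Exception as exc:  # keep going and report the failure per image
                results[path] = exc
    return results


if __name__ == "__main__":
    def fake_ocr(path):
        # Stand-in for the real OCR step, which would upload the image.
        return f"text from {path}"

    print(process_batch(["a.png", "b.png"], fake_ocr, max_workers=2))
```

More workers usually shorten a large batch because OCR requests spend most of their time waiting on the network, but each extra worker also holds another image in memory and uses up API quota faster.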

Why is processing slow?

This could be due to large images, slow internet, a small number of workers, or limited system resources.

Which permissions does the GitHub token need?

repo (for private repositories) or public_repo (for public repositories).

What do the common errors mean?

  • API limit reached: Add more keys or wait.
  • No tooltip detected: The border of the tooltip is not visible in the screenshot.
  • Item not found: Refresh the item list.
  • Screenshot failed: Verify the screenshot tools and permissions.

Where is the data stored?

Processed data is stored in extracted_data.csv, and prices are recorded in mprices.txt. Check "Keep processed images" if you want the application to retain them.

How do I troubleshoot a problem?

Check market_data_stripper.log, enable the dependency's debug mode, and restart the app.

Which platforms are supported?

Linux, Windows 10/11, Android, and macOS (limited). OCR requires internet access.

Any tips for best results?

  • Use clear screenshots
  • Keep your item list updated
  • Add more API keys
  • Sync with GitHub periodically