Starbelly

Starbelly is a user-friendly web crawler that is easy to deploy and configure.

Security

For information about security considerations and best practices for deploying Starbelly, please see docs/SECURITY.md.

If you discover a security vulnerability, please email acaceres@hyperiongray.com rather than opening a public issue.

Policy-Based Crawling: Define custom crawl policies to control crawler behavior
Graphical User Interface: Easy-to-use web interface for managing crawls
WebSocket API: Real-time communication and streaming results
Docker Deployment: Simple deployment using Docker containers
RethinkDB Backend: Scalable database for storing crawl data
Asynchronous I/O: Built on Trio for high-performance concurrent crawling

Installation

Starbelly is deployed using Docker and Docker Compose. See the Installation Guide for detailed instructions.

Quick start:

git clone https://github.com/hyperiongray/starbelly-docker.git
cd starbelly-docker/starbelly
docker-compose up -d

Usage

After installation, navigate to your server's address in a web browser. The default credentials are:

Username: admin
Password: admin

For detailed usage instructions, see the documentation.

Documentation

Complete documentation is available at starbelly.readthedocs.io.

Examples

Example notebooks and scripts are available in the notebooks and examples directories.

API

Starbelly provides a WebSocket API for programmatic access. See the WebSocket API documentation for details.

Python client library: starbelly-python-client

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

License

Starbelly is under the MIT License. See LICENSE for details.

For commercial support or inquiries, please contact Hyperion Gray at acaceres@hyperiongray.com

Name		Name	Last commit message	Last commit date
Latest commit History 592 Commits
.github		.github
bin		bin
conf		conf
dev		dev
docs		docs
examples		examples
integration		integration
notebooks		notebooks
starbelly		starbelly
tests		tests
tools		tools
.gitignore		.gitignore
.pylintrc		.pylintrc
.python-version		.python-version
.readthedocs.yml		.readthedocs.yml
.travis.yml		.travis.yml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
LICENSE		LICENSE
Makefile		Makefile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
SECURITY.md		SECURITY.md
SECURITY_SUMMARY.md		SECURITY_SUMMARY.md
START_HERE.md		START_HERE.md
STREAMING_API.md		STREAMING_API.md
bfg-1.15.0.jar		bfg-1.15.0.jar
conftest.py		conftest.py
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
rules.json		rules.json
starbelly.proto		starbelly.proto

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Starbelly

Security

Installation

Usage

Documentation

Examples

API

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Starbelly

Security

Installation

Usage

Documentation

Examples

API

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages