Free APIs for Data Engineering
Practicing data engineering is better with real data sources. If you are considering doing a data engineering project, consider the following:
- Ideally, your data has entities and activities, so you can model dimensions and facts.
- Ideally, the APIs have no auth, so they can be easily tested.
- Ideally, the API should have some use case that you are modelling and showing the data for.
- Ideally, you build end-to-end pipelines to showcase extraction, ingestion, modelling and displaying data.
This article outlines 10 APIs, detailing their use cases, any free tier limitations, and authentication needs.
Material teaching data loading with dlt:
Data talks club data engineering zoomcamp
Data talks club open source spotlight
Docs
APIs Overview
1. PokeAPI
- URL: PokeAPI.
- Use: Import Pokémon data for projects on data relationships and stats visualization.
- Free: Rate-limited to 100 requests/IP/minute.
- Auth: None.
2. REST Countries API
- URL: REST Countries.
- Use: Access country data for projects analyzing global metrics.
- Free: Unlimited.
- Auth: None.
3. OpenWeather API
- URL: OpenWeather.
- Use: Fetch weather data for climate analysis and predictive modeling.
- Free: Limited requests and features.
- Auth: API key.
4. JSONPlaceholder API
- URL: JSONPlaceholder.
- Use: Ideal for testing and prototyping with fake data. Use it to simulate CRUD operations on posts, comments, and user data.
- Free: Unlimited.
- Auth: None required.
5. Quandl API
- URL: Quandl.
- Use: For financial market trends and economic indicators analysis.
- Free: Some datasets require premium.
- Auth: API key.
6. GitHub API
- URL: GitHub API
- Use: Analyze open-source trends, collaborations, or stargazers data. You can use it from our verified sources repository.
- Free: 60 requests/hour unauthenticated, 5000 authenticated.
- Auth: OAuth or personal access token.
7. NASA API
- URL: NASA API.
- Use: Space-related data for projects on space exploration or earth science.
- Free: Rate-limited.
- Auth: API key.
8. The Movie Database (TMDb) API
- URL: TMDb API.
- Use: Movie and TV data for entertainment industry trend analysis.
- Free: Requires attribution.
- Auth: API key.
9. CoinGecko API
- URL: CoinGecko API.
- Use: Cryptocurrency data for market trend analysis or predictive modeling.
- Free: Rate-limited.
- Auth: None.
10. Public APIs GitHub list
- URL: Public APIs list.
- Use: Discover APIs for various projects. A meta-resource.
- Free: Varies by API.
- Auth: Depends on API.
Each API offers unique insights for data engineering, from ingestion to visualization. Check each API's documentation for up-to-date details on limitations and authentication.
Example projects
Here are some examples from dlt users and working students:
- A pipeline that pulls data from an API and produces a dashboard in the dbt blog.
- A streaming pipeline on GCP that replaces expensive tools such as Segment/5tran with a setup 50-100x cheaper.
- Another streaming pipeline on AWS for a slightly different use case.
- Orchestrator + email + AI + Slack to summarize emails.
- Evaluate a frontend tool to show your ability to deliver end-to-end.
- An end-to-end data lineage implementation from extraction to dashboard.
- A bird pipeline and the associated schema management that ensures smooth operation Part 1, Part 2.
- Japanese language demos Notion calendar and exploring csv to bigquery with dlt.
- Demos with Dagster and Prefect.
Showcase your project
If you want your project to be featured, let us know in the #sharing-and-contributing channel of our community Slack.