Explored how local business density and types (e.g., cafes, bars, restaurants) correlate with bike availability at Lisbon’s CityBike stations. Integrated geospatial APIs, statistical modelling, and regression analysis.

This project investigates whether surrounding businesses influence bike availability at Lisbon’s CityBike stations. Using the CityBike and Foursquare APIs, I collected geospatial and venue-level data for over 9,600 locations near 195 stations. I then applied multivariate regression and statistical tests to evaluate predictors such as venue type, rating, and popularity.
Explore whether the types and density of nearby businesses (e.g., bars, cafés, restaurants) can help predict bike availability at Lisbon’s CityBike stations, and assess whether this information is useful for planning station capacity.
Despite weak correlations overall, the number of nearby bars and cafes showed minor, but statistically significant associations with bike availability. This analysis highlights both the potential and limitations of using public venue data to understand urban mobility patterns.
Within this dataset, nearby venue types and ratings are weak predictors of CityBike station availability. While specific categories such as bars and cafés show statistically significant associations, they do not provide enough explanatory power to reliably guide station capacity planning. Other factors, such as time of day, day of week, seasonality, commuter flows, and neighbourhood demographics, are likely more important drivers of demand.
For those interested, the GitHub repo includes Jupyter notebooks for EDA, prediction models, API calls and figure generation.