Analysis and prediction of New York City taxi and Uber demands
Taxi and Uber are imperative transportation modes in New York City (NYC). This paper investigates the spatiotemporal distribution of pick-ups of medallion taxis (yellow), Street Hail Livery Service taxis (green), and Uber services in NYC, within the five boroughs: Brooklyn,...
Main Author: | |
---|---|
Format: | ARTÍCULO |
Language: | es_ES |
Published: |
2024
|
Subjects: | |
Online Access: | http://dspace.ucuenca.edu.ec/handle/123456789/44187 https://www.scopus.com/record/display.uri?eid=2-s2.0-85179463056&origin=resultslist&sort=plf-f&src=s&sid=69d9922629b8a18f99e40932a847c0a6&sot=b&sdt=b&s=TITLE-ABS-KEY%28Analysis+and+prediction+of+New+York+City+taxi+and+Uber+demands%29&sl=77&sessionSearchId=69d9922629b8a18f99e40932a847c0a6&relpos=1 |
_version_ | 1793400624695476224 |
---|---|
author | Correa Barahona, Diego Estuardo |
author_facet | Correa Barahona, Diego Estuardo |
author_sort | Correa Barahona, Diego Estuardo |
collection | DSpace |
description | Taxi and Uber are imperative transportation modes in New York City (NYC). This paper investigates the spatiotemporal distribution of pick-ups of medallion taxis (yellow), Street Hail Livery Service taxis (green), and Uber services in NYC, within the five boroughs: Brooklyn, the Bronx, Manhattan, Queens, and Staten Island. Regression models and machine learning algorithms such as XGboost and random forest are used to predict the ridership of taxis and Uber dataset combined in NYC, given a time window of one-hour and locations within zip-code areas. The dataset consists of over 90 million trips within the period April-September 2014, yellow with 86% the most used in the city, followed by green with 9%, and Uber with 5%. In the outer boroughs, the number of pick-ups is 12.9 million (14%), while 77.9 million (86%) were made in Manhattan only. Yellow is the predominant option in Manhattan and Queens, while green is preferred in Brooklyn and Bronx. In Staten Island, the market is shared between the three services. However, Uber presents a highly rising trend of 81% in Manhattan and 145% in outer boroughs during the analysis period. The regression model XGboost performed best because of its exceptional capacity to catch complex feature dependencies. The XGboost model accomplished an estimation of 38.51 for RMSE and 0.97 for R^2. This model could present valuable insights to taxi companies, decision-makers, and city planners in responding to questions, e.g., how to situate taxis where they are required, understand how ridership shifts over time, and the total number of taxis needed to dispatch to meet de the demand |
format | ARTÍCULO |
id | oai:dspace.ucuenca.edu.ec:123456789-44187 |
institution | Universidad de Cuenca |
language | es_ES |
publishDate | 2024 |
record_format | dspace |
spelling | oai:dspace.ucuenca.edu.ec:123456789-441872024-03-08T16:20:20Z Analysis and prediction of New York City taxi and Uber demands Correa Barahona, Diego Estuardo New York City Machine learning algorithms GPS-enabled taxi data Taxi and Uber demand prediction Visual analytics Large scale data analysis Taxi and Uber are imperative transportation modes in New York City (NYC). This paper investigates the spatiotemporal distribution of pick-ups of medallion taxis (yellow), Street Hail Livery Service taxis (green), and Uber services in NYC, within the five boroughs: Brooklyn, the Bronx, Manhattan, Queens, and Staten Island. Regression models and machine learning algorithms such as XGboost and random forest are used to predict the ridership of taxis and Uber dataset combined in NYC, given a time window of one-hour and locations within zip-code areas. The dataset consists of over 90 million trips within the period April-September 2014, yellow with 86% the most used in the city, followed by green with 9%, and Uber with 5%. In the outer boroughs, the number of pick-ups is 12.9 million (14%), while 77.9 million (86%) were made in Manhattan only. Yellow is the predominant option in Manhattan and Queens, while green is preferred in Brooklyn and Bronx. In Staten Island, the market is shared between the three services. However, Uber presents a highly rising trend of 81% in Manhattan and 145% in outer boroughs during the analysis period. The regression model XGboost performed best because of its exceptional capacity to catch complex feature dependencies. The XGboost model accomplished an estimation of 38.51 for RMSE and 0.97 for R^2. This model could present valuable insights to taxi companies, decision-makers, and city planners in responding to questions, e.g., how to situate taxis where they are required, understand how ridership shifts over time, and the total number of taxis needed to dispatch to meet de the demand 2024-03-08T16:20:16Z 2024-03-08T16:20:16Z 2023 ARTÍCULO 1665-6423 http://dspace.ucuenca.edu.ec/handle/123456789/44187 https://www.scopus.com/record/display.uri?eid=2-s2.0-85179463056&origin=resultslist&sort=plf-f&src=s&sid=69d9922629b8a18f99e40932a847c0a6&sot=b&sdt=b&s=TITLE-ABS-KEY%28Analysis+and+prediction+of+New+York+City+taxi+and+Uber+demands%29&sl=77&sessionSearchId=69d9922629b8a18f99e40932a847c0a6&relpos=1 10.22201/icat.24486736e.2023.21.5.2074 es_ES application/pdf Journal of Applied Research and Technology |
spellingShingle | New York City Machine learning algorithms GPS-enabled taxi data Taxi and Uber demand prediction Visual analytics Large scale data analysis Correa Barahona, Diego Estuardo Analysis and prediction of New York City taxi and Uber demands |
title | Analysis and prediction of New York City taxi and Uber demands |
title_full | Analysis and prediction of New York City taxi and Uber demands |
title_fullStr | Analysis and prediction of New York City taxi and Uber demands |
title_full_unstemmed | Analysis and prediction of New York City taxi and Uber demands |
title_short | Analysis and prediction of New York City taxi and Uber demands |
title_sort | analysis and prediction of new york city taxi and uber demands |
topic | New York City Machine learning algorithms GPS-enabled taxi data Taxi and Uber demand prediction Visual analytics Large scale data analysis |
url | http://dspace.ucuenca.edu.ec/handle/123456789/44187 https://www.scopus.com/record/display.uri?eid=2-s2.0-85179463056&origin=resultslist&sort=plf-f&src=s&sid=69d9922629b8a18f99e40932a847c0a6&sot=b&sdt=b&s=TITLE-ABS-KEY%28Analysis+and+prediction+of+New+York+City+taxi+and+Uber+demands%29&sl=77&sessionSearchId=69d9922629b8a18f99e40932a847c0a6&relpos=1 |
work_keys_str_mv | AT correabarahonadiegoestuardo analysisandpredictionofnewyorkcitytaxianduberdemands |