== Parsed Logical Plan ==
GlobalLimit 6
+- LocalLimit 6
   +- Project [cast(src#85 as string) AS src#112, cast(dst#86 as string) AS dst#113]
      +- Project [PULocationID#18 AS src#85, DOLocationID#19 AS dst#86]
         +- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv
== Analyzed Logical Plan ==
src: string, dst: string
GlobalLimit 6
+- LocalLimit 6
   +- Project [cast(src#85 as string) AS src#112, cast(dst#86 as string) AS dst#113]
      +- Project [PULocationID#18 AS src#85, DOLocationID#19 AS dst#86]
         +- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv
== Optimized Logical Plan ==
GlobalLimit 6
+- LocalLimit 6
   +- Project [cast(PULocationID#18 as string) AS src#112, cast(DOLocationID#19 as string) AS dst#113]
      +- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv
== Physical Plan ==
CollectLimit 6
+- *(1) Project [cast(PULocationID#18 as string) AS src#112, cast(DOLocationID#19 as string) AS dst#113]
   +- FileScan csv [PULocationID#18,DOLocationID#19] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex[s3a://data-repository-bkt/ECS765/nyc_taxi/green_tripdata/2023/green_tripdata_20..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<PULocationID:int,DOLocationID:int>