== Parsed Logical Plan == GlobalLimit 11 +- LocalLimit 11 +- Project [cast(tpep_pickup_datetime#40 as string) AS tpep_pickup_datetime#182, cast(tpep_dropoff_datetime#41 as string) AS tpep_dropoff_datetime#183, cast(passenger_count#42 as string) AS passenger_count#184, cast(trip_distance#43 as string) AS trip_distance#185, cast(PULocationID#44 as string) AS PULocationID#186, cast(DOLocationID#45 as string) AS DOLocationID#187, cast(payment_type#46 as string) AS payment_type#188, cast(fare_amount#47 as string) AS fare_amount#189, cast(extra#48 as string) AS extra#190, cast(mta_tax#49 as string) AS mta_tax#191, cast(tip_amount#50 as string) AS tip_amount#192, cast(tolls_amount#51 as string) AS tolls_amount#193, cast(total_amount#52 as string) AS total_amount#194, cast(congestion_surcharge#53 as string) AS congestion_surcharge#195, cast(airport_fee#54 as string) AS airport_fee#196, cast(taxi_type#55 as string) AS taxi_type#197] +- Relation[tpep_pickup_datetime#40,tpep_dropoff_datetime#41,passenger_count#42,trip_distance#43,PULocationID#44,DOLocationID#45,payment_type#46,fare_amount#47,extra#48,mta_tax#49,tip_amount#50,tolls_amount#51,total_amount#52,congestion_surcharge#53,airport_fee#54,taxi_type#55] csv == Analyzed Logical Plan == tpep_pickup_datetime: string, tpep_dropoff_datetime: string, passenger_count: string, trip_distance: string, PULocationID: string, DOLocationID: string, payment_type: string, fare_amount: string, extra: string, mta_tax: string, tip_amount: string, tolls_amount: string, total_amount: string, congestion_surcharge: string, airport_fee: string, taxi_type: string GlobalLimit 11 +- LocalLimit 11 +- Project [cast(tpep_pickup_datetime#40 as string) AS tpep_pickup_datetime#182, cast(tpep_dropoff_datetime#41 as string) AS tpep_dropoff_datetime#183, cast(passenger_count#42 as string) AS passenger_count#184, cast(trip_distance#43 as string) AS trip_distance#185, cast(PULocationID#44 as string) AS PULocationID#186, cast(DOLocationID#45 as string) AS DOLocationID#187, cast(payment_type#46 as string) AS payment_type#188, cast(fare_amount#47 as string) AS fare_amount#189, cast(extra#48 as string) AS extra#190, cast(mta_tax#49 as string) AS mta_tax#191, cast(tip_amount#50 as string) AS tip_amount#192, cast(tolls_amount#51 as string) AS tolls_amount#193, cast(total_amount#52 as string) AS total_amount#194, cast(congestion_surcharge#53 as string) AS congestion_surcharge#195, cast(airport_fee#54 as string) AS airport_fee#196, cast(taxi_type#55 as string) AS taxi_type#197] +- Relation[tpep_pickup_datetime#40,tpep_dropoff_datetime#41,passenger_count#42,trip_distance#43,PULocationID#44,DOLocationID#45,payment_type#46,fare_amount#47,extra#48,mta_tax#49,tip_amount#50,tolls_amount#51,total_amount#52,congestion_surcharge#53,airport_fee#54,taxi_type#55] csv == Optimized Logical Plan == GlobalLimit 11 +- LocalLimit 11 +- Relation[tpep_pickup_datetime#40,tpep_dropoff_datetime#41,passenger_count#42,trip_distance#43,PULocationID#44,DOLocationID#45,payment_type#46,fare_amount#47,extra#48,mta_tax#49,tip_amount#50,tolls_amount#51,total_amount#52,congestion_surcharge#53,airport_fee#54,taxi_type#55] csv == Physical Plan == CollectLimit 11 +- FileScan csv [tpep_pickup_datetime#40,tpep_dropoff_datetime#41,passenger_count#42,trip_distance#43,PULocationID#44,DOLocationID#45,payment_type#46,fare_amount#47,extra#48,mta_tax#49,tip_amount#50,tolls_amount#51,total_amount#52,congestion_surcharge#53,airport_fee#54,taxi_type#55] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex[s3a://data-repository-bkt/ECS765/nyc_taxi/yellow_tripdata/2023/yellow_tripdata_..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<tpep_pickup_datetime:string,tpep_dropoff_datetime:string,passenger_count:string,trip_dista...