graphframes - Details for Query 1

Details for Query 1

Submitted Time: 2024/12/20 23:05:27
Duration: 1 s
Succeeded Jobs: 2

Show the Stage ID and Task ID that corresponds to the max metric

Details

== Parsed Logical Plan ==
Aggregate [count(1) AS count#68L]
+- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv

== Analyzed Logical Plan ==
count: bigint
Aggregate [count(1) AS count#68L]
+- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv

== Optimized Logical Plan ==
Aggregate [count(1) AS count#68L]
+- Project
   +- Relation[lpep_pickup_datetime#16,lpep_dropoff_datetime#17,PULocationID#18,DOLocationID#19,passenger_count#20,trip_distance#21,fare_amount#22,extra#23,mta_tax#24,tip_amount#25,tolls_amount#26,ehail_fee#27,total_amount#28,payment_type#29,trip_type#30,congestion_surcharge#31,taxi_type#32] csv

== Physical Plan ==
*(2) HashAggregate(keys=[], functions=[count(1)], output=[count#68L])
+- Exchange SinglePartition, true, [id=#34]
   +- *(1) HashAggregate(keys=[], functions=[partial_count(1)], output=[count#71L])
      +- FileScan csv [] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex[s3a://data-repository-bkt/ECS765/nyc_taxi/green_tripdata/2023/green_tripdata_20..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<>