System In this paper,we have three datasets with the same data model.The first dataset labeled as D1 having 1 year of airline data stored in it and the next two datasets labeled as D2,D3 containing 2 and 3 years of airline data respectively.Each year of airline data has approximately 70-80 lakh rows of data records. There are different perfomance factors that will be determine the result namely: 1.data set file size; 2.query statements; 3.Data replication factor; 4.HDFS block size; 5.query average time;