Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 10683 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 197 |
| Duplicate rows (%) | 1.8% |
| Total size in memory | 3.1 MiB |
| Average record size in memory | 309.0 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 7 |
| DateTime | 1 |
| Dataset has 197 (1.8%) duplicate rows | Duplicates |
Dep_Hour is highly overall correlated with Is_Peak_Hour | High correlation |
Destination is highly overall correlated with Source | High correlation |
Duration is highly overall correlated with Price and 1 other fields | High correlation |
Is_Peak_Hour is highly overall correlated with Dep_Hour | High correlation |
Price is highly overall correlated with Duration | High correlation |
Source is highly overall correlated with Destination | High correlation |
Total_Stops is highly overall correlated with Duration | High correlation |
Dep_Min has 2062 (19.3%) zeros | Zeros |
Arrival_Hour has 322 (3.0%) zeros | Zeros |
Arrival_Min has 1447 (13.5%) zeros | Zeros |
Reproduction
| Analysis started | 2026-03-27 16:48:10.401464 |
|---|---|
| Analysis finished | 2026-03-27 16:48:18.090411 |
| Duration | 7.69 seconds |
| Software version | ydata-profiling vv4.18.1 |
| Download configuration | config.json |
Variables
Airline
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 613.7 KiB |
| Jet Airways | |
|---|---|
| IndiGo | |
| Air India | |
| Multiple carriers | |
| SpiceJet | |
| Other values (7) |
Length
| Max length | 33 |
|---|---|
| Median length | 23 |
| Mean length | 9.8099785 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | IndiGo |
|---|---|
| 2nd row | Air India |
| 3rd row | Jet Airways |
| 4th row | IndiGo |
| 5th row | IndiGo |
Common Values
| Value | Count | Frequency (%) |
| Jet Airways | 3849 | |
| IndiGo | 2053 | |
| Air India | 1752 | |
| Multiple carriers | 1196 | 11.2% |
| SpiceJet | 818 | 7.7% |
| Vistara | 479 | 4.5% |
| Air Asia | 319 | 3.0% |
| GoAir | 194 | 1.8% |
| Multiple carriers Premium economy | 13 | 0.1% |
| Jet Airways Business | 6 | 0.1% |
| Other values (2) | 4 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| jet | 3855 | |
| airways | 3855 | |
| air | 2071 | |
| indigo | 2053 | |
| india | 1752 | |
| multiple | 1209 | 6.8% |
| carriers | 1209 | 6.8% |
| spicejet | 818 | 4.6% |
| vistara | 482 | 2.7% |
| asia | 319 | 1.8% |
| Other values (5) | 233 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 13984 | |
| r | 10246 | 9.8% |
| a | 8099 | 7.7% |
| e | 7948 | 7.6% |
| 7173 | 6.8% | |
| A | 6439 | 6.1% |
| t | 6365 | 6.1% |
| s | 5883 | 5.6% |
| J | 4673 | 4.5% |
| y | 3871 | 3.7% |
| Other values (18) | 30119 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 104800 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 13984 | |
| r | 10246 | 9.8% |
| a | 8099 | 7.7% |
| e | 7948 | 7.6% |
| 7173 | 6.8% | |
| A | 6439 | 6.1% |
| t | 6365 | 6.1% |
| s | 5883 | 5.6% |
| J | 4673 | 4.5% |
| y | 3871 | 3.7% |
| Other values (18) | 30119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 104800 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 13984 | |
| r | 10246 | 9.8% |
| a | 8099 | 7.7% |
| e | 7948 | 7.6% |
| 7173 | 6.8% | |
| A | 6439 | 6.1% |
| t | 6365 | 6.1% |
| s | 5883 | 5.6% |
| J | 4673 | 4.5% |
| y | 3871 | 3.7% |
| Other values (18) | 30119 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 104800 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 13984 | |
| r | 10246 | 9.8% |
| a | 8099 | 7.7% |
| e | 7948 | 7.6% |
| 7173 | 6.8% | |
| A | 6439 | 6.1% |
| t | 6365 | 6.1% |
| s | 5883 | 5.6% |
| J | 4673 | 4.5% |
| y | 3871 | 3.7% |
| Other values (18) | 30119 |
Source
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 577.0 KiB |
| Delhi | |
|---|---|
| Kolkata | |
| Banglore | |
| Mumbai | |
| Chennai | 381 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.2910231 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Banglore |
|---|---|
| 2nd row | Kolkata |
| 3rd row | Delhi |
| 4th row | Kolkata |
| 5th row | Banglore |
Common Values
| Value | Count | Frequency (%) |
| Delhi | 4537 | |
| Kolkata | 2871 | |
| Banglore | 2197 | |
| Mumbai | 697 | 6.5% |
| Chennai | 381 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| delhi | 4537 | |
| kolkata | 2871 | |
| banglore | 2197 | |
| mumbai | 697 | 6.5% |
| chennai | 381 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 9605 | |
| a | 9017 | |
| e | 7115 | |
| i | 5615 | |
| o | 5068 | |
| h | 4918 | 7.3% |
| D | 4537 | 6.8% |
| n | 2959 | 4.4% |
| K | 2871 | 4.3% |
| t | 2871 | 4.3% |
| Other values (9) | 12631 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 67207 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 9605 | |
| a | 9017 | |
| e | 7115 | |
| i | 5615 | |
| o | 5068 | |
| h | 4918 | 7.3% |
| D | 4537 | 6.8% |
| n | 2959 | 4.4% |
| K | 2871 | 4.3% |
| t | 2871 | 4.3% |
| Other values (9) | 12631 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 67207 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 9605 | |
| a | 9017 | |
| e | 7115 | |
| i | 5615 | |
| o | 5068 | |
| h | 4918 | 7.3% |
| D | 4537 | 6.8% |
| n | 2959 | 4.4% |
| K | 2871 | 4.3% |
| t | 2871 | 4.3% |
| Other values (9) | 12631 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 67207 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 9605 | |
| a | 9017 | |
| e | 7115 | |
| i | 5615 | |
| o | 5068 | |
| h | 4918 | 7.3% |
| D | 4537 | 6.8% |
| n | 2959 | 4.4% |
| K | 2871 | 4.3% |
| t | 2871 | 4.3% |
| Other values (9) | 12631 |
Destination
Categorical
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 583.4 KiB |
| Cochin | |
|---|---|
| Banglore | |
| Delhi | |
| New Delhi | |
| Hyderabad |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.9121969 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New Delhi |
|---|---|
| 2nd row | Banglore |
| 3rd row | Cochin |
| 4th row | Banglore |
| 5th row | New Delhi |
Common Values
| Value | Count | Frequency (%) |
| Cochin | 4537 | |
| Banglore | 2871 | |
| Delhi | 1265 | 11.8% |
| New Delhi | 932 | 8.7% |
| Hyderabad | 697 | 6.5% |
| Kolkata | 381 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cochin | 4537 | |
| banglore | 2871 | |
| delhi | 2197 | |
| new | 932 | 8.0% |
| hyderabad | 697 | 6.0% |
| kolkata | 381 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7789 | |
| n | 7408 | |
| h | 6734 | |
| i | 6734 | |
| e | 6697 | |
| l | 5449 | 7.4% |
| a | 5027 | 6.8% |
| C | 4537 | 6.1% |
| c | 4537 | 6.1% |
| r | 3568 | 4.8% |
| Other values (13) | 15363 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 73843 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 7789 | |
| n | 7408 | |
| h | 6734 | |
| i | 6734 | |
| e | 6697 | |
| l | 5449 | 7.4% |
| a | 5027 | 6.8% |
| C | 4537 | 6.1% |
| c | 4537 | 6.1% |
| r | 3568 | 4.8% |
| Other values (13) | 15363 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 73843 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 7789 | |
| n | 7408 | |
| h | 6734 | |
| i | 6734 | |
| e | 6697 | |
| l | 5449 | 7.4% |
| a | 5027 | 6.8% |
| C | 4537 | 6.1% |
| c | 4537 | 6.1% |
| r | 3568 | 4.8% |
| Other values (13) | 15363 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 73843 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 7789 | |
| n | 7408 | |
| h | 6734 | |
| i | 6734 | |
| e | 6697 | |
| l | 5449 | 7.4% |
| a | 5027 | 6.8% |
| C | 4537 | 6.1% |
| c | 4537 | 6.1% |
| r | 3568 | 4.8% |
| Other values (13) | 15363 |
Duration
Real number (ℝ)
High correlation
| Distinct | 368 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 643.09323 |
| Minimum | 5 |
|---|---|
| Maximum | 2860 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 90 |
| Q1 | 170 |
| median | 520 |
| Q3 | 930 |
| 95-th percentile | 1615 |
| Maximum | 2860 |
| Range | 2855 |
| Interquartile range (IQR) | 760 |
Descriptive statistics
| Standard deviation | 507.862 |
|---|---|
| Coefficient of variation (CV) | 0.78971753 |
| Kurtosis | -0.16729132 |
| Mean | 643.09323 |
| Median Absolute Deviation (MAD) | 350 |
| Skewness | 0.86107405 |
| Sum | 6870165 |
| Variance | 257923.81 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 170 | 550 | 5.1% |
| 90 | 386 | 3.6% |
| 175 | 337 | 3.2% |
| 165 | 337 | 3.2% |
| 155 | 329 | 3.1% |
| 180 | 261 | 2.4% |
| 140 | 238 | 2.2% |
| 150 | 220 | 2.1% |
| 160 | 158 | 1.5% |
| 135 | 135 | 1.3% |
| Other values (358) | 7732 |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 75 | 24 | 0.2% |
| 80 | 61 | 0.6% |
| 85 | 135 | 1.3% |
| 90 | 386 | |
| 95 | 15 | 0.1% |
| 135 | 135 | 1.3% |
| 140 | 238 | |
| 145 | 98 | 0.9% |
| 150 | 220 |
| Value | Count | Frequency (%) |
| 2860 | 1 | < 0.1% |
| 2820 | 1 | < 0.1% |
| 2565 | 1 | < 0.1% |
| 2525 | 1 | < 0.1% |
| 2480 | 1 | < 0.1% |
| 2420 | 1 | < 0.1% |
| 2345 | 2 | < 0.1% |
| 2315 | 4 | < 0.1% |
| 2300 | 5 | |
| 2295 | 12 |
Total_Stops
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 521.8 KiB |
| 1 | |
|---|---|
| 0 | |
| 2 | |
| 3 | 45 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5626 | |
| 0 | 3491 | |
| 2 | 1520 | 14.2% |
| 3 | 45 | 0.4% |
| 4 | 1 | < 0.1% |
Journey_Day
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.508378 |
| Minimum | 1 |
|---|---|
| Maximum | 27 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 12 |
| Q3 | 21 |
| 95-th percentile | 27 |
| Maximum | 27 |
| Range | 26 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.4792775 |
|---|---|
| Coefficient of variation (CV) | 0.62770509 |
| Kurtosis | -1.2728424 |
| Mean | 13.508378 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.11835054 |
| Sum | 144310 |
| Variance | 71.898146 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 1406 | |
| 6 | 1288 | |
| 27 | 1130 | |
| 21 | 1111 | |
| 1 | 1075 | |
| 24 | 1052 | |
| 15 | 984 | |
| 12 | 957 | |
| 3 | 848 | |
| 18 | 832 |
| Value | Count | Frequency (%) |
| 1 | 1075 | |
| 3 | 848 | |
| 6 | 1288 | |
| 9 | 1406 | |
| 12 | 957 | |
| 15 | 984 | |
| 18 | 832 | |
| 21 | 1111 | |
| 24 | 1052 | |
| 27 | 1130 |
| Value | Count | Frequency (%) |
| 27 | 1130 | |
| 24 | 1052 | |
| 21 | 1111 | |
| 18 | 832 | |
| 15 | 984 | |
| 12 | 957 | |
| 9 | 1406 | |
| 6 | 1288 | |
| 3 | 848 | |
| 1 | 1075 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 5 |
| 3rd row | 6 |
| 4th row | 5 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 3466 | |
| 6 | 3414 | |
| 3 | 2724 | |
| 4 | 1079 | 10.1% |
Dep_Hour
Real number (ℝ)
High correlation
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.490686 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 40 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 11 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.7486501 |
|---|---|
| Coefficient of variation (CV) | 0.46023494 |
| Kurtosis | -1.1948465 |
| Mean | 12.490686 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.11307279 |
| Sum | 133438 |
| Variance | 33.046978 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 916 | 8.6% |
| 7 | 867 | 8.1% |
| 8 | 697 | 6.5% |
| 17 | 695 | 6.5% |
| 6 | 687 | 6.4% |
| 20 | 651 | 6.1% |
| 5 | 629 | 5.9% |
| 11 | 580 | 5.4% |
| 19 | 567 | 5.3% |
| 10 | 536 | 5.0% |
| Other values (14) | 3858 |
| Value | Count | Frequency (%) |
| 0 | 40 | 0.4% |
| 1 | 37 | 0.3% |
| 2 | 194 | 1.8% |
| 3 | 24 | 0.2% |
| 4 | 170 | 1.6% |
| 5 | 629 | |
| 6 | 687 | |
| 7 | 867 | |
| 8 | 697 | |
| 9 | 916 |
| Value | Count | Frequency (%) |
| 23 | 161 | 1.5% |
| 22 | 387 | |
| 21 | 492 | |
| 20 | 651 | |
| 19 | 567 | |
| 18 | 444 | |
| 17 | 695 | |
| 16 | 472 | |
| 15 | 319 | |
| 14 | 523 |
Dep_Min
Real number (ℝ)
Zeros
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.411214 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 2062 |
| Zeros (%) | 19.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 25 |
| Q3 | 40 |
| 95-th percentile | 55 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 18.76798 |
|---|---|
| Coefficient of variation (CV) | 0.76882617 |
| Kurtosis | -1.2928236 |
| Mean | 24.411214 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.16702906 |
| Sum | 260785 |
| Variance | 352.23708 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 30 | 1215 | |
| 55 | 1058 | |
| 10 | 890 | |
| 45 | 876 | |
| 5 | 773 | 7.2% |
| 15 | 692 | 6.5% |
| 25 | 691 | 6.5% |
| 20 | 666 | 6.2% |
| 35 | 665 | 6.2% |
| Other values (2) | 1095 |
| Value | Count | Frequency (%) |
| 0 | 2062 | |
| 5 | 773 | 7.2% |
| 10 | 890 | |
| 15 | 692 | 6.5% |
| 20 | 666 | 6.2% |
| 25 | 691 | 6.5% |
| 30 | 1215 | |
| 35 | 665 | 6.2% |
| 40 | 504 | 4.7% |
| 45 | 876 |
| Value | Count | Frequency (%) |
| 55 | 1058 | |
| 50 | 591 | |
| 45 | 876 | |
| 40 | 504 | |
| 35 | 665 | |
| 30 | 1215 | |
| 25 | 691 | |
| 20 | 666 | |
| 15 | 692 | |
| 10 | 890 |
Arrival_Hour
Real number (ℝ)
Zeros
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.348778 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 322 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 8 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.8591252 |
|---|---|
| Coefficient of variation (CV) | 0.51383917 |
| Kurtosis | -1.0752021 |
| Mean | 13.348778 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.36998825 |
| Sum | 142605 |
| Variance | 47.047599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 1626 | |
| 12 | 897 | 8.4% |
| 4 | 838 | 7.8% |
| 21 | 703 | 6.6% |
| 22 | 647 | 6.1% |
| 1 | 529 | 5.0% |
| 18 | 514 | 4.8% |
| 9 | 490 | 4.6% |
| 23 | 485 | 4.5% |
| 10 | 476 | 4.5% |
| Other values (14) | 3478 |
| Value | Count | Frequency (%) |
| 0 | 322 | 3.0% |
| 1 | 529 | |
| 2 | 79 | 0.7% |
| 3 | 47 | 0.4% |
| 4 | 838 | |
| 5 | 69 | 0.6% |
| 6 | 52 | 0.5% |
| 7 | 417 | |
| 8 | 471 | |
| 9 | 490 |
| Value | Count | Frequency (%) |
| 23 | 485 | 4.5% |
| 22 | 647 | 6.1% |
| 21 | 703 | |
| 20 | 377 | 3.5% |
| 19 | 1626 | |
| 18 | 514 | 4.8% |
| 17 | 191 | 1.8% |
| 16 | 370 | 3.5% |
| 15 | 182 | 1.7% |
| 14 | 295 | 2.8% |
Arrival_Min
Real number (ℝ)
Zeros
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.69063 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 1447 |
| Zeros (%) | 13.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 10 |
| median | 25 |
| Q3 | 35 |
| 95-th percentile | 50 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 16.506036 |
|---|---|
| Coefficient of variation (CV) | 0.66851416 |
| Kurtosis | -1.0281949 |
| Mean | 24.69063 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.11094486 |
| Sum | 263770 |
| Variance | 272.44922 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1447 | |
| 25 | 1302 | |
| 15 | 1286 | |
| 35 | 1111 | |
| 20 | 902 | |
| 30 | 832 | |
| 50 | 750 | |
| 45 | 697 | |
| 5 | 660 | |
| 40 | 629 | |
| Other values (2) | 1067 |
| Value | Count | Frequency (%) |
| 0 | 1447 | |
| 5 | 660 | |
| 10 | 577 | 5.4% |
| 15 | 1286 | |
| 20 | 902 | |
| 25 | 1302 | |
| 30 | 832 | |
| 35 | 1111 | |
| 40 | 629 | |
| 45 | 697 |
| Value | Count | Frequency (%) |
| 55 | 490 | 4.6% |
| 50 | 750 | |
| 45 | 697 | |
| 40 | 629 | |
| 35 | 1111 | |
| 30 | 832 | |
| 25 | 1302 | |
| 20 | 902 | |
| 15 | 1286 | |
| 10 | 577 |
Journey_Date
Date
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 83.6 KiB |
| Minimum | 2019-01-03 00:00:00 |
|---|---|
| Maximum | 2019-12-06 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Is_Peak_Hour
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 521.8 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10683 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6790 | |
| 0 | 3893 |
Price
Real number (ℝ)
High correlation
| Distinct | 1870 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9087.0641 |
| Minimum | 1759 |
|---|---|
| Maximum | 79512 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.6 KiB |
Quantile statistics
| Minimum | 1759 |
|---|---|
| 5-th percentile | 3543 |
| Q1 | 5277 |
| median | 8372 |
| Q3 | 12373 |
| 95-th percentile | 15764 |
| Maximum | 79512 |
| Range | 77753 |
| Interquartile range (IQR) | 7096 |
Descriptive statistics
| Standard deviation | 4611.3592 |
|---|---|
| Coefficient of variation (CV) | 0.50746414 |
| Kurtosis | 13.30333 |
| Mean | 9087.0641 |
| Median Absolute Deviation (MAD) | 3382 |
| Skewness | 1.8125524 |
| Sum | 97077106 |
| Variance | 21264633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10262 | 258 | 2.4% |
| 10844 | 212 | 2.0% |
| 7229 | 162 | 1.5% |
| 4804 | 160 | 1.5% |
| 4823 | 131 | 1.2% |
| 14714 | 109 | 1.0% |
| 3943 | 104 | 1.0% |
| 15129 | 93 | 0.9% |
| 3841 | 91 | 0.9% |
| 12898 | 86 | 0.8% |
| Other values (1860) | 9277 |
| Value | Count | Frequency (%) |
| 1759 | 4 | < 0.1% |
| 1840 | 1 | < 0.1% |
| 1965 | 36 | |
| 2017 | 35 | |
| 2050 | 10 | 0.1% |
| 2071 | 6 | 0.1% |
| 2175 | 7 | 0.1% |
| 2227 | 40 | |
| 2228 | 9 | 0.1% |
| 2385 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 79512 | 1 | < 0.1% |
| 62427 | 1 | < 0.1% |
| 57209 | 1 | < 0.1% |
| 54826 | 3 | |
| 52285 | 1 | < 0.1% |
| 52229 | 1 | < 0.1% |
| 46490 | 1 | < 0.1% |
| 36983 | 1 | < 0.1% |
| 36235 | 2 | |
| 35185 | 1 | < 0.1% |
Interactions
Correlations
| Airline | Arrival_Hour | Arrival_Min | Dep_Hour | Dep_Min | Destination | Duration | Is_Peak_Hour | Journey_Day | Journey_Month | Price | Source | Total_Stops | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Airline | 1.000 | 0.201 | 0.192 | 0.152 | 0.155 | 0.256 | 0.249 | 0.131 | 0.064 | 0.123 | 0.378 | 0.276 | 0.334 |
| Arrival_Hour | 0.201 | 1.000 | -0.172 | 0.055 | 0.046 | 0.184 | 0.053 | 0.272 | -0.003 | 0.106 | 0.040 | 0.205 | 0.165 |
| Arrival_Min | 0.192 | -0.172 | 1.000 | 0.064 | -0.018 | 0.199 | -0.112 | 0.168 | -0.017 | 0.126 | -0.104 | 0.212 | 0.179 |
| Dep_Hour | 0.152 | 0.055 | 0.064 | 1.000 | -0.033 | 0.152 | -0.012 | 0.813 | 0.002 | 0.055 | 0.008 | 0.164 | 0.121 |
| Dep_Min | 0.155 | 0.046 | -0.018 | -0.033 | 1.000 | 0.186 | -0.037 | 0.133 | -0.007 | 0.083 | -0.062 | 0.190 | 0.134 |
| Destination | 0.256 | 0.184 | 0.199 | 0.152 | 0.186 | 1.000 | 0.334 | 0.140 | 0.129 | 0.384 | 0.227 | 1.000 | 0.383 |
| Duration | 0.249 | 0.053 | -0.112 | -0.012 | -0.037 | 0.334 | 1.000 | 0.172 | -0.024 | 0.145 | 0.692 | 0.340 | 0.549 |
| Is_Peak_Hour | 0.131 | 0.272 | 0.168 | 0.813 | 0.133 | 0.140 | 0.172 | 1.000 | 0.018 | 0.090 | 0.076 | 0.134 | 0.022 |
| Journey_Day | 0.064 | -0.003 | -0.017 | 0.002 | -0.007 | 0.129 | -0.024 | 0.018 | 1.000 | 0.196 | -0.122 | 0.139 | 0.060 |
| Journey_Month | 0.123 | 0.106 | 0.126 | 0.055 | 0.083 | 0.384 | 0.145 | 0.090 | 0.196 | 1.000 | 0.187 | 0.228 | 0.136 |
| Price | 0.378 | 0.040 | -0.104 | 0.008 | -0.062 | 0.227 | 0.692 | 0.076 | -0.122 | 0.187 | 1.000 | 0.202 | 0.309 |
| Source | 0.276 | 0.205 | 0.212 | 0.164 | 0.190 | 1.000 | 0.340 | 0.134 | 0.139 | 0.228 | 0.202 | 1.000 | 0.345 |
| Total_Stops | 0.334 | 0.165 | 0.179 | 0.121 | 0.134 | 0.383 | 0.549 | 0.022 | 0.060 | 0.136 | 0.309 | 0.345 | 1.000 |
Missing values
Sample
| Airline | Source | Destination | Duration | Total_Stops | Journey_Day | Journey_Month | Dep_Hour | Dep_Min | Arrival_Hour | Arrival_Min | Journey_Date | Is_Peak_Hour | Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | IndiGo | Banglore | New Delhi | 170 | 0 | 24 | 3 | 22 | 20 | 1 | 10 | 24-03-2019 | 0 | 3897 |
| 1 | Air India | Kolkata | Banglore | 445 | 2 | 1 | 5 | 5 | 50 | 13 | 15 | 01-05-2019 | 0 | 7662 |
| 2 | Jet Airways | Delhi | Cochin | 1140 | 2 | 9 | 6 | 9 | 25 | 4 | 25 | 09-06-2019 | 1 | 13882 |
| 3 | IndiGo | Kolkata | Banglore | 325 | 1 | 12 | 5 | 18 | 5 | 23 | 30 | 12-05-2019 | 1 | 6218 |
| 4 | IndiGo | Banglore | New Delhi | 285 | 1 | 1 | 3 | 16 | 50 | 21 | 35 | 01-03-2019 | 1 | 13302 |
| 5 | SpiceJet | Kolkata | Banglore | 145 | 0 | 24 | 6 | 9 | 0 | 11 | 25 | 24-06-2019 | 1 | 3873 |
| 6 | Jet Airways | Banglore | New Delhi | 930 | 1 | 12 | 3 | 18 | 55 | 10 | 25 | 12-03-2019 | 1 | 11087 |
| 7 | Jet Airways | Banglore | New Delhi | 1265 | 1 | 1 | 3 | 8 | 0 | 5 | 5 | 01-03-2019 | 0 | 22270 |
| 8 | Jet Airways | Banglore | New Delhi | 1530 | 1 | 12 | 3 | 8 | 55 | 10 | 25 | 12-03-2019 | 0 | 11087 |
| 9 | Multiple carriers | Delhi | Cochin | 470 | 1 | 27 | 5 | 11 | 25 | 19 | 15 | 27-05-2019 | 1 | 8625 |
| Airline | Source | Destination | Duration | Total_Stops | Journey_Day | Journey_Month | Dep_Hour | Dep_Min | Arrival_Hour | Arrival_Min | Journey_Date | Is_Peak_Hour | Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10673 | Jet Airways | Delhi | Cochin | 900 | 2 | 27 | 5 | 13 | 25 | 4 | 25 | 27-05-2019 | 1 | 16704 |
| 10674 | Jet Airways | Banglore | New Delhi | 1485 | 1 | 12 | 3 | 20 | 35 | 21 | 20 | 12-03-2019 | 1 | 11087 |
| 10675 | Air India | Mumbai | Hyderabad | 80 | 0 | 9 | 6 | 6 | 20 | 7 | 40 | 09-06-2019 | 0 | 3100 |
| 10676 | Multiple carriers | Delhi | Cochin | 520 | 1 | 1 | 5 | 10 | 20 | 19 | 0 | 01-05-2019 | 1 | 9794 |
| 10677 | SpiceJet | Banglore | Delhi | 160 | 0 | 21 | 5 | 5 | 55 | 8 | 35 | 21-05-2019 | 0 | 3257 |
| 10678 | Air Asia | Kolkata | Banglore | 150 | 0 | 9 | 4 | 19 | 55 | 22 | 25 | 09-04-2019 | 1 | 4107 |
| 10679 | Air India | Kolkata | Banglore | 155 | 0 | 27 | 4 | 20 | 45 | 23 | 20 | 27-04-2019 | 1 | 4145 |
| 10680 | Jet Airways | Banglore | Delhi | 180 | 0 | 27 | 4 | 8 | 20 | 11 | 20 | 27-04-2019 | 0 | 7229 |
| 10681 | Vistara | Banglore | New Delhi | 160 | 0 | 1 | 3 | 11 | 30 | 14 | 10 | 01-03-2019 | 1 | 12648 |
| 10682 | Air India | Delhi | Cochin | 500 | 2 | 9 | 5 | 10 | 55 | 19 | 15 | 09-05-2019 | 1 | 11753 |
Duplicate rows
Most frequently occurring
| Airline | Source | Destination | Duration | Total_Stops | Journey_Day | Journey_Month | Dep_Hour | Dep_Min | Arrival_Hour | Arrival_Min | Journey_Date | Is_Peak_Hour | Price | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6 | Air India | Delhi | Cochin | 1275 | 2 | 9 | 5 | 22 | 0 | 19 | 15 | 09-05-2019 | 0 | 10441 | 3 |
| 8 | Air India | Delhi | Cochin | 1275 | 2 | 15 | 5 | 22 | 0 | 19 | 15 | 15-05-2019 | 0 | 11281 | 3 |
| 10 | Air India | Delhi | Cochin | 1275 | 2 | 21 | 5 | 22 | 0 | 19 | 15 | 21-05-2019 | 0 | 10231 | 3 |
| 11 | Air India | Delhi | Cochin | 1275 | 2 | 24 | 6 | 22 | 0 | 19 | 15 | 24-06-2019 | 0 | 9181 | 3 |
| 29 | Air India | Delhi | Cochin | 1560 | 2 | 18 | 5 | 17 | 15 | 19 | 15 | 18-05-2019 | 1 | 12392 | 3 |
| 38 | Air India | Delhi | Cochin | 2170 | 2 | 6 | 3 | 7 | 5 | 19 | 15 | 06-03-2019 | 0 | 11552 | 3 |
| 49 | Air India | Kolkata | Banglore | 1665 | 2 | 1 | 5 | 10 | 0 | 13 | 45 | 01-05-2019 | 1 | 15164 | 3 |
| 102 | Jet Airways | Delhi | Cochin | 1195 | 2 | 9 | 5 | 23 | 5 | 19 | 0 | 09-05-2019 | 0 | 15129 | 3 |
| 104 | Jet Airways | Delhi | Cochin | 1195 | 2 | 24 | 6 | 23 | 5 | 19 | 0 | 24-06-2019 | 0 | 12819 | 3 |
| 106 | Jet Airways | Delhi | Cochin | 1195 | 2 | 27 | 6 | 23 | 5 | 19 | 0 | 27-06-2019 | 0 | 11150 | 3 |