Overview

Dataset statistics

Number of variables16
Number of observations10000
Missing cells3377
Missing cells (%)2.1%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory1.4 MiB
Average record size in memory143.0 B

Variable types

Categorical5
Text4
Numeric7

Dataset

Description인천국제공항공사 항공운송실적 정보로 월별, 항공사별, 노선별 여객, 환승, 화물 운송 실적에 대한 세부 정보를 포함하고 있음
Author인천국제공항공사
URLhttps://www.data.go.kr/data/15062056/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
유임승객(명) is highly overall correlated with 무임승객(명) and 3 other fieldsHigh correlation
무임승객(명) is highly overall correlated with 유임승객(명) and 2 other fieldsHigh correlation
환승객(명) is highly overall correlated with 유임승객(명) and 2 other fieldsHigh correlation
항공화물(kg) is highly overall correlated with 환적화물(kg)High correlation
환적화물(kg) is highly overall correlated with 항공화물(kg)High correlation
수하물(kg) is highly overall correlated with 유임승객(명) and 2 other fieldsHigh correlation
여객_화물 is highly overall correlated with 유임승객(명)High correlation
국제_국내 is highly imbalanced (84.0%)Imbalance
항공사(ICAO) has 307 (3.1%) missing valuesMissing
항공사(IATA) has 307 (3.1%) missing valuesMissing
항공사명 has 307 (3.1%) missing valuesMissing
경유지공항 has 307 (3.1%) missing valuesMissing
유임승객(명) has 307 (3.1%) missing valuesMissing
무임승객(명) has 307 (3.1%) missing valuesMissing
환승객(명) has 307 (3.1%) missing valuesMissing
항공화물(kg) has 307 (3.1%) missing valuesMissing
환적화물(kg) has 307 (3.1%) missing valuesMissing
우편물(kg) has 307 (3.1%) missing valuesMissing
수하물(kg) has 307 (3.1%) missing valuesMissing
유임승객(명) has 3029 (30.3%) zerosZeros
무임승객(명) has 3519 (35.2%) zerosZeros
환승객(명) has 4102 (41.0%) zerosZeros
항공화물(kg) has 2114 (21.1%) zerosZeros
환적화물(kg) has 5752 (57.5%) zerosZeros
우편물(kg) has 7268 (72.7%) zerosZeros
수하물(kg) has 3038 (30.4%) zerosZeros

Reproduction

Analysis started2024-03-14 11:36:10.885811
Analysis finished2024-03-14 11:36:26.814729
Duration15.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Categorical

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10
911 
2023-03
871 
2023-12
844 
2023-09
836 
2023-08
811 
Other values (8)
5727 

Length

Max length7
Median length7
Mean length6.9079
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06
2nd row2023-02
3rd row2023-04
4th row2023-01
5th row2023-02

Common Values

ValueCountFrequency (%)
2023-10 911
9.1%
2023-03 871
8.7%
2023-12 844
8.4%
2023-09 836
8.4%
2023-08 811
8.1%
2023-11 804
8.0%
2023-07 799
8.0%
2023-06 786
7.9%
2023-01 778
7.8%
2023-05 763
7.6%
Other values (3) 1797
18.0%

Length

2024-03-14T20:36:27.033594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2023-10 911
9.1%
2023-03 871
8.7%
2023-12 844
8.4%
2023-09 836
8.4%
2023-08 811
8.1%
2023-11 804
8.0%
2023-07 799
8.0%
2023-06 786
7.9%
2023-01 778
7.8%
2023-05 763
7.6%
Other values (3) 1797
18.0%

항공사(ICAO)
Text

MISSING 

Distinct112
Distinct (%)1.2%
Missing307
Missing (%)3.1%
Memory size156.2 KiB
2024-03-14T20:36:28.001895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters29079
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st rowAAR
2nd rowKAL
3rd rowKAL
4th rowKAL
5th rowKAL
ValueCountFrequency (%)
kal 2469
25.5%
aar 1407
14.5%
jja 663
 
6.8%
twb 512
 
5.3%
jna 436
 
4.5%
aih 230
 
2.4%
gti 220
 
2.3%
asv 210
 
2.2%
pac 169
 
1.7%
csn 167
 
1.7%
Other values (102) 3210
33.1%
2024-03-14T20:36:29.315841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 8443
29.0%
L 3126
 
10.8%
K 2721
 
9.4%
J 2016
 
6.9%
R 1575
 
5.4%
C 1460
 
5.0%
T 1171
 
4.0%
S 952
 
3.3%
B 847
 
2.9%
N 752
 
2.6%
Other values (16) 6016
20.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 29079
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 8443
29.0%
L 3126
 
10.8%
K 2721
 
9.4%
J 2016
 
6.9%
R 1575
 
5.4%
C 1460
 
5.0%
T 1171
 
4.0%
S 952
 
3.3%
B 847
 
2.9%
N 752
 
2.6%
Other values (16) 6016
20.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 29079
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 8443
29.0%
L 3126
 
10.8%
K 2721
 
9.4%
J 2016
 
6.9%
R 1575
 
5.4%
C 1460
 
5.0%
T 1171
 
4.0%
S 952
 
3.3%
B 847
 
2.9%
N 752
 
2.6%
Other values (16) 6016
20.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29079
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 8443
29.0%
L 3126
 
10.8%
K 2721
 
9.4%
J 2016
 
6.9%
R 1575
 
5.4%
C 1460
 
5.0%
T 1171
 
4.0%
S 952
 
3.3%
B 847
 
2.9%
N 752
 
2.6%
Other values (16) 6016
20.7%

항공사(IATA)
Text

MISSING 

Distinct111
Distinct (%)1.1%
Missing307
Missing (%)3.1%
Memory size156.2 KiB
2024-03-14T20:36:30.161869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length2.0008253
Min length2

Characters and Unicode

Total characters19394
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st rowOZ
2nd rowKE
3rd rowKE
4th rowKE
5th rowKE
ValueCountFrequency (%)
ke 2469
25.5%
oz 1407
14.5%
7c 663
 
6.8%
tw 512
 
5.3%
lj 436
 
4.5%
kj 230
 
2.4%
5y 220
 
2.3%
rs 210
 
2.2%
po 169
 
1.7%
cz 167
 
1.7%
Other values (101) 3210
33.1%
2024-03-14T20:36:31.218122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
K 2895
14.9%
E 2614
13.5%
Z 1736
 
9.0%
O 1715
 
8.8%
C 1293
 
6.7%
J 911
 
4.7%
T 721
 
3.7%
7 706
 
3.6%
L 691
 
3.6%
W 530
 
2.7%
Other values (25) 5582
28.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 17911
92.4%
Decimal Number 1483
 
7.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
K 2895
16.2%
E 2614
14.6%
Z 1736
9.7%
O 1715
9.6%
C 1293
 
7.2%
J 911
 
5.1%
T 721
 
4.0%
L 691
 
3.9%
W 530
 
3.0%
X 526
 
2.9%
Other values (16) 4279
23.9%
Decimal Number
ValueCountFrequency (%)
7 706
47.6%
5 422
28.5%
9 78
 
5.3%
4 73
 
4.9%
3 57
 
3.8%
8 48
 
3.2%
2 40
 
2.7%
6 31
 
2.1%
0 28
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Latin 17911
92.4%
Common 1483
 
7.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
K 2895
16.2%
E 2614
14.6%
Z 1736
9.7%
O 1715
9.6%
C 1293
 
7.2%
J 911
 
5.1%
T 721
 
4.0%
L 691
 
3.9%
W 530
 
3.0%
X 526
 
2.9%
Other values (16) 4279
23.9%
Common
ValueCountFrequency (%)
7 706
47.6%
5 422
28.5%
9 78
 
5.3%
4 73
 
4.9%
3 57
 
3.8%
8 48
 
3.2%
2 40
 
2.7%
6 31
 
2.1%
0 28
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19394
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
K 2895
14.9%
E 2614
13.5%
Z 1736
 
9.0%
O 1715
 
8.8%
C 1293
 
6.7%
J 911
 
4.7%
T 721
 
3.7%
7 706
 
3.6%
L 691
 
3.6%
W 530
 
2.7%
Other values (25) 5582
28.8%

항공사명
Text

MISSING 

Distinct111
Distinct (%)1.1%
Missing307
Missing (%)3.1%
Memory size156.2 KiB
2024-03-14T20:36:31.965369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length5.2015888
Min length3

Characters and Unicode

Total characters50419
Distinct characters170
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row아시아나항공
2nd row대한항공
3rd row대한항공
4th row대한항공
5th row대한항공
ValueCountFrequency (%)
대한항공 2469
22.3%
아시아나항공 1407
 
12.7%
에어 700
 
6.3%
제주항공 663
 
6.0%
티웨이항공 512
 
4.6%
436
 
3.9%
항공 289
 
2.6%
에어인천 230
 
2.1%
아틀라스항공 220
 
2.0%
에어서울 210
 
1.9%
Other values (120) 3926
35.5%
2024-03-14T20:36:32.939384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7942
15.8%
7942
15.8%
3592
 
7.1%
2524
 
5.0%
2469
 
4.9%
1876
 
3.7%
1648
 
3.3%
1598
 
3.2%
1500
 
3.0%
1369
 
2.7%
Other values (160) 17959
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47590
94.4%
Space Separator 1369
 
2.7%
Uppercase Letter 1136
 
2.3%
Lowercase Letter 282
 
0.6%
Open Punctuation 19
 
< 0.1%
Close Punctuation 19
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7942
16.7%
7942
16.7%
3592
 
7.5%
2524
 
5.3%
2469
 
5.2%
1876
 
3.9%
1648
 
3.5%
1598
 
3.4%
1500
 
3.2%
969
 
2.0%
Other values (135) 15530
32.6%
Uppercase Letter
ValueCountFrequency (%)
E 199
17.5%
F 144
12.7%
X 141
12.4%
L 118
10.4%
I 78
 
6.9%
A 74
 
6.5%
T 65
 
5.7%
D 51
 
4.5%
M 51
 
4.5%
R 46
 
4.0%
Other values (9) 169
14.9%
Lowercase Letter
ValueCountFrequency (%)
d 141
50.0%
e 141
50.0%
Space Separator
ValueCountFrequency (%)
1369
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47590
94.4%
Latin 1418
 
2.8%
Common 1411
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7942
16.7%
7942
16.7%
3592
 
7.5%
2524
 
5.3%
2469
 
5.2%
1876
 
3.9%
1648
 
3.5%
1598
 
3.4%
1500
 
3.2%
969
 
2.0%
Other values (135) 15530
32.6%
Latin
ValueCountFrequency (%)
E 199
14.0%
F 144
10.2%
X 141
9.9%
d 141
9.9%
e 141
9.9%
L 118
8.3%
I 78
 
5.5%
A 74
 
5.2%
T 65
 
4.6%
D 51
 
3.6%
Other values (11) 266
18.8%
Common
ValueCountFrequency (%)
1369
97.0%
( 19
 
1.3%
) 19
 
1.3%
- 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47590
94.4%
ASCII 2829
 
5.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7942
16.7%
7942
16.7%
3592
 
7.5%
2524
 
5.3%
2469
 
5.2%
1876
 
3.9%
1648
 
3.5%
1598
 
3.4%
1500
 
3.2%
969
 
2.0%
Other values (135) 15530
32.6%
ASCII
ValueCountFrequency (%)
1369
48.4%
E 199
 
7.0%
F 144
 
5.1%
X 141
 
5.0%
d 141
 
5.0%
e 141
 
5.0%
L 118
 
4.2%
I 78
 
2.8%
A 74
 
2.6%
T 65
 
2.3%
Other values (15) 359
 
12.7%

국제_국내
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국제
9628 
<NA>
 
307
국내
 
65

Length

Max length4
Median length2
Mean length2.0614
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국제
2nd row국제
3rd row국제
4th row국제
5th row국제

Common Values

ValueCountFrequency (%)
국제 9628
96.3%
<NA> 307
 
3.1%
국내 65
 
0.7%

Length

2024-03-14T20:36:33.182085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:36:33.370912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국제 9628
96.3%
na 307
 
3.1%
국내 65
 
0.7%

경유지공항
Text

MISSING 

Distinct215
Distinct (%)2.2%
Missing307
Missing (%)3.1%
Memory size156.2 KiB
2024-03-14T20:36:34.832910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters29079
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st rowTAO
2nd rowOSL
3rd rowBNE
4th rowKKJ
5th rowSEA
ValueCountFrequency (%)
nrt 364
 
3.8%
hkg 309
 
3.2%
kix 279
 
2.9%
tpe 229
 
2.4%
han 224
 
2.3%
pvg 212
 
2.2%
bkk 210
 
2.2%
dad 206
 
2.1%
lax 197
 
2.0%
cxr 169
 
1.7%
Other values (205) 7294
75.3%
2024-03-14T20:36:36.615911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 2465
 
8.5%
K 2313
 
8.0%
A 2196
 
7.6%
T 1700
 
5.8%
C 1633
 
5.6%
G 1597
 
5.5%
S 1591
 
5.5%
H 1444
 
5.0%
D 1431
 
4.9%
R 1221
 
4.2%
Other values (16) 11488
39.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 29079
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 2465
 
8.5%
K 2313
 
8.0%
A 2196
 
7.6%
T 1700
 
5.8%
C 1633
 
5.6%
G 1597
 
5.5%
S 1591
 
5.5%
H 1444
 
5.0%
D 1431
 
4.9%
R 1221
 
4.2%
Other values (16) 11488
39.5%

Most occurring scripts

ValueCountFrequency (%)
Latin 29079
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 2465
 
8.5%
K 2313
 
8.0%
A 2196
 
7.6%
T 1700
 
5.8%
C 1633
 
5.6%
G 1597
 
5.5%
S 1591
 
5.5%
H 1444
 
5.0%
D 1431
 
4.9%
R 1221
 
4.2%
Other values (16) 11488
39.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29079
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 2465
 
8.5%
K 2313
 
8.0%
A 2196
 
7.6%
T 1700
 
5.8%
C 1633
 
5.6%
G 1597
 
5.5%
S 1591
 
5.5%
H 1444
 
5.0%
D 1431
 
4.9%
R 1221
 
4.2%
Other values (16) 11488
39.5%

도착_출발
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
출발
4854 
도착
4839 
<NA>
 
307

Length

Max length4
Median length2
Mean length2.0614
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출발
2nd row도착
3rd row도착
4th row도착
5th row출발

Common Values

ValueCountFrequency (%)
출발 4854
48.5%
도착 4839
48.4%
<NA> 307
 
3.1%

Length

2024-03-14T20:36:36.864712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:36:37.054435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출발 4854
48.5%
도착 4839
48.4%
na 307
 
3.1%

정기_부정기
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정기
7879 
부정기
1814 
<NA>
 
307

Length

Max length4
Median length2
Mean length2.2428
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정기
2nd row정기
3rd row정기
4th row부정기
5th row정기

Common Values

ValueCountFrequency (%)
정기 7879
78.8%
부정기 1814
 
18.1%
<NA> 307
 
3.1%

Length

2024-03-14T20:36:37.253639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:36:37.540437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기 7879
78.8%
부정기 1814
 
18.1%
na 307
 
3.1%

여객_화물
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
여객
6671 
화물
3022 
<NA>
 
307

Length

Max length4
Median length2
Mean length2.0614
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여객
2nd row화물
3rd row여객
4th row화물
5th row여객

Common Values

ValueCountFrequency (%)
여객 6671
66.7%
화물 3022
30.2%
<NA> 307
 
3.1%

Length

2024-03-14T20:36:37.813701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:36:38.004838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여객 6671
66.7%
화물 3022
30.2%
na 307
 
3.1%

유임승객(명)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct4986
Distinct (%)51.4%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean3725.1487
Minimum0
Maximum39315
Zeros3029
Zeros (%)30.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:38.206073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1931
Q35218
95-th percentile14076.2
Maximum39315
Range39315
Interquartile range (IQR)5218

Descriptive statistics

Standard deviation5042.0313
Coefficient of variation (CV)1.3535114
Kurtosis6.3271582
Mean3725.1487
Median Absolute Deviation (MAD)1931
Skewness2.1979239
Sum36107866
Variance25422080
MonotonicityNot monotonic
2024-03-14T20:36:38.469453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3029
30.3%
2 6
 
0.1%
172 6
 
0.1%
188 6
 
0.1%
3661 6
 
0.1%
4582 6
 
0.1%
304 6
 
0.1%
2127 6
 
0.1%
179 5
 
0.1%
171 5
 
0.1%
Other values (4976) 6612
66.1%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 3029
30.3%
1 3
 
< 0.1%
2 6
 
0.1%
3 2
 
< 0.1%
4 2
 
< 0.1%
5 3
 
< 0.1%
6 1
 
< 0.1%
7 2
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
39315 1
< 0.1%
36354 1
< 0.1%
36046 1
< 0.1%
35513 1
< 0.1%
35182 1
< 0.1%
34798 1
< 0.1%
34564 1
< 0.1%
34184 1
< 0.1%
33849 1
< 0.1%
33353 1
< 0.1%

무임승객(명)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct212
Distinct (%)2.2%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean18.666151
Minimum0
Maximum361
Zeros3519
Zeros (%)35.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:38.868550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median6
Q325
95-th percentile78
Maximum361
Range361
Interquartile range (IQR)25

Descriptive statistics

Standard deviation31.170149
Coefficient of variation (CV)1.6698755
Kurtosis17.432776
Mean18.666151
Median Absolute Deviation (MAD)6
Skewness3.3629709
Sum180931
Variance971.57817
MonotonicityNot monotonic
2024-03-14T20:36:39.312517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3519
35.2%
1 285
 
2.9%
2 260
 
2.6%
5 249
 
2.5%
3 239
 
2.4%
6 211
 
2.1%
4 209
 
2.1%
7 189
 
1.9%
8 162
 
1.6%
10 159
 
1.6%
Other values (202) 4211
42.1%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 3519
35.2%
1 285
 
2.9%
2 260
 
2.6%
3 239
 
2.4%
4 209
 
2.1%
5 249
 
2.5%
6 211
 
2.1%
7 189
 
1.9%
8 162
 
1.6%
9 159
 
1.6%
ValueCountFrequency (%)
361 1
< 0.1%
332 1
< 0.1%
328 1
< 0.1%
326 1
< 0.1%
313 1
< 0.1%
311 1
< 0.1%
309 1
< 0.1%
287 1
< 0.1%
282 1
< 0.1%
280 1
< 0.1%

환승객(명)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct1861
Distinct (%)19.2%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean477.76519
Minimum0
Maximum18673
Zeros4102
Zeros (%)41.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:39.740116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median8
Q3237
95-th percentile2831.4
Maximum18673
Range18673
Interquartile range (IQR)237

Descriptive statistics

Standard deviation1362.3092
Coefficient of variation (CV)2.85142
Kurtosis39.092452
Mean477.76519
Median Absolute Deviation (MAD)8
Skewness5.3925986
Sum4630978
Variance1855886.5
MonotonicityNot monotonic
2024-03-14T20:36:40.179167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4102
41.0%
1 157
 
1.6%
2 136
 
1.4%
3 103
 
1.0%
4 98
 
1.0%
6 78
 
0.8%
5 71
 
0.7%
9 60
 
0.6%
8 59
 
0.6%
7 54
 
0.5%
Other values (1851) 4775
47.8%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 4102
41.0%
1 157
 
1.6%
2 136
 
1.4%
3 103
 
1.0%
4 98
 
1.0%
5 71
 
0.7%
6 78
 
0.8%
7 54
 
0.5%
8 59
 
0.6%
9 60
 
0.6%
ValueCountFrequency (%)
18673 1
< 0.1%
18026 1
< 0.1%
17571 1
< 0.1%
15811 1
< 0.1%
15022 1
< 0.1%
14897 1
< 0.1%
14894 1
< 0.1%
14832 1
< 0.1%
14373 1
< 0.1%
13949 1
< 0.1%

항공화물(kg)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct7331
Distinct (%)75.6%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean209714.82
Minimum0
Maximum4843510
Zeros2114
Zeros (%)21.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:40.611456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1336
median41677
Q3232017
95-th percentile978658
Maximum4843510
Range4843510
Interquartile range (IQR)231681

Descriptive statistics

Standard deviation422734.49
Coefficient of variation (CV)2.0157588
Kurtosis23.745511
Mean209714.82
Median Absolute Deviation (MAD)41677
Skewness4.1560985
Sum2.0327658 × 109
Variance1.7870445 × 1011
MonotonicityNot monotonic
2024-03-14T20:36:41.075639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2114
 
21.1%
3 9
 
0.1%
1 7
 
0.1%
45 6
 
0.1%
231 5
 
0.1%
15 5
 
0.1%
200 4
 
< 0.1%
72 4
 
< 0.1%
336 4
 
< 0.1%
60 4
 
< 0.1%
Other values (7321) 7531
75.3%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 2114
21.1%
1 7
 
0.1%
2 2
 
< 0.1%
3 9
 
0.1%
4 2
 
< 0.1%
5 1
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
8 3
 
< 0.1%
9 3
 
< 0.1%
ValueCountFrequency (%)
4843510 1
< 0.1%
4789015 1
< 0.1%
4729729 1
< 0.1%
4299632 1
< 0.1%
4247784 1
< 0.1%
4118294 1
< 0.1%
4077933 1
< 0.1%
3974124 1
< 0.1%
3914062 1
< 0.1%
3872575 1
< 0.1%

환적화물(kg)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct3876
Distinct (%)40.0%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean88027.241
Minimum0
Maximum3583128
Zeros5752
Zeros (%)57.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:41.674913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q340330
95-th percentile450588.2
Maximum3583128
Range3583128
Interquartile range (IQR)40330

Descriptive statistics

Standard deviation286377.06
Coefficient of variation (CV)3.2532777
Kurtosis54.646902
Mean88027.241
Median Absolute Deviation (MAD)0
Skewness6.5618139
Sum8.5324804 × 108
Variance8.201182 × 1010
MonotonicityNot monotonic
2024-03-14T20:36:42.126018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 5752
57.5%
2 5
 
0.1%
261 3
 
< 0.1%
18 3
 
< 0.1%
4 3
 
< 0.1%
1 3
 
< 0.1%
123 3
 
< 0.1%
250 3
 
< 0.1%
26 3
 
< 0.1%
120 2
 
< 0.1%
Other values (3866) 3913
39.1%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 5752
57.5%
1 3
 
< 0.1%
2 5
 
0.1%
4 3
 
< 0.1%
6 2
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%
10 2
 
< 0.1%
11 1
 
< 0.1%
12 2
 
< 0.1%
ValueCountFrequency (%)
3583128 1
< 0.1%
3548079 1
< 0.1%
3447077 1
< 0.1%
3429379 1
< 0.1%
3364957 1
< 0.1%
3340117 1
< 0.1%
3330973 1
< 0.1%
3274068 1
< 0.1%
3268393 1
< 0.1%
3215044 1
< 0.1%

우편물(kg)
Real number (ℝ)

MISSING  ZEROS 

Distinct2230
Distinct (%)23.0%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean2891.0424
Minimum0
Maximum230373
Zeros7268
Zeros (%)72.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:42.514258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile15909
Maximum230373
Range230373
Interquartile range (IQR)1

Descriptive statistics

Standard deviation10762.029
Coefficient of variation (CV)3.7225429
Kurtosis76.72897
Mean2891.0424
Median Absolute Deviation (MAD)0
Skewness7.2445221
Sum28022874
Variance1.1582127 × 108
MonotonicityNot monotonic
2024-03-14T20:36:42.772084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7268
72.7%
47 5
 
0.1%
25 5
 
0.1%
143 4
 
< 0.1%
384 3
 
< 0.1%
63 3
 
< 0.1%
1612 3
 
< 0.1%
11 3
 
< 0.1%
555 3
 
< 0.1%
8 3
 
< 0.1%
Other values (2220) 2393
 
23.9%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 7268
72.7%
1 2
 
< 0.1%
2 3
 
< 0.1%
3 1
 
< 0.1%
4 3
 
< 0.1%
5 3
 
< 0.1%
7 1
 
< 0.1%
8 3
 
< 0.1%
9 1
 
< 0.1%
10 1
 
< 0.1%
ValueCountFrequency (%)
230373 1
< 0.1%
200132 1
< 0.1%
152349 1
< 0.1%
146802 1
< 0.1%
142600 1
< 0.1%
137411 1
< 0.1%
131656 1
< 0.1%
124784 1
< 0.1%
120190 1
< 0.1%
118032 1
< 0.1%

수하물(kg)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct6528
Distinct (%)67.3%
Missing307
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean64381.776
Minimum0
Maximum827819
Zeros3038
Zeros (%)30.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T20:36:43.157993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median31798
Q391804
95-th percentile239289.8
Maximum827819
Range827819
Interquartile range (IQR)91804

Descriptive statistics

Standard deviation92662.907
Coefficient of variation (CV)1.4392723
Kurtosis10.073781
Mean64381.776
Median Absolute Deviation (MAD)31798
Skewness2.6498708
Sum6.2405255 × 108
Variance8.5864144 × 109
MonotonicityNot monotonic
2024-03-14T20:36:43.580303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3038
30.4%
2036 3
 
< 0.1%
25370 3
 
< 0.1%
91818 2
 
< 0.1%
20105 2
 
< 0.1%
83037 2
 
< 0.1%
48672 2
 
< 0.1%
45828 2
 
< 0.1%
37060 2
 
< 0.1%
25306 2
 
< 0.1%
Other values (6518) 6635
66.3%
(Missing) 307
 
3.1%
ValueCountFrequency (%)
0 3038
30.4%
27 2
 
< 0.1%
33 1
 
< 0.1%
49 1
 
< 0.1%
62 1
 
< 0.1%
70 1
 
< 0.1%
85 1
 
< 0.1%
90 1
 
< 0.1%
110 1
 
< 0.1%
120 1
 
< 0.1%
ValueCountFrequency (%)
827819 1
< 0.1%
820141 1
< 0.1%
774555 1
< 0.1%
754131 1
< 0.1%
731399 1
< 0.1%
693293 1
< 0.1%
691836 1
< 0.1%
689815 1
< 0.1%
688287 1
< 0.1%
685344 1
< 0.1%

Interactions

2024-03-14T20:36:23.995895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:13.102351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:15.082805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:17.009257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:18.950949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:20.835975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:22.755446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:24.172688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:13.393467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:15.371265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:17.296324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:19.235729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:21.123612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:22.933368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:24.344562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:13.682673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:15.649467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:17.578966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:19.512006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:21.406570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:23.103515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:24.520255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:13.971686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:15.931499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:17.861916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:19.788533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:21.692061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:23.277268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:24.679178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:14.248455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:16.199272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:18.139536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:20.048058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:21.963730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:23.437208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:24.854680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:14.543449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:16.482920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:18.424257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:20.327261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:22.243450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:23.613636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:25.092139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:14.815400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:16.741662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:18.689717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:20.581238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:22.506074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:36:23.811392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:36:43.851523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월국제_국내도착_출발정기_부정기여객_화물유임승객(명)무임승객(명)환승객(명)항공화물(kg)환적화물(kg)우편물(kg)수하물(kg)
년월1.0000.0290.0000.1320.0400.0950.0490.0290.0000.0000.0000.061
국제_국내0.0291.0000.0620.0970.0830.0650.0550.0000.0090.0000.0000.083
도착_출발0.0000.0621.0000.0210.0090.0000.0330.0000.0740.0550.0710.038
정기_부정기0.1320.0970.0211.0000.2820.4230.2660.1740.1390.0550.0660.372
여객_화물0.0400.0830.0090.2821.0000.6480.3930.2530.5560.4320.0360.535
유임승객(명)0.0950.0650.0000.4230.6481.0000.6670.3160.2180.1310.1490.773
무임승객(명)0.0490.0550.0330.2660.3930.6671.0000.3580.1050.0000.1490.620
환승객(명)0.0290.0000.0000.1740.2530.3160.3581.0000.1470.1260.2360.824
항공화물(kg)0.0000.0090.0740.1390.5560.2180.1050.1471.0000.9260.1480.263
환적화물(kg)0.0000.0000.0550.0550.4320.1310.0000.1260.9261.0000.1380.162
우편물(kg)0.0000.0000.0710.0660.0360.1490.1490.2360.1480.1381.0000.287
수하물(kg)0.0610.0830.0380.3720.5350.7730.6200.8240.2630.1620.2871.000
2024-03-14T20:36:44.185602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정기_부정기국제_국내년월여객_화물도착_출발
정기_부정기1.0000.0620.1020.1820.013
국제_국내0.0621.0000.0220.0530.039
년월0.1020.0221.0000.0310.000
여객_화물0.1820.0530.0311.0000.006
도착_출발0.0130.0390.0000.0061.000
2024-03-14T20:36:44.470220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유임승객(명)무임승객(명)환승객(명)항공화물(kg)환적화물(kg)우편물(kg)수하물(kg)년월국제_국내도착_출발정기_부정기여객_화물
유임승객(명)1.0000.9170.717-0.217-0.2400.2150.9530.0400.0500.0000.3250.503
무임승객(명)0.9171.0000.744-0.199-0.1850.2140.9110.0210.0420.0260.2040.301
환승객(명)0.7170.7441.000-0.0190.0460.2850.8300.0120.0000.0000.1330.194
항공화물(kg)-0.217-0.199-0.0191.0000.6430.372-0.1170.0000.0070.0570.1070.429
환적화물(kg)-0.240-0.1850.0460.6431.0000.171-0.1390.0000.0000.0420.0420.332
우편물(kg)0.2150.2140.2850.3720.1711.0000.2720.0000.0000.0710.0660.036
수하물(kg)0.9530.9110.830-0.117-0.1390.2721.0000.0260.0640.0290.2850.412
년월0.0400.0210.0120.0000.0000.0000.0261.0000.0220.0000.1020.031
국제_국내0.0500.0420.0000.0070.0000.0000.0640.0221.0000.0390.0620.053
도착_출발0.0000.0260.0000.0570.0420.0710.0290.0000.0391.0000.0130.006
정기_부정기0.3250.2040.1330.1070.0420.0660.2850.1020.0620.0131.0000.182
여객_화물0.5030.3010.1940.4290.3320.0360.4120.0310.0530.0060.1821.000

Missing values

2024-03-14T20:36:25.506989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:36:25.878634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T20:36:26.355097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

년월항공사(ICAO)항공사(IATA)항공사명국제_국내경유지공항도착_출발정기_부정기여객_화물유임승객(명)무임승객(명)환승객(명)항공화물(kg)환적화물(kg)우편물(kg)수하물(kg)
54582023-06AAROZ아시아나항공국제TAO출발정기여객785372165013575027726
17872023-02KALKE대한항공국제OSL도착정기화물00063610510779400
38482023-04KALKE대한항공국제BNE도착정기여객760139232300015910
7032023-01KALKE대한항공국제KKJ도착부정기화물0003868345200
18092023-02KALKE대한항공국제SEA출발정기여객190721249683159775740134936
13234<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
26292023-03JJA7C제주항공국제FUK출발정기여객2172310391000173646
46992023-05GTI5Y아틀라스항공국제HAN출발정기화물00072291000
1122023-01AAROZ아시아나항공국제SFO도착정기여객405217454511788257340276846
107462023-10MFXC6MY FREIGHTER국제TAS출발부정기화물00083258000
년월항공사(ICAO)항공사(IATA)항공사명국제_국내경유지공항도착_출발정기_부정기여객_화물유임승객(명)무임승객(명)환승객(명)항공화물(kg)환적화물(kg)우편물(kg)수하물(kg)
49292023-05KALKE대한항공국제DAD출발정기여객10268136136733388294570149289
13472<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
29512023-03KALKE대한항공국제RGN출발정기여객71021023595323740022658
25432023-03FDXFXFedEX항공국제SZX출발정기화물000395393929000
84022023-08PACPO폴라에어카고국제CVG출발부정기화물00012007171500
100722023-10CESMU중국동방항공국제XIY출발정기여객900000123
89362023-09CSNCZ중국남방항공국제CSX출발정기여객177010001943
15682023-02JNALJ진 에어국제CRK도착부정기여객3190530006781
53712023-06AAROZ아시아나항공국제FRA도착정기여객817024218310883332880158105
103702023-10JNALJ진 에어국제BKK도착정기여객10941356323624204581131930

Duplicate rows

Most frequently occurring

년월항공사(ICAO)항공사(IATA)항공사명국제_국내경유지공항도착_출발정기_부정기여객_화물유임승객(명)무임승객(명)환승객(명)항공화물(kg)환적화물(kg)우편물(kg)수하물(kg)# duplicates
0<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>307