Overview

Dataset statistics

Number of variables8
Number of observations46
Missing cells46
Missing cells (%)12.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory71.9 B

Variable types

Numeric4
Unsupported1
Categorical3

Dataset

Description공항별로 국내선과 국제선으로 나누어 여객성인 출발/도착 수, 여객유아 출발/도착 수
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2629

Alerts

여객유아도착 is highly overall correlated with 여객유아출발 and 2 other fieldsHigh correlation
여객유아출발 is highly overall correlated with 여객유아도착 and 2 other fieldsHigh correlation
여객성인출발 is highly overall correlated with 여객유아도착 and 2 other fieldsHigh correlation
여객성인도착 is highly overall correlated with 여객유아도착 and 2 other fieldsHigh correlation
연도 has 46 (100.0%) missing valuesMissing
연도 is an unsupported type, check if it needs cleaning or further analysisUnsupported
여객유아도착 has 5 (10.9%) zerosZeros
여객유아출발 has 6 (13.0%) zerosZeros
여객성인출발 has 6 (13.0%) zerosZeros
여객성인도착 has 5 (10.9%) zerosZeros

Reproduction

Analysis started2024-01-09 22:52:24.969644
Analysis finished2024-01-09 22:52:26.595681
Duration1.63 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

여객유아도착
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct38
Distinct (%)82.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4315.9348
Minimum0
Maximum62393
Zeros5
Zeros (%)10.9%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-01-10T07:52:26.651964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16.5
median120
Q31633
95-th percentile18573.25
Maximum62393
Range62393
Interquartile range (IQR)1626.5

Descriptive statistics

Standard deviation12367.91
Coefficient of variation (CV)2.8656387
Kurtosis16.196786
Mean4315.9348
Median Absolute Deviation (MAD)120
Skewness4.0071907
Sum198533
Variance1.5296519 × 108
MonotonicityNot monotonic
2024-01-10T07:52:26.762465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
0 5
 
10.9%
1 3
 
6.5%
37 2
 
4.3%
6 2
 
4.3%
154 1
 
2.2%
1327 1
 
2.2%
556 1
 
2.2%
5563 1
 
2.2%
50 1
 
2.2%
92 1
 
2.2%
Other values (28) 28
60.9%
ValueCountFrequency (%)
0 5
10.9%
1 3
6.5%
3 1
 
2.2%
4 1
 
2.2%
6 2
 
4.3%
8 1
 
2.2%
13 1
 
2.2%
24 1
 
2.2%
37 2
 
4.3%
40 1
 
2.2%
ValueCountFrequency (%)
62393 1
2.2%
54662 1
2.2%
19587 1
2.2%
15532 1
2.2%
10216 1
2.2%
5563 1
2.2%
5151 1
2.2%
5112 1
2.2%
4679 1
2.2%
2836 1
2.2%

여객유아출발
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct36
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4443.1739
Minimum0
Maximum61412
Zeros6
Zeros (%)13.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-01-10T07:52:26.861606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q18
median109
Q32778.75
95-th percentile20114.75
Maximum61412
Range61412
Interquartile range (IQR)2770.75

Descriptive statistics

Standard deviation12350.531
Coefficient of variation (CV)2.779664
Kurtosis15.564546
Mean4443.1739
Median Absolute Deviation (MAD)109
Skewness3.9289154
Sum204386
Variance1.5253561 × 108
MonotonicityNot monotonic
2024-01-10T07:52:26.962772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
0 6
 
13.0%
8 3
 
6.5%
3311 2
 
4.3%
4 2
 
4.3%
1 2
 
4.3%
53 1
 
2.2%
35 1
 
2.2%
77 1
 
2.2%
46 1
 
2.2%
117 1
 
2.2%
Other values (26) 26
56.5%
ValueCountFrequency (%)
0 6
13.0%
1 2
 
4.3%
4 2
 
4.3%
6 1
 
2.2%
8 3
6.5%
23 1
 
2.2%
35 1
 
2.2%
44 1
 
2.2%
45 1
 
2.2%
46 1
 
2.2%
ValueCountFrequency (%)
61412 1
2.2%
55044 1
2.2%
21554 1
2.2%
15797 1
2.2%
10361 1
2.2%
5472 1
2.2%
5236 1
2.2%
5143 1
2.2%
4764 1
2.2%
3311 2
4.3%

연도
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing46
Missing (%)100.0%
Memory size546.0 B

공항
Categorical

Distinct15
Distinct (%)32.6%
Missing0
Missing (%)0.0%
Memory size500.0 B
인천 INCHEON
김포 GIMPO
김해 GIMHAE
제주 JEJU
대구 DAEGU
Other values (10)
26 

Length

Max length11
Median length9
Mean length8.8695652
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천 INCHEON
2nd row인천 INCHEON
3rd row김포 GIMPO
4th row김포 GIMPO
5th row김해 GIMHAE

Common Values

ValueCountFrequency (%)
인천 INCHEON 4
 
8.7%
김포 GIMPO 4
 
8.7%
김해 GIMHAE 4
 
8.7%
제주 JEJU 4
 
8.7%
대구 DAEGU 4
 
8.7%
청주 CHEONGJU 4
 
8.7%
무안 MUAN 4
 
8.7%
양양 YANGYANG 4
 
8.7%
광주 GWANGJU 2
 
4.3%
여수 YEOSU 2
 
4.3%
Other values (5) 10
21.7%

Length

2024-01-10T07:52:27.075413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천 4
 
4.3%
daegu 4
 
4.3%
incheon 4
 
4.3%
양양 4
 
4.3%
muan 4
 
4.3%
무안 4
 
4.3%
cheongju 4
 
4.3%
청주 4
 
4.3%
yangyang 4
 
4.3%
대구 4
 
4.3%
Other values (20) 52
56.5%

여객성인출발
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct41
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean624358.46
Minimum0
Maximum9174160
Zeros6
Zeros (%)13.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-01-10T07:52:27.438618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1951.75
median11709.5
Q3254548.75
95-th percentile3342345.2
Maximum9174160
Range9174160
Interquartile range (IQR)253597

Descriptive statistics

Standard deviation1813028.2
Coefficient of variation (CV)2.9038258
Kurtosis15.125685
Mean624358.46
Median Absolute Deviation (MAD)11709.5
Skewness3.8655183
Sum28720489
Variance3.2870711 × 1012
MonotonicityNot monotonic
2024-01-10T07:52:27.545913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
0 6
 
13.0%
1239 1
 
2.2%
26483 1
 
2.2%
185377 1
 
2.2%
70855 1
 
2.2%
698616 1
 
2.2%
7729 1
 
2.2%
12167 1
 
2.2%
6529 1
 
2.2%
7255 1
 
2.2%
Other values (31) 31
67.4%
ValueCountFrequency (%)
0 6
13.0%
105 1
 
2.2%
398 1
 
2.2%
541 1
 
2.2%
620 1
 
2.2%
768 1
 
2.2%
856 1
 
2.2%
1239 1
 
2.2%
2324 1
 
2.2%
3361 1
 
2.2%
ValueCountFrequency (%)
9174160 1
2.2%
7593814 1
2.2%
3480161 1
2.2%
2928898 1
2.2%
1048074 1
2.2%
698616 1
2.2%
697381 1
2.2%
671679 1
2.2%
645592 1
2.2%
372851 1
2.2%

여객성인도착
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct42
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean625703.87
Minimum0
Maximum9131290
Zeros5
Zeros (%)10.9%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-01-10T07:52:27.650414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11584.25
median12316
Q3226406.5
95-th percentile3412517.2
Maximum9131290
Range9131290
Interquartile range (IQR)224822.25

Descriptive statistics

Standard deviation1820038.6
Coefficient of variation (CV)2.908786
Kurtosis14.911242
Mean625703.87
Median Absolute Deviation (MAD)12316
Skewness3.8452637
Sum28782378
Variance3.3125406 × 1012
MonotonicityNot monotonic
2024-01-10T07:52:27.757498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
0 5
 
10.9%
1303 1
 
2.2%
24874 1
 
2.2%
181138 1
 
2.2%
60150 1
 
2.2%
699245 1
 
2.2%
8145 1
 
2.2%
13218 1
 
2.2%
5142 1
 
2.2%
7137 1
 
2.2%
Other values (32) 32
69.6%
ValueCountFrequency (%)
0 5
10.9%
61 1
 
2.2%
235 1
 
2.2%
430 1
 
2.2%
701 1
 
2.2%
818 1
 
2.2%
847 1
 
2.2%
1303 1
 
2.2%
2428 1
 
2.2%
2527 1
 
2.2%
ValueCountFrequency (%)
9131290 1
2.2%
7689020 1
2.2%
3580970 1
2.2%
2907159 1
2.2%
1050046 1
2.2%
699245 1
2.2%
696636 1
2.2%
673104 1
2.2%
645191 1
2.2%
371984 1
2.2%

노선
Categorical

Distinct2
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size500.0 B
국내선
30 
국제선
16 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내선
2nd row국제선
3rd row국내선
4th row국제선
5th row국내선

Common Values

ValueCountFrequency (%)
국내선 30
65.2%
국제선 16
34.8%

Length

2024-01-10T07:52:27.860350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:52:27.937196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내선 30
65.2%
국제선 16
34.8%
Distinct2
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size500.0 B
정기
23 
부정기
23 

Length

Max length3
Median length2.5
Mean length2.5
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정기
2nd row정기
3rd row정기
4th row정기
5th row정기

Common Values

ValueCountFrequency (%)
정기 23
50.0%
부정기 23
50.0%

Length

2024-01-10T07:52:28.015923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:52:28.090785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기 23
50.0%
부정기 23
50.0%

Interactions

2024-01-10T07:52:26.150558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.219129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.538009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.849813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:26.218360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.300311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.619135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.929966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:26.287634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.383326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.697516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:26.010057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:26.351024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.466566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:25.777972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:52:26.081435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:52:28.141213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
여객유아도착여객유아출발공항여객성인출발여객성인도착노선정기_부정기여부
여객유아도착1.0001.0000.0001.0001.0000.0000.170
여객유아출발1.0001.0000.0001.0001.0000.0000.170
공항0.0000.0001.0000.0000.0000.0000.000
여객성인출발1.0001.0000.0001.0001.0000.0000.160
여객성인도착1.0001.0000.0001.0001.0000.0000.160
노선0.0000.0000.0000.0000.0001.0000.000
정기_부정기여부0.1700.1700.0000.1600.1600.0001.000
2024-01-10T07:52:28.230164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선공항정기_부정기여부
노선1.0000.0000.000
공항0.0001.0000.000
정기_부정기여부0.0000.0001.000
2024-01-10T07:52:28.308544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
여객유아도착여객유아출발여객성인출발여객성인도착공항노선정기_부정기여부
여객유아도착1.0000.9930.9320.9640.0000.0000.105
여객유아출발0.9931.0000.9220.9540.0000.0000.105
여객성인출발0.9320.9221.0000.9920.0000.0000.184
여객성인도착0.9640.9540.9921.0000.0000.0000.184
공항0.0000.0000.0000.0001.0000.0000.000
노선0.0000.0000.0000.0000.0001.0000.000
정기_부정기여부0.1050.1050.1840.1840.0000.0001.000

Missing values

2024-01-10T07:52:26.448463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:52:26.553930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

여객유아도착여객유아출발연도공항여객성인출발여객성인도착노선정기_부정기여부
000<NA>인천 INCHEON00국내선정기
11958721554<NA>인천 INCHEON34801613580970국제선정기
25466255044<NA>김포 GIMPO75938147689020국내선정기
311<NA>김포 GIMPO10561국제선정기
41553215797<NA>김해 GIMHAE29288982907159국내선정기
5492439<NA>김해 GIMHAE4638754679국제선정기
66239361412<NA>제주 JEJU91741609131290국내선정기
711<NA>제주 JEJU620235국제선정기
851125143<NA>대구 DAEGU671679673104국내선정기
94123<NA>대구 DAEGU34335869국제선정기
여객유아도착여객유아출발연도공항여객성인출발여객성인도착노선정기_부정기여부
3668<NA>무안 MUAN39634329국제선부정기
376046<NA>양양 YANGYANG55765909국내선부정기
3834<NA>양양 YANGYANG856847국제선부정기
3910477<NA>광주 GWANGJU1125211414국내선부정기
403735<NA>여수 YEOSU23242527국내선부정기
414053<NA>울산 ULSAN44543656국내선부정기
4200<NA>사천 SACHEON00국내선부정기
43138<NA>포항 POHANG768818국내선부정기
44136117<NA>군산 GUNSAN1372414392국내선부정기
4568<NA>원주 WONJU398430국내선부정기