Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description축산관계자 출국신고의 출입국일, 신고방법, 이용항만 등 제공
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20211015000000001647

Alerts

순번 is highly overall correlated with 출국일시High correlation
출국일시 is highly overall correlated with 순번High correlation
출발항 is highly imbalanced (67.4%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-16 06:33:17.877734
Analysis finished2023-12-16 06:33:21.470231
Duration3.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10180.442
Minimum2
Maximum20345
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:33:21.734014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile1079.95
Q15126.75
median10172.5
Q315246
95-th percentile19301.05
Maximum20345
Range20343
Interquartile range (IQR)10119.25

Descriptive statistics

Standard deviation5842.774
Coefficient of variation (CV)0.57392147
Kurtosis-1.1930972
Mean10180.442
Median Absolute Deviation (MAD)5061
Skewness-0.0015832866
Sum1.0180442 × 108
Variance34138008
MonotonicityNot monotonic
2023-12-16T06:33:22.344413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1206 1
 
< 0.1%
3140 1
 
< 0.1%
2946 1
 
< 0.1%
19939 1
 
< 0.1%
6354 1
 
< 0.1%
9594 1
 
< 0.1%
7096 1
 
< 0.1%
2176 1
 
< 0.1%
4082 1
 
< 0.1%
18234 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
13 1
< 0.1%
16 1
< 0.1%
18 1
< 0.1%
21 1
< 0.1%
22 1
< 0.1%
ValueCountFrequency (%)
20345 1
< 0.1%
20344 1
< 0.1%
20340 1
< 0.1%
20339 1
< 0.1%
20338 1
< 0.1%
20332 1
< 0.1%
20330 1
< 0.1%
20326 1
< 0.1%
20325 1
< 0.1%
20324 1
< 0.1%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6184 
3683 
미기재
 
133

Length

Max length3
Median length1
Mean length1.0266
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6184
61.8%
3683
36.8%
미기재 133
 
1.3%

Length

2023-12-16T06:33:22.983423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:23.563078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6184
61.8%
3683
36.8%
미기재 133
 
1.3%

출국일시
Real number (ℝ)

HIGH CORRELATION 

Distinct516
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200889
Minimum20200101
Maximum20210930
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:33:23.937448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200101
5-th percentile20200103
Q120200110
median20200126
Q320200210
95-th percentile20210304
Maximum20210930
Range10829
Interquartile range (IQR)100

Descriptive statistics

Standard deviation2609.277
Coefficient of variation (CV)0.00012916644
Kurtosis9.6105612
Mean20200889
Median Absolute Deviation (MAD)20
Skewness3.3971455
Sum2.0200889 × 1011
Variance6808326.2
MonotonicityNot monotonic
2023-12-16T06:33:24.613174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200106 302
 
3.0%
20200128 289
 
2.9%
20200107 279
 
2.8%
20200108 278
 
2.8%
20200103 268
 
2.7%
20200105 265
 
2.6%
20200109 258
 
2.6%
20200104 256
 
2.6%
20200110 256
 
2.6%
20200124 241
 
2.4%
Other values (506) 7308
73.1%
ValueCountFrequency (%)
20200101 167
1.7%
20200102 185
1.8%
20200103 268
2.7%
20200104 256
2.6%
20200105 265
2.6%
20200106 302
3.0%
20200107 279
2.8%
20200108 278
2.8%
20200109 258
2.6%
20200110 256
2.6%
ValueCountFrequency (%)
20210930 2
 
< 0.1%
20210929 3
< 0.1%
20210928 2
 
< 0.1%
20210926 5
0.1%
20210925 4
< 0.1%
20210924 1
 
< 0.1%
20210923 1
 
< 0.1%
20210922 4
< 0.1%
20210921 1
 
< 0.1%
20210920 1
 
< 0.1%
Distinct550
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T06:33:25.827431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.463
Min length3

Characters and Unicode

Total characters74630
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique243 ?
Unique (%)2.4%

Sample

1st row20200109
2nd row20200118
3rd row20200228
4th row20210711
5th row미기재
ValueCountFrequency (%)
미기재 1074
 
10.7%
20200110 273
 
2.7%
20200112 263
 
2.6%
20200111 254
 
2.5%
20200113 226
 
2.3%
20200127 218
 
2.2%
20200115 208
 
2.1%
20200117 197
 
2.0%
20200202 197
 
2.0%
20200118 196
 
2.0%
Other values (540) 6894
68.9%
2023-12-16T06:33:27.464989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 29944
40.1%
2 24130
32.3%
1 9920
 
13.3%
3 1597
 
2.1%
7 1146
 
1.5%
1074
 
1.4%
1074
 
1.4%
1074
 
1.4%
6 1002
 
1.3%
8 982
 
1.3%
Other values (3) 2687
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 71408
95.7%
Other Letter 3222
 
4.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29944
41.9%
2 24130
33.8%
1 9920
 
13.9%
3 1597
 
2.2%
7 1146
 
1.6%
6 1002
 
1.4%
8 982
 
1.4%
9 964
 
1.3%
5 885
 
1.2%
4 838
 
1.2%
Other Letter
ValueCountFrequency (%)
1074
33.3%
1074
33.3%
1074
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 71408
95.7%
Hangul 3222
 
4.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 29944
41.9%
2 24130
33.8%
1 9920
 
13.9%
3 1597
 
2.2%
7 1146
 
1.6%
6 1002
 
1.4%
8 982
 
1.4%
9 964
 
1.3%
5 885
 
1.2%
4 838
 
1.2%
Hangul
ValueCountFrequency (%)
1074
33.3%
1074
33.3%
1074
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 71408
95.7%
Hangul 3222
 
4.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 29944
41.9%
2 24130
33.8%
1 9920
 
13.9%
3 1597
 
2.2%
7 1146
 
1.6%
6 1002
 
1.4%
8 982
 
1.4%
9 964
 
1.3%
5 885
 
1.2%
4 838
 
1.2%
Hangul
ValueCountFrequency (%)
1074
33.3%
1074
33.3%
1074
33.3%

출발항
Categorical

IMBALANCE 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인천공항
7539 
김해공항
890 
무안국제공항
 
515
대구공항
 
450
미기재
 
253
Other values (16)
 
353

Length

Max length6
Median length4
Mean length4.1026
Min length3

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row인천공항
2nd row인천공항
3rd row인천공항
4th row인천공항
5th row인천공항

Common Values

ValueCountFrequency (%)
인천공항 7539
75.4%
김해공항 890
 
8.9%
무안국제공항 515
 
5.1%
대구공항 450
 
4.5%
미기재 253
 
2.5%
청주국제공항 138
 
1.4%
김포공항 83
 
0.8%
부산항 45
 
0.4%
제주공항 41
 
0.4%
양양국제공항 20
 
0.2%
Other values (11) 26
 
0.3%

Length

2023-12-16T06:33:27.976963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천공항 7539
75.4%
김해공항 890
 
8.9%
무안국제공항 515
 
5.1%
대구공항 450
 
4.5%
미기재 253
 
2.5%
청주국제공항 138
 
1.4%
김포공항 83
 
0.8%
부산항 45
 
0.4%
제주공항 41
 
0.4%
양양국제공항 20
 
0.2%
Other values (11) 26
 
0.3%

구분
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신고함
3959 
전화
3132 
모바일
1443 
방문
612 
개인
598 
Other values (3)
 
256

Length

Max length5
Median length3
Mean length2.5787
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row모바일
2nd row모바일
3rd row신고함
4th row모바일
5th row신고함

Common Values

ValueCountFrequency (%)
신고함 3959
39.6%
전화 3132
31.3%
모바일 1443
 
14.4%
방문 612
 
6.1%
개인 598
 
6.0%
대행사 143
 
1.4%
키오스크 97
 
1.0%
여행자협회 16
 
0.2%

Length

2023-12-16T06:33:28.642666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:29.168897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고함 3959
39.6%
전화 3132
31.3%
모바일 1443
 
14.4%
방문 612
 
6.1%
개인 598
 
6.0%
대행사 143
 
1.4%
키오스크 97
 
1.0%
여행자협회 16
 
0.2%

Interactions

2023-12-16T06:33:19.773657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:18.895012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:20.158050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:19.331807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T06:33:29.562554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번성별출국일시출발항구분
순번1.0000.1550.7400.2270.218
성별0.1551.0000.1530.1660.155
출국일시0.7400.1531.0000.2620.193
출발항0.2270.1660.2621.0000.665
구분0.2180.1550.1930.6651.000
2023-12-16T06:33:30.023024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별출발항구분
성별1.0000.0770.099
출발항0.0771.0000.348
구분0.0990.3481.000
2023-12-16T06:33:30.412804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출국일시성별출발항구분
순번1.0001.0000.0930.0850.105
출국일시1.0001.0000.0470.1240.123
성별0.0930.0471.0000.0770.099
출발항0.0850.1240.0771.0000.348
구분0.1050.1230.0990.3481.000

Missing values

2023-12-16T06:33:20.632893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T06:33:21.157723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번성별출국일시귀국예정일출발항구분
120512062020010320200109인천공항모바일
667566762020011420200118인천공항모바일
16207162082020021620200228인천공항신고함
18824188252020121020210711인천공항모바일
201662016720210829미기재인천공항신고함
12622126232020020120200206무안국제공항방문
693169322020011520200118무안국제공항방문
4724732020010220200113인천공항신고함
253625372020010620200109인천공항개인
597559762020011320200117인천공항대행사
순번성별출국일시귀국예정일출발항구분
369736982020010820200111인천공항개인
256925702020010620200110인천공항전화
14273142742020020520200210대구공항신고함
326732682020010720200111청주국제공항신고함
10820108212020012720200201대구공항신고함
564256432020011220200118인천공항개인
15127151282020021020200214인천공항전화
123351233620200131미기재김해공항신고함
272827292020010620200110인천공항신고함
18455184562020091020200927인천공항전화