Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description축산관계자 출국신고의 출입국일, 신고방법, 이용항만 등 제공
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20211015000000001647

Alerts

순번 is highly overall correlated with 출국일시High correlation
출국일시 is highly overall correlated with 순번High correlation
출발항 is highly imbalanced (66.0%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-16 06:32:52.601131
Analysis finished2023-12-16 06:32:58.053566
Duration5.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10111.815
Minimum1
Maximum20345
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:32:58.644478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1036.9
Q14968.25
median10076.5
Q315231
95-th percentile19310.25
Maximum20345
Range20344
Interquartile range (IQR)10262.75

Descriptive statistics

Standard deviation5892.0933
Coefficient of variation (CV)0.58269392
Kurtosis-1.2121038
Mean10111.815
Median Absolute Deviation (MAD)5135.5
Skewness0.019669876
Sum1.0111815 × 108
Variance34716763
MonotonicityNot monotonic
2023-12-16T06:32:59.418955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15973 1
 
< 0.1%
18753 1
 
< 0.1%
3706 1
 
< 0.1%
9020 1
 
< 0.1%
10142 1
 
< 0.1%
18254 1
 
< 0.1%
6013 1
 
< 0.1%
20094 1
 
< 0.1%
11205 1
 
< 0.1%
19578 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
15 1
< 0.1%
20 1
< 0.1%
21 1
< 0.1%
ValueCountFrequency (%)
20345 1
< 0.1%
20343 1
< 0.1%
20342 1
< 0.1%
20339 1
< 0.1%
20336 1
< 0.1%
20335 1
< 0.1%
20334 1
< 0.1%
20332 1
< 0.1%
20330 1
< 0.1%
20329 1
< 0.1%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6151 
3702 
미기재
 
147

Length

Max length3
Median length1
Mean length1.0294
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6151
61.5%
3702
37.0%
미기재 147
 
1.5%

Length

2023-12-16T06:33:00.265154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:00.869458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6151
61.5%
3702
37.0%
미기재 147
 
1.5%

출국일시
Real number (ℝ)

HIGH CORRELATION 

Distinct528
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200900
Minimum20200101
Maximum20210930
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:33:01.746145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200101
5-th percentile20200103
Q120200110
median20200125
Q320200210
95-th percentile20210305
Maximum20210930
Range10829
Interquartile range (IQR)100

Descriptive statistics

Standard deviation2628.7343
Coefficient of variation (CV)0.00013012956
Kurtosis9.4058655
Mean20200900
Median Absolute Deviation (MAD)19
Skewness3.3669008
Sum2.02009 × 1011
Variance6910243.8
MonotonicityNot monotonic
2023-12-16T06:33:02.610018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200106 316
 
3.2%
20200108 307
 
3.1%
20200107 287
 
2.9%
20200105 273
 
2.7%
20200110 266
 
2.7%
20200128 265
 
2.6%
20200103 256
 
2.6%
20200104 252
 
2.5%
20200109 244
 
2.4%
20200111 220
 
2.2%
Other values (518) 7314
73.1%
ValueCountFrequency (%)
20200101 173
1.7%
20200102 213
2.1%
20200103 256
2.6%
20200104 252
2.5%
20200105 273
2.7%
20200106 316
3.2%
20200107 287
2.9%
20200108 307
3.1%
20200109 244
2.4%
20200110 266
2.7%
ValueCountFrequency (%)
20210930 2
 
< 0.1%
20210929 5
0.1%
20210928 3
< 0.1%
20210927 1
 
< 0.1%
20210926 2
 
< 0.1%
20210925 6
0.1%
20210924 2
 
< 0.1%
20210923 1
 
< 0.1%
20210922 5
0.1%
20210921 1
 
< 0.1%
Distinct550
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T06:33:04.275983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.443
Min length3

Characters and Unicode

Total characters74430
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique243 ?
Unique (%)2.4%

Sample

1st row20200217
2nd row20200117
3rd row20200203
4th row20200219
5th row20200114
ValueCountFrequency (%)
미기재 1114
 
11.1%
20200112 271
 
2.7%
20200110 266
 
2.7%
20200111 252
 
2.5%
20200113 239
 
2.4%
20200118 214
 
2.1%
20200115 210
 
2.1%
20200127 207
 
2.1%
20200108 207
 
2.1%
20200202 206
 
2.1%
Other values (540) 6814
68.1%
2023-12-16T06:33:06.065402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 29823
40.1%
2 23930
32.2%
1 9977
 
13.4%
3 1593
 
2.1%
1114
 
1.5%
1114
 
1.5%
1114
 
1.5%
7 1100
 
1.5%
8 1031
 
1.4%
6 985
 
1.3%
Other values (3) 2649
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 71088
95.5%
Other Letter 3342
 
4.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29823
42.0%
2 23930
33.7%
1 9977
 
14.0%
3 1593
 
2.2%
7 1100
 
1.5%
8 1031
 
1.5%
6 985
 
1.4%
9 942
 
1.3%
5 894
 
1.3%
4 813
 
1.1%
Other Letter
ValueCountFrequency (%)
1114
33.3%
1114
33.3%
1114
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 71088
95.5%
Hangul 3342
 
4.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 29823
42.0%
2 23930
33.7%
1 9977
 
14.0%
3 1593
 
2.2%
7 1100
 
1.5%
8 1031
 
1.5%
6 985
 
1.4%
9 942
 
1.3%
5 894
 
1.3%
4 813
 
1.1%
Hangul
ValueCountFrequency (%)
1114
33.3%
1114
33.3%
1114
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 71088
95.5%
Hangul 3342
 
4.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 29823
42.0%
2 23930
33.7%
1 9977
 
14.0%
3 1593
 
2.2%
7 1100
 
1.5%
8 1031
 
1.5%
6 985
 
1.4%
9 942
 
1.3%
5 894
 
1.3%
4 813
 
1.1%
Hangul
ValueCountFrequency (%)
1114
33.3%
1114
33.3%
1114
33.3%

출발항
Categorical

IMBALANCE 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인천공항
7562 
김해공항
928 
무안국제공항
 
493
대구공항
 
415
미기재
 
256
Other values (13)
 
346

Length

Max length6
Median length4
Mean length4.0938
Min length3

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row인천공항
2nd row미기재
3rd row인천공항
4th row인천공항
5th row김해공항

Common Values

ValueCountFrequency (%)
인천공항 7562
75.6%
김해공항 928
 
9.3%
무안국제공항 493
 
4.9%
대구공항 415
 
4.2%
미기재 256
 
2.6%
청주국제공항 126
 
1.3%
김포공항 77
 
0.8%
부산항 51
 
0.5%
제주공항 49
 
0.5%
양양국제공항 16
 
0.2%
Other values (8) 27
 
0.3%

Length

2023-12-16T06:33:06.763278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천공항 7562
75.6%
김해공항 928
 
9.3%
무안국제공항 493
 
4.9%
대구공항 415
 
4.2%
미기재 256
 
2.6%
청주국제공항 126
 
1.3%
김포공항 77
 
0.8%
부산항 51
 
0.5%
제주공항 49
 
0.5%
양양국제공항 16
 
0.2%
Other values (8) 27
 
0.3%

구분
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신고함
3958 
전화
3178 
모바일
1425 
방문
606 
개인
580 
Other values (3)
 
253

Length

Max length5
Median length3
Mean length2.5765
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신고함
2nd row전화
3rd row신고함
4th row신고함
5th row신고함

Common Values

ValueCountFrequency (%)
신고함 3958
39.6%
전화 3178
31.8%
모바일 1425
 
14.2%
방문 606
 
6.1%
개인 580
 
5.8%
대행사 138
 
1.4%
키오스크 101
 
1.0%
여행자협회 14
 
0.1%

Length

2023-12-16T06:33:07.582456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:08.240965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고함 3958
39.6%
전화 3178
31.8%
모바일 1425
 
14.2%
방문 606
 
6.1%
개인 580
 
5.8%
대행사 138
 
1.4%
키오스크 101
 
1.0%
여행자협회 14
 
0.1%

Interactions

2023-12-16T06:32:55.798339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:32:54.682310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:32:56.563177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:32:55.102182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T06:33:08.681854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번성별출국일시출발항구분
순번1.0000.1650.7420.2300.216
성별0.1651.0000.1740.1450.154
출국일시0.7420.1741.0000.2150.179
출발항0.2300.1450.2151.0000.659
구분0.2160.1540.1790.6591.000
2023-12-16T06:33:09.021992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별출발항구분
성별1.0000.0660.098
출발항0.0661.0000.348
구분0.0980.3481.000
2023-12-16T06:33:09.596071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출국일시성별출발항구분
순번1.0001.0000.0990.0900.104
출국일시1.0001.0000.0540.1000.115
성별0.0990.0541.0000.0660.098
출발항0.0900.1000.0661.0000.348
구분0.1040.1150.0980.3481.000

Missing values

2023-12-16T06:32:57.236470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T06:32:57.644807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번성별출국일시귀국예정일출발항구분
15972159732020021420200217인천공항신고함
668666872020011420200117미기재전화
11951119522020013020200203인천공항신고함
16237162382020021620200219인천공항신고함
484448452020011020200114김해공항신고함
18445184462020090720200929인천공항신고함
366736682020010820200112인천공항전화
1799017991미기재20200417미기재인천공항전화
936893692020012320200131인천공항모바일
248724882020010620200112인천공항전화
순번성별출국일시귀국예정일출발항구분
695369542020011520200126인천공항신고함
680068012020011520200118인천공항신고함
180281802920200508미기재인천공항전화
839583962020011920200122인천공항전화
187471874820201121미기재인천공항전화
279527962020010620200111인천공항모바일
114311442020010320200105인천공항전화
14850148512020020820200211인천공항모바일
989898992020012420200127인천공항전화
12547125482020013120200204인천공항전화