Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
Unsupported1

Dataset

Description축산관계자 출국신고의 출입국일, 신고방법, 이용항만 등 제공
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20211015000000001647

Alerts

순번 is highly overall correlated with 출국일시High correlation
출국일시 is highly overall correlated with 순번High correlation
출발항 is highly imbalanced (79.6%)Imbalance
순번 has unique valuesUnique
귀국예정일 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-16 06:33:41.060199
Analysis finished2023-12-16 06:33:47.709650
Duration6.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12370.782
Minimum7
Maximum24769
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:33:48.328102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile1260.95
Q16115.25
median12345.5
Q318583.75
95-th percentile23562.65
Maximum24769
Range24762
Interquartile range (IQR)12468.5

Descriptive statistics

Standard deviation7168.6676
Coefficient of variation (CV)0.57948378
Kurtosis-1.2097581
Mean12370.782
Median Absolute Deviation (MAD)6235.5
Skewness0.0054135146
Sum1.2370782 × 108
Variance51389795
MonotonicityNot monotonic
2023-12-16T06:33:49.135314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11397 1
 
< 0.1%
12713 1
 
< 0.1%
21172 1
 
< 0.1%
14364 1
 
< 0.1%
6982 1
 
< 0.1%
10548 1
 
< 0.1%
24762 1
 
< 0.1%
19733 1
 
< 0.1%
6570 1
 
< 0.1%
16080 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
7 1
< 0.1%
10 1
< 0.1%
12 1
< 0.1%
16 1
< 0.1%
17 1
< 0.1%
20 1
< 0.1%
21 1
< 0.1%
28 1
< 0.1%
30 1
< 0.1%
32 1
< 0.1%
ValueCountFrequency (%)
24769 1
< 0.1%
24767 1
< 0.1%
24766 1
< 0.1%
24762 1
< 0.1%
24760 1
< 0.1%
24756 1
< 0.1%
24753 1
< 0.1%
24749 1
< 0.1%
24746 1
< 0.1%
24745 1
< 0.1%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
7070 
2792 
미기재
 
138

Length

Max length4
Median length1
Mean length1.0414
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
7070
70.7%
2792
 
27.9%
미기재 138
 
1.4%

Length

2023-12-16T06:33:50.047207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:50.590904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7070
70.7%
2792
 
27.9%
미기재 138
 
1.4%

출국일시
Real number (ℝ)

HIGH CORRELATION 

Distinct425
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20220431
Minimum20211001
Maximum20221130
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T06:33:51.591218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20211001
5-th percentile20220115
Q120220627
median20220828
Q320221031
95-th percentile20221126
Maximum20221130
Range10129
Interquartile range (IQR)404

Descriptive statistics

Standard deviation1965.1473
Coefficient of variation (CV)9.718622 × 10-5
Kurtosis18.183216
Mean20220431
Median Absolute Deviation (MAD)202
Skewness-4.4509461
Sum2.0220431 × 1011
Variance3861803.8
MonotonicityNot monotonic
2023-12-16T06:33:53.041041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20221128 133
 
1.3%
20221121 132
 
1.3%
20221124 130
 
1.3%
20221114 125
 
1.2%
20221126 118
 
1.2%
20220822 110
 
1.1%
20221108 99
 
1.0%
20221130 93
 
0.9%
20221125 92
 
0.9%
20221120 90
 
0.9%
Other values (415) 8878
88.8%
ValueCountFrequency (%)
20211001 1
 
< 0.1%
20211002 2
 
< 0.1%
20211003 3
< 0.1%
20211004 4
< 0.1%
20211005 4
< 0.1%
20211006 6
0.1%
20211007 4
< 0.1%
20211008 6
0.1%
20211009 3
< 0.1%
20211010 3
< 0.1%
ValueCountFrequency (%)
20221130 93
0.9%
20221129 78
0.8%
20221128 133
1.3%
20221127 83
0.8%
20221126 118
1.2%
20221125 92
0.9%
20221124 130
1.3%
20221123 73
0.7%
20221122 63
0.6%
20221121 132
1.3%

귀국예정일
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

출발항
Categorical

IMBALANCE 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인천공항
8451 
김해공항
1091 
미기재
 
140
무안국제공항
 
114
대구공항
 
82
Other values (15)
 
122

Length

Max length6
Median length4
Mean length4.0136
Min length2

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st row인천공항
2nd row인천공항
3rd row인천공항
4th row인천공항
5th row인천공항

Common Values

ValueCountFrequency (%)
인천공항 8451
84.5%
김해공항 1091
 
10.9%
미기재 140
 
1.4%
무안국제공항 114
 
1.1%
대구공항 82
 
0.8%
김포공항 41
 
0.4%
제주공항 28
 
0.3%
양양국제공항 17
 
0.2%
청주국제공항 16
 
0.2%
인천항 6
 
0.1%
Other values (10) 14
 
0.1%

Length

2023-12-16T06:33:53.653018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천공항 8451
84.5%
김해공항 1091
 
10.9%
미기재 140
 
1.4%
무안국제공항 114
 
1.1%
대구공항 82
 
0.8%
김포공항 41
 
0.4%
제주공항 28
 
0.3%
양양국제공항 17
 
0.2%
청주국제공항 16
 
0.2%
인천항 6
 
0.1%
Other values (10) 14
 
0.1%

구분
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전화
4989 
신고함
2393 
모바일
1449 
대행사
 
421
방문
 
383
Other values (2)
 
365

Length

Max length5
Median length2
Mean length2.4266
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row전화
2nd row신고함
3rd row신고함
4th row전화
5th row신고함

Common Values

ValueCountFrequency (%)
전화 4989
49.9%
신고함 2393
23.9%
모바일 1449
 
14.5%
대행사 421
 
4.2%
방문 383
 
3.8%
개인 364
 
3.6%
여행자협회 1
 
< 0.1%

Length

2023-12-16T06:33:54.280759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T06:33:54.952599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전화 4989
49.9%
신고함 2393
23.9%
모바일 1449
 
14.5%
대행사 421
 
4.2%
방문 383
 
3.8%
개인 364
 
3.6%
여행자협회 1
 
< 0.1%

Interactions

2023-12-16T06:33:43.683520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:42.734944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:44.555804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T06:33:43.195717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T06:33:55.394222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번성별출국일시출발항구분
순번1.0000.1360.6430.3130.178
성별0.1361.0000.0000.1930.085
출국일시0.6430.0001.0000.1240.060
출발항0.3130.1930.1241.0000.557
구분0.1780.0850.0600.5571.000
2023-12-16T06:33:55.946634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별출발항구분
성별1.0000.1020.057
출발항0.1021.0000.279
구분0.0570.2791.000
2023-12-16T06:33:56.624990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출국일시성별출발항구분
순번1.0001.0000.0810.1040.091
출국일시1.0001.0000.0010.0650.040
성별0.0810.0011.0000.1020.057
출발항0.1040.0650.1021.0000.279
구분0.0910.0400.0570.2791.000

Missing values

2023-12-16T06:33:45.639100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T06:33:47.377172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번성별출국일시귀국예정일출발항구분
11396113972022082020220824인천공항전화
19958199592022110920221113인천공항신고함
423342342022052420220527인천공항신고함
392239232022051720220522인천공항전화
636063612022070120220705인천공항신고함
731273132022071020220724인천공항전화
238392384020221127미기재인천공항전화
785178522022071620220721인천공항전화
426542662022052520221013인천공항전화
22953229542022112320221203인천공항전화
순번성별출국일시귀국예정일출발항구분
160916102022021320220227인천공항신고함
20231202322022111020221113김해공항신고함
10276102772022080920220818인천공항전화
139141391520220918미기재인천공항전화
8698702021121520220216인천공항전화
208062080720221113미기재인천공항전화
21379213802022111620221119인천공항대행사
494549462022060820220614인천공항전화
21883218842022111820221123인천공항모바일
22040220412022111920221130인천공항개인