Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory556.6 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Categorical3
DateTime1
Text1

Dataset

Description축산관계자 출국신고를 통해 해외로 출국한 축산관계자의 출입국일, 신고방법, 이용항만 등의 현황 제공(축산관계자 중 해외로 출국시 출국신고 현황을 제공함)
Author농림축산식품부 농림축산검역본부
URLhttps://www.data.go.kr/data/15102240/fileData.do

Alerts

출발항 is highly imbalanced (58.6%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-16 15:42:11.365999
Analysis finished2023-12-16 15:42:13.771555
Duration2.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50272.807
Minimum8
Maximum99970
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:42:14.518052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile5223.95
Q125199
median50314.5
Q375235.25
95-th percentile94938.1
Maximum99970
Range99962
Interquartile range (IQR)50036.25

Descriptive statistics

Standard deviation28845.566
Coefficient of variation (CV)0.5737807
Kurtosis-1.2017353
Mean50272.807
Median Absolute Deviation (MAD)25004
Skewness-0.020055118
Sum5.0272807 × 108
Variance8.3206669 × 108
MonotonicityNot monotonic
2023-12-16T15:42:15.462589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
48396 1
 
< 0.1%
11846 1
 
< 0.1%
46398 1
 
< 0.1%
85191 1
 
< 0.1%
59543 1
 
< 0.1%
89614 1
 
< 0.1%
28774 1
 
< 0.1%
68995 1
 
< 0.1%
38859 1
 
< 0.1%
67816 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
8 1
< 0.1%
78 1
< 0.1%
88 1
< 0.1%
91 1
< 0.1%
111 1
< 0.1%
126 1
< 0.1%
127 1
< 0.1%
128 1
< 0.1%
135 1
< 0.1%
145 1
< 0.1%
ValueCountFrequency (%)
99970 1
< 0.1%
99960 1
< 0.1%
99954 1
< 0.1%
99951 1
< 0.1%
99946 1
< 0.1%
99941 1
< 0.1%
99940 1
< 0.1%
99936 1
< 0.1%
99931 1
< 0.1%
99898 1
< 0.1%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6403 
3597 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6403
64.0%
3597
36.0%

Length

2023-12-16T15:42:16.119451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:42:16.734767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6403
64.0%
3597
36.0%
Distinct354
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-12-01 00:00:00
Maximum2023-11-19 00:00:00
2023-12-16T15:42:17.699220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:42:18.692440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct407
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T15:42:20.034540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.808
Min length3

Characters and Unicode

Total characters98080
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)0.3%

Sample

1st row2023-05-22
2nd row2023-01-05
3rd row2023-09-01
4th row2023-06-26
5th row2023-10-12
ValueCountFrequency (%)
미기재 266
 
2.7%
2023-07-07 61
 
0.6%
2023-06-23 56
 
0.6%
2023-02-17 53
 
0.5%
2023-10-01 53
 
0.5%
2023-01-14 52
 
0.5%
2023-03-12 51
 
0.5%
2023-01-24 50
 
0.5%
2023-02-23 50
 
0.5%
2023-09-08 49
 
0.5%
Other values (397) 9259
92.6%
2023-12-16T15:42:22.838448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 25723
26.2%
0 21982
22.4%
- 19410
19.8%
3 11490
11.7%
1 8072
 
8.2%
7 1933
 
2.0%
8 1855
 
1.9%
9 1766
 
1.8%
6 1716
 
1.7%
4 1711
 
1.7%
Other values (4) 2422
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 77872
79.4%
Dash Punctuation 19410
 
19.8%
Other Letter 798
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 25723
33.0%
0 21982
28.2%
3 11490
14.8%
1 8072
 
10.4%
7 1933
 
2.5%
8 1855
 
2.4%
9 1766
 
2.3%
6 1716
 
2.2%
4 1711
 
2.2%
5 1624
 
2.1%
Other Letter
ValueCountFrequency (%)
266
33.3%
266
33.3%
266
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 19410
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 97282
99.2%
Hangul 798
 
0.8%

Most frequent character per script

Common
ValueCountFrequency (%)
2 25723
26.4%
0 21982
22.6%
- 19410
20.0%
3 11490
11.8%
1 8072
 
8.3%
7 1933
 
2.0%
8 1855
 
1.9%
9 1766
 
1.8%
6 1716
 
1.8%
4 1711
 
1.8%
Hangul
ValueCountFrequency (%)
266
33.3%
266
33.3%
266
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97282
99.2%
Hangul 798
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 25723
26.4%
0 21982
22.6%
- 19410
20.0%
3 11490
11.8%
1 8072
 
8.3%
7 1933
 
2.0%
8 1855
 
1.9%
9 1766
 
1.8%
6 1716
 
1.8%
4 1711
 
1.8%
Hangul
ValueCountFrequency (%)
266
33.3%
266
33.3%
266
33.3%

출발항
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인천공항
7182 
김해공항
1605 
대구공항
 
418
무안국제공항
 
310
청주국제공항
 
152
Other values (6)
 
333

Length

Max length6
Median length4
Mean length4.0869
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row인천공항
2nd row대구공항
3rd row인천공항
4th row무안국제공항
5th row인천공항

Common Values

ValueCountFrequency (%)
인천공항 7182
71.8%
김해공항 1605
 
16.1%
대구공항 418
 
4.2%
무안국제공항 310
 
3.1%
청주국제공항 152
 
1.5%
김포공항 143
 
1.4%
부산항 110
 
1.1%
제주공항 45
 
0.4%
양양국제공항 30
 
0.3%
속초항 4
 
< 0.1%

Length

2023-12-16T15:42:23.709276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천공항 7182
71.8%
김해공항 1605
 
16.1%
대구공항 418
 
4.2%
무안국제공항 310
 
3.1%
청주국제공항 152
 
1.5%
김포공항 143
 
1.4%
부산항 110
 
1.1%
제주공항 45
 
0.4%
양양국제공항 30
 
0.3%
속초항 4
 
< 0.1%

구분
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전화
3778 
신고함
3126 
모바일
1593 
방문
762 
개인
390 
Other values (2)
 
351

Length

Max length5
Median length3
Mean length2.5092
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전화
2nd row개인
3rd row신고함
4th row방문
5th row대행사

Common Values

ValueCountFrequency (%)
전화 3778
37.8%
신고함 3126
31.3%
모바일 1593
15.9%
방문 762
 
7.6%
개인 390
 
3.9%
대행사 340
 
3.4%
여행자협회 11
 
0.1%

Length

2023-12-16T15:42:24.900921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:42:25.784169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전화 3778
37.8%
신고함 3126
31.3%
모바일 1593
15.9%
방문 762
 
7.6%
개인 390
 
3.9%
대행사 340
 
3.4%
여행자협회 11
 
0.1%

Interactions

2023-12-16T15:42:12.401141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T15:42:26.451523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번성별출발항구분
순번1.0000.0000.1190.154
성별0.0001.0000.0000.056
출발항0.1190.0001.0000.573
구분0.1540.0560.5731.000
2023-12-16T15:42:27.357567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분성별출발항
구분1.0000.0600.328
성별0.0601.0000.000
출발항0.3280.0001.000
2023-12-16T15:42:28.136671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번성별출발항구분
순번1.0000.0000.0510.078
성별0.0001.0000.0000.060
출발항0.0510.0001.0000.328
구분0.0780.0600.3281.000

Missing values

2023-12-16T15:42:13.045549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:42:13.576520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번성별출국일시귀국예정일출발항구분
48395483962023-05-182023-05-22인천공항전화
814381442023-01-022023-01-05대구공항개인
77106771072023-08-292023-09-01인천공항신고함
57371573722023-06-232023-06-26무안국제공항방문
86740867412023-10-042023-10-12인천공항대행사
90193901942023-10-162023-10-20인천공항신고함
37302373032023-04-012023-04-04대구공항모바일
35178351792023-03-242023-03-28인천공항전화
13652136532023-01-212023-01-24인천공항신고함
39362393632023-04-102023-04-14김포공항개인
순번성별출국일시귀국예정일출발항구분
52307523082023-06-042023-06-08김해공항신고함
89060890612023-10-122023-10-16인천공항전화
16528165292023-01-302023-02-08인천공항전화
39293392942023-04-092023-04-21인천공항모바일
19437194382023-02-062023-02-09인천공항방문
61262612632023-07-042023-07-07인천공항대행사
66754667552023-07-232023-08-02인천공항모바일
92509925102023-10-252023-10-28청주국제공항신고함
25611256122023-02-222023-02-26인천공항전화
30834308352023-03-112023-03-16무안국제공항방문