Overview

Dataset statistics

Number of variables4
Number of observations280
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.7 KiB
Average record size in memory35.5 B

Variable types

Categorical2
Numeric2

Dataset

Description월별 전자여행허가(K-ETA)를 신청하는 외국인의 국내 입국목적(여행목적)별 현황을 제공(입국목적: 관광, 방문, 사업, 질병치료, 회의, 각종행사, 스포츠경기, 기타 등)
Author법무부
URLhttps://www.data.go.kr/data/15099999/fileData.do

Reproduction

Analysis started2024-04-29 22:57:56.638447
Analysis finished2024-04-29 22:57:58.496739
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Categorical

Distinct4
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2022
96 
2023
96 
2021
64 
2024
24 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2022 96
34.3%
2023 96
34.3%
2021 64
22.9%
2024 24
 
8.6%

Length

2024-04-30T07:57:58.567525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:57:58.682699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 96
34.3%
2023 96
34.3%
2021 64
22.9%
2024 24
 
8.6%


Real number (ℝ)

Distinct12
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5714286
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-04-30T07:57:58.794052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.4809113
Coefficient of variation (CV)0.52970389
Kurtosis-1.2194777
Mean6.5714286
Median Absolute Deviation (MAD)3
Skewness-0.051312921
Sum1840
Variance12.116743
MonotonicityNot monotonic
2024-04-30T07:57:58.899175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
5 24
8.6%
6 24
8.6%
7 24
8.6%
8 24
8.6%
9 24
8.6%
10 24
8.6%
11 24
8.6%
12 24
8.6%
1 24
8.6%
2 24
8.6%
Other values (2) 40
14.3%
ValueCountFrequency (%)
1 24
8.6%
2 24
8.6%
3 24
8.6%
4 16
5.7%
5 24
8.6%
6 24
8.6%
7 24
8.6%
8 24
8.6%
9 24
8.6%
10 24
8.6%
ValueCountFrequency (%)
12 24
8.6%
11 24
8.6%
10 24
8.6%
9 24
8.6%
8 24
8.6%
7 24
8.6%
6 24
8.6%
5 24
8.6%
4 16
5.7%
3 24
8.6%

입국목적
Categorical

Distinct11
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
관광
35 
방문
35 
사업
35 
회의
35 
기타
35 
Other values (6)
105 

Length

Max length6
Median length2
Mean length2.85
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row관광
2nd row방문
3rd row사업
4th row질병치료
5th row회의

Common Values

ValueCountFrequency (%)
관광 35
12.5%
방문 35
12.5%
사업 35
12.5%
회의 35
12.5%
기타 35
12.5%
각종행사 34
12.1%
질병치료 25
8.9%
스포츠경기 20
7.1%
스포츠 경기 15
5.4%
질병 10
 
3.6%

Length

2024-04-30T07:57:59.033839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
관광 35
11.9%
방문 35
11.9%
사업 35
11.9%
회의 35
11.9%
기타 35
11.9%
각종행사 34
11.5%
질병치료 25
8.5%
스포츠경기 20
6.8%
스포츠 15
5.1%
경기 15
5.1%
Other values (2) 11
 
3.7%

신청수
Real number (ℝ)

Distinct271
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16273.857
Minimum0
Maximum338966
Zeros2
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-04-30T07:57:59.189104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile30.85
Q1827.5
median2351.5
Q37953.5
95-th percentile83135.75
Maximum338966
Range338966
Interquartile range (IQR)7126

Descriptive statistics

Standard deviation40075.793
Coefficient of variation (CV)2.4625872
Kurtosis23.221062
Mean16273.857
Median Absolute Deviation (MAD)2177
Skewness4.3226175
Sum4556680
Variance1.6060692 × 109
MonotonicityNot monotonic
2024-04-30T07:57:59.326864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18 2
 
0.7%
1384 2
 
0.7%
277 2
 
0.7%
869 2
 
0.7%
0 2
 
0.7%
123 2
 
0.7%
1340 2
 
0.7%
83 2
 
0.7%
24 2
 
0.7%
2610 1
 
0.4%
Other values (261) 261
93.2%
ValueCountFrequency (%)
0 2
0.7%
4 1
0.4%
6 1
0.4%
11 1
0.4%
16 1
0.4%
18 2
0.7%
19 1
0.4%
20 1
0.4%
21 1
0.4%
24 2
0.7%
ValueCountFrequency (%)
338966 1
0.4%
236326 1
0.4%
214695 1
0.4%
197071 1
0.4%
195027 1
0.4%
191857 1
0.4%
132308 1
0.4%
127883 1
0.4%
101696 1
0.4%
95637 1
0.4%

Interactions

2024-04-30T07:57:58.172879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:57:57.800290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:57:58.260817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:57:57.934703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:57:59.418737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입국목적신청수
1.0000.4730.2990.139
0.4731.0000.0000.000
입국목적0.2990.0001.0000.453
신청수0.1390.0000.4531.000
2024-04-30T07:57:59.513357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입국목적
1.0000.182
입국목적0.1821.000
2024-04-30T07:57:59.594575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청수입국목적
1.0000.0070.2980.000
신청수0.0071.0000.0950.241
0.2980.0951.0000.182
입국목적0.0000.2410.1821.000

Missing values

2024-04-30T07:57:58.372695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:57:58.449319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

입국목적신청수
020215관광457
120215방문3230
220215사업157
320215질병치료28
420215회의19
520215각종행사16
620215스포츠경기0
720215기타188
820216관광239
920216방문2552
입국목적신청수
27020242스포츠 경기277
27120242기타3186
27220243관광73395
27320243방문10805
27420243사업4057
27520243질병1340
27620243회의1384
27720243각종행사1363
27820243스포츠 경기452
27920243기타4005