Overview

Dataset statistics

Number of variables4
Number of observations22
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory902.0 B
Average record size in memory41.0 B

Variable types

Text1
Numeric3

Dataset

Description태권도원 방문객 현황으로 기간, 국내 방문객 수, 외국인 방문객수 등의 정보를 제공합니다.
Author태권도진흥재단
URLhttps://www.data.go.kr/data/15038131/fileData.do

Alerts

내국인 방문객 is highly overall correlated with 외국인 방문객 and 1 other fieldsHigh correlation
외국인 방문객 is highly overall correlated with 내국인 방문객 and 1 other fieldsHigh correlation
is highly overall correlated with 내국인 방문객 and 1 other fieldsHigh correlation
기간 has unique valuesUnique
내국인 방문객 has unique valuesUnique
has unique valuesUnique
내국인 방문객 has 1 (4.5%) zerosZeros
외국인 방문객 has 4 (18.2%) zerosZeros
has 1 (4.5%) zerosZeros

Reproduction

Analysis started2023-12-12 06:25:29.528537
Analysis finished2023-12-12 06:25:30.614492
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기간
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-12T15:25:30.752562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters132
Distinct characters27
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st rowJan-19
2nd rowFeb-19
3rd rowMar-19
4th rowApr-19
5th rowMay-19
ValueCountFrequency (%)
jan-19 1
 
4.5%
feb-19 1
 
4.5%
sep-20 1
 
4.5%
aug-20 1
 
4.5%
jul-20 1
 
4.5%
jun-20 1
 
4.5%
may-20 1
 
4.5%
apr-20 1
 
4.5%
mar-20 1
 
4.5%
feb-20 1
 
4.5%
Other values (12) 12
54.5%
2023-12-12T15:25:31.130140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 22
16.7%
1 12
 
9.1%
9 12
 
9.1%
0 10
 
7.6%
2 10
 
7.6%
a 6
 
4.5%
u 6
 
4.5%
J 6
 
4.5%
e 5
 
3.8%
r 4
 
3.0%
Other values (17) 39
29.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44
33.3%
Lowercase Letter 44
33.3%
Dash Punctuation 22
16.7%
Uppercase Letter 22
16.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 6
13.6%
u 6
13.6%
e 5
11.4%
r 4
9.1%
p 4
9.1%
n 4
9.1%
c 3
6.8%
b 2
 
4.5%
t 2
 
4.5%
g 2
 
4.5%
Other values (4) 6
13.6%
Uppercase Letter
ValueCountFrequency (%)
J 6
27.3%
A 4
18.2%
M 4
18.2%
O 2
 
9.1%
S 2
 
9.1%
F 2
 
9.1%
N 1
 
4.5%
D 1
 
4.5%
Decimal Number
ValueCountFrequency (%)
1 12
27.3%
9 12
27.3%
0 10
22.7%
2 10
22.7%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 66
50.0%
Latin 66
50.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 6
 
9.1%
u 6
 
9.1%
J 6
 
9.1%
e 5
 
7.6%
r 4
 
6.1%
A 4
 
6.1%
p 4
 
6.1%
n 4
 
6.1%
M 4
 
6.1%
c 3
 
4.5%
Other values (12) 20
30.3%
Common
ValueCountFrequency (%)
- 22
33.3%
1 12
18.2%
9 12
18.2%
0 10
15.2%
2 10
15.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 22
16.7%
1 12
 
9.1%
9 12
 
9.1%
0 10
 
7.6%
2 10
 
7.6%
a 6
 
4.5%
u 6
 
4.5%
J 6
 
4.5%
e 5
 
3.8%
r 4
 
3.0%
Other values (17) 39
29.5%

내국인 방문객
Real number (ℝ)

HIGH CORRELATION  UNIQUE  ZEROS 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15487.227
Minimum0
Maximum52944
Zeros1
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-12T15:25:31.262832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile369.6
Q14934.5
median14205.5
Q319479
95-th percentile46512.8
Maximum52944
Range52944
Interquartile range (IQR)14544.5

Descriptive statistics

Standard deviation13789.633
Coefficient of variation (CV)0.89038748
Kurtosis2.2636707
Mean15487.227
Median Absolute Deviation (MAD)7844
Skewness1.4184517
Sum340719
Variance1.9015398 × 108
MonotonicityNot monotonic
2023-12-12T15:25:31.396027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
18408 1
 
4.5%
14392 1
 
4.5%
3365 1
 
4.5%
1065 1
 
4.5%
15810 1
 
4.5%
7151 1
 
4.5%
4722 1
 
4.5%
2826 1
 
4.5%
0 1
 
4.5%
333 1
 
4.5%
Other values (12) 12
54.5%
ValueCountFrequency (%)
0 1
4.5%
333 1
4.5%
1065 1
4.5%
2826 1
4.5%
3365 1
4.5%
4722 1
4.5%
5572 1
4.5%
7151 1
4.5%
11498 1
4.5%
13723 1
4.5%
ValueCountFrequency (%)
52944 1
4.5%
47533 1
4.5%
27129 1
4.5%
23537 1
4.5%
21233 1
4.5%
19814 1
4.5%
18474 1
4.5%
18408 1
4.5%
17171 1
4.5%
15810 1
4.5%

외국인 방문객
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1412.2273
Minimum0
Maximum8694
Zeros4
Zeros (%)18.2%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-12T15:25:31.564838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.5
median274.5
Q31555.5
95-th percentile6459.4
Maximum8694
Range8694
Interquartile range (IQR)1553

Descriptive statistics

Standard deviation2330.9146
Coefficient of variation (CV)1.6505237
Kurtosis4.30501
Mean1412.2273
Median Absolute Deviation (MAD)274.5
Skewness2.1302203
Sum31069
Variance5433162.9
MonotonicityNot monotonic
2023-12-12T15:25:31.705449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 4
18.2%
1 2
 
9.1%
571 1
 
4.5%
2764 1
 
4.5%
25 1
 
4.5%
7 1
 
4.5%
102 1
 
4.5%
36 1
 
4.5%
523 1
 
4.5%
1368 1
 
4.5%
Other values (8) 8
36.4%
ValueCountFrequency (%)
0 4
18.2%
1 2
9.1%
7 1
 
4.5%
25 1
 
4.5%
36 1
 
4.5%
92 1
 
4.5%
102 1
 
4.5%
447 1
 
4.5%
523 1
 
4.5%
571 1
 
4.5%
ValueCountFrequency (%)
8694 1
4.5%
6611 1
4.5%
3579 1
4.5%
3365 1
4.5%
2764 1
4.5%
1618 1
4.5%
1368 1
4.5%
1265 1
4.5%
571 1
4.5%
523 1
4.5%


Real number (ℝ)

HIGH CORRELATION  UNIQUE  ZEROS 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16899.455
Minimum0
Maximum54144
Zeros1
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-12T15:25:31.845052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile369.6
Q14965.25
median16556
Q322183
95-th percentile52146.3
Maximum54144
Range54144
Interquartile range (IQR)17217.75

Descriptive statistics

Standard deviation14903.9
Coefficient of variation (CV)0.88191603
Kurtosis1.7466534
Mean16899.455
Median Absolute Deviation (MAD)9344.5
Skewness1.2736473
Sum371788
Variance2.2212623 × 108
MonotonicityNot monotonic
2023-12-12T15:25:32.032819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
18979 1
 
4.5%
14428 1
 
4.5%
3366 1
 
4.5%
1065 1
 
4.5%
15810 1
 
4.5%
7176 1
 
4.5%
4729 1
 
4.5%
2827 1
 
4.5%
0 1
 
4.5%
333 1
 
4.5%
Other values (12) 12
54.5%
ValueCountFrequency (%)
0 1
4.5%
333 1
4.5%
1065 1
4.5%
2827 1
4.5%
3366 1
4.5%
4729 1
4.5%
5674 1
4.5%
7176 1
4.5%
11590 1
4.5%
14428 1
4.5%
ValueCountFrequency (%)
54144 1
4.5%
53391 1
4.5%
28497 1
4.5%
25865 1
4.5%
25155 1
4.5%
22498 1
4.5%
21238 1
4.5%
20337 1
4.5%
18979 1
4.5%
17384 1
4.5%

Interactions

2023-12-12T15:25:30.205896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:29.637110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:29.939206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:30.298432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:29.763817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:30.023551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:30.383318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:29.842830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:30.101872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:25:32.150565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기간내국인 방문객외국인 방문객
기간1.0001.0001.0001.000
내국인 방문객1.0001.0000.7320.946
외국인 방문객1.0000.7321.0000.637
1.0000.9460.6371.000
2023-12-12T15:25:32.575111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
내국인 방문객외국인 방문객
내국인 방문객1.0000.7050.972
외국인 방문객0.7051.0000.821
0.9720.8211.000

Missing values

2023-12-12T15:25:30.488289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:25:30.577074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기간내국인 방문객외국인 방문객
0Jan-191840857118979
1Feb-19114989211590
2Mar-1914019336517384
3Apr-1921233126522498
4May-195294444753391
5Jun-1913723357917302
6Jul-1917171869425865
7Aug-1947533661154144
8Sep-1923537161825155
9Oct-1927129136828497
기간내국인 방문객외국인 방문객
12Jan-20143923614428
13Feb-2055721025674
14Mar-203330333
15Apr-20000
16May-20282612827
17Jun-20472274729
18Jul-207151257176
19Aug-2015810015810
20Sep-20106501065
21Oct-20336513366