Overview

Dataset statistics

Number of variables: 10
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.4 KiB
Average record size in memory: 86.3 B

Variable types

Categorical: 7
Numeric: 2
Text: 1

Alerts

sido_nm has constant value "강원도" (Constant)
FILE_NAME has constant value "KC_618_LLR_RSTRT_CNBAS_TRND_2019" (Constant)
base_ymd has constant value "20200214" (Constant)
residnt_cnt_sum is highly overall correlated with sgg_nm and 1 other field (High correlation)
hadm_cd is highly overall correlated with sgg_nm and 1 other field (High correlation)
sgg_nm is highly overall correlated with hadm_cd and 1 other field (High correlation)
ksic_cd is highly overall correlated with cate_nm (High correlation)
cate_nm is highly overall correlated with ksic_cd (High correlation)
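
The Constant alerts correspond to columns with exactly one distinct value. A minimal sketch of that check, using value counts copied from the variable sections of this report; the helper name `constant_columns` is hypothetical, and this is not presented as ydata-profiling's own implementation:

```python
# Value counts copied from the per-variable sections of this report.
value_counts = {
    "sido_nm": {"강원도": 100},
    "sgg_nm": {"강릉시": 54, "고성군": 29, "동해시": 17},
    "FILE_NAME": {"KC_618_LLR_RSTRT_CNBAS_TRND_2019": 100},
    "base_ymd": {"20200214": 100},
}


def constant_columns(counts_by_column):
    """Return the columns that have exactly one distinct value (hypothetical helper)."""
    return [name for name, counts in counts_by_column.items() if len(counts) == 1]


print(constant_columns(value_counts))
```

Columns flagged this way carry no information for modelling within this 100-row extract, so they are natural candidates to drop before further analysis.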

Reproduction

Analysis started: 2023-12-10 10:08:21.586507
Analysis finished: 2023-12-10 10:08:24.654872
Duration: 3.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

sido_nm
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
강원도: 100

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강원도
2nd row: 강원도
3rd row: 강원도
4th row: 강원도
5th row: 강원도

Common Values

Value     Count   Frequency (%)
강원도    100     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value     Count   Frequency (%)
강원도    100     100.0%

sgg_nm
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
강릉시: 54
고성군: 29
동해시: 17

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강릉시
2nd row: 강릉시
3rd row: 강릉시
4th row: 강릉시
5th row: 강릉시

Common Values

Value     Count   Frequency (%)
강릉시    54      54.0%
고성군    29      29.0%
동해시    17      17.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value     Count   Frequency (%)
강릉시    54      54.0%
고성군    29      29.0%
동해시    17      17.0%

hadm_cd
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
42150: 54
42820: 29
42170: 17

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 42150
2nd row: 42150
3rd row: 42150
4th row: 42150
5th row: 42150

Common Values

Value    Count   Frequency (%)
42150    54      54.0%
42820    29      29.0%
42170    17      17.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value    Count   Frequency (%)
42150    54      54.0%
42820    29      29.0%
42170    17      17.0%

ksic_cd
Real number (ℝ)

HIGH CORRELATION 

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56135.51
Minimum: 56111
Maximum: 56194
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 56111
5-th percentile: 56111
Q1: 56111
Median: 56113
Q3: 56192
95-th percentile: 56193
Maximum: 56194
Range: 83
Interquartile range (IQR): 81

Descriptive statistics

Standard deviation: 35.975158
Coefficient of variation (CV): 0.00064086277
Kurtosis: -1.0542799
Mean: 56135.51
Median Absolute Deviation (MAD): 2
Skewness: 0.96328179
Sum: 5613551
Variance: 1294.212
Monotonicity: Not monotonic
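
Because ksic_cd has only eight distinct values, the whole column can be rebuilt from its value counts and several of the statistics above re-derived. The sketch below uses only the Python standard library; treating the variance as the sample (n - 1) variance and the quartiles as linearly interpolated are assumptions made for this illustration, chosen because they reproduce the reported figures.

```python
import statistics

# Value counts for ksic_cd, copied from this report (they sum to 100 rows).
ksic_counts = {56111: 46, 56113: 8, 56114: 4, 56119: 12,
               56132: 2, 56192: 13, 56193: 13, 56194: 2}

# Expand the counts back into a sorted column of 100 values.
values = sorted(v for v, n in ksic_counts.items() for _ in range(n))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1)
std = statistics.stdev(values)           # sample standard deviation
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# MAD: median of the absolute deviations from the median.
mad = statistics.median(abs(v - median) for v in values)
# "inclusive" quartiles interpolate linearly between data points.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(mean, round(variance, 3), round(std, 6), cv, median, mad, q3 - q1)
```

The tiny coefficient of variation reflects that ksic_cd is an industry classification code stored as a number, so arithmetic summaries of it say little; the value counts are the more meaningful view.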
[Figure: Histogram with fixed size bins (bins=8)]

Common values

Value    Count   Frequency (%)
56111    46      46.0%
56192    13      13.0%
56193    13      13.0%
56119    12      12.0%
56113    8       8.0%
56114    4       4.0%
56132    2       2.0%
56194    2       2.0%

Minimum values

Value    Count   Frequency (%)
56111    46      46.0%
56113    8       8.0%
56114    4       4.0%
56119    12      12.0%
56132    2       2.0%
56192    13      13.0%
56193    13      13.0%
56194    2       2.0%

Maximum values

Value    Count   Frequency (%)
56194    2       2.0%
56193    13      13.0%
56192    13      13.0%
56132    2       2.0%
56119    12      12.0%
56114    4       4.0%
56113    8       8.0%
56111    46      46.0%

cate_nm
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
한식: 46
피자,햄버거,샌드위치 및 유사 읍식점업: 13
치킨 전문점: 13
기타 외국식: 12
일식: 8
Other values (3): 8

Length

Max length: 21
Median length: 2
Mean length: 5.77
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한식
2nd row: 한식
3rd row: 한식
4th row: 한식
5th row: 한식

Common Values

Value                                    Count   Frequency (%)
한식                                     46      46.0%
피자,햄버거,샌드위치 및 유사 읍식점업    13      13.0%
치킨 전문점                              13      13.0%
기타 외국식                              12      12.0%
일식                                     8       8.0%
서양식                                   4       4.0%
이동 음식업                              2       2.0%
분식 및 김밥 전문점                      2       2.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value                     Count   Frequency (%)
한식                      46      26.7%
[value lost]              15      8.7%
전문점                    15      8.7%
피자,햄버거,샌드위치      13      7.6%
유사                      13      7.6%
읍식점업                  13      7.6%
치킨                      13      7.6%
기타                      12      7.0%
외국식                    12      7.0%
일식                      8       4.7%
Other values (5)          12      7.0%

mcate_cd
Text

Distinct: 55
Distinct (%): 55.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 12
Median length: 6
Mean length: 4.34
Min length: 2

Characters and Unicode

Total characters: 434
Distinct characters: 130
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
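
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module for the five sample values of this text variable listed in its Sample subsection. It is only a sketch of the idea, not the profiler's own code; scripts and blocks are not exposed by `unicodedata` and are left out.

```python
import unicodedata
from collections import Counter

# The five sample values listed for the text variable in this report.
samples = ["벌집삼겹살", "몸보신", "한정식", "생고기/등심", "국수전문"]

# Count characters by Unicode general category
# ("Lo" = Other Letter, "Po" = Other Punctuation).
categories = Counter(unicodedata.category(ch) for s in samples for ch in s)

print(categories)
```

Hangul syllables fall under Other Letter and the slash under Other Punctuation, which matches the two largest categories reported for this variable.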

Unique

Unique: 22
Unique (%): 22.0%

Sample

1st row: 벌집삼겹살
2nd row: 몸보신
3rd row: 한정식
4th row: 생고기/등심
5th row: 국수전문

Value                Count   Frequency (%)
갈비/불고기          3       3.0%
삼겹살               3       3.0%
일반한식             3       3.0%
국수전문             3       3.0%
족발/보쌈            3       3.0%
보양식               3       3.0%
생고기/등심          3       3.0%
냉면                 3       3.0%
찌개/탕전문          3       3.0%
곱창/양구이          3       3.0%
Other values (45)    70      70.0%

Most occurring characters

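Value                 Count   Frequency (%)
/                     22      5.1%
[character lost]      19      4.4%
[character lost]      15      3.5%
[character lost]      14      3.2%
[character lost]      12      2.8%
[character lost]      12      2.8%
[character lost]      9       2.1%
[character lost]      8       1.8%
[character lost]      8       1.8%
[character lost]      8       1.8%
Other values (120)    307     70.7%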

Most occurring categories

Value                Count   Frequency (%)
Other Letter         395     91.0%
Other Punctuation    22      5.1%
Uppercase Letter     9       2.1%
Decimal Number       4       0.9%
Close Punctuation    2       0.5%
Open Punctuation     2       0.5%

Most frequent character per category

Other Letter
Value                 Count   Frequency (%)
[character lost]      19      4.8%
[character lost]      15      3.8%
[character lost]      14      3.5%
[character lost]      12      3.0%
[character lost]      12      3.0%
[character lost]      9       2.3%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      7       1.8%
Other values (110)    283     71.6%

Uppercase Letter
Value    Count   Frequency (%)
B        4       44.4%
Q        2       22.2%
C        1       11.1%
F        1       11.1%
K        1       11.1%

Decimal Number
Value    Count   Frequency (%)
2        2       50.0%
4        2       50.0%

Other Punctuation
Value    Count   Frequency (%)
/        22      100.0%

Close Punctuation
Value    Count   Frequency (%)
)        2       100.0%

Open Punctuation
Value    Count   Frequency (%)
(        2       100.0%

Most occurring scripts

Value     Count   Frequency (%)
Hangul    395     91.0%
Common    30      6.9%
Latin     9       2.1%

Most frequent character per script

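Hangul
Value                 Count   Frequency (%)
[character lost]      19      4.8%
[character lost]      15      3.8%
[character lost]      14      3.5%
[character lost]      12      3.0%
[character lost]      12      3.0%
[character lost]      9       2.3%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      7       1.8%
Other values (110)    283     71.6%

Common
Value    Count   Frequency (%)
/        22      73.3%
2        2       6.7%
)        2       6.7%
4        2       6.7%
(        2       6.7%

Latin
Value    Count   Frequency (%)
B        4       44.4%
Q        2       22.2%
C        1       11.1%
F        1       11.1%
K        1       11.1%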

Most occurring blocks

Value     Count   Frequency (%)
Hangul    395     91.0%
ASCII     39      9.0%

Most frequent character per block

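ASCII
Value    Count   Frequency (%)
/        22      56.4%
B        4       10.3%
2        2       5.1%
Q        2       5.1%
)        2       5.1%
4        2       5.1%
(        2       5.1%
C        1       2.6%
F        1       2.6%
K        1       2.6%

Hangul
Value                 Count   Frequency (%)
[character lost]      19      4.8%
[character lost]      15      3.8%
[character lost]      14      3.5%
[character lost]      12      3.0%
[character lost]      12      3.0%
[character lost]      9       2.3%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      8       2.0%
[character lost]      7       1.8%
Other values (110)    283     71.6%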

facility_cnt
Real number (ℝ)

Distinct: 45
Distinct (%): 45.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 67.06
Minimum: 1
Maximum: 2027
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2.75
Median: 9.5
Q3: 38.25
95-th percentile: 275.1
Maximum: 2027
Range: 2026
Interquartile range (IQR): 35.5

Descriptive statistics

Standard deviation: 229.98972
Coefficient of variation (CV): 3.429611
Kurtosis: 55.436188
Mean: 67.06
Median Absolute Deviation (MAD): 8.5
Skewness: 6.9604355
Sum: 6706
Variance: 52895.269
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=45)]

Common values

Value                Count   Frequency (%)
1                    18      18.0%
2                    7       7.0%
5                    6       6.0%
6                    5       5.0%
11                   5       5.0%
8                    5       5.0%
13                   3       3.0%
4                    3       3.0%
3                    3       3.0%
14                   3       3.0%
Other values (35)    42      42.0%

Minimum 10 values

Value    Count   Frequency (%)
1        18      18.0%
2        7       7.0%
3        3       3.0%
4        3       3.0%
5        6       6.0%
6        5       5.0%
7        2       2.0%
8        5       5.0%
9        1       1.0%
10       1       1.0%

Maximum 10 values

Value    Count   Frequency (%)
2027     1       1.0%
878      1       1.0%
553      1       1.0%
345      1       1.0%
334      1       1.0%
272      1       1.0%
206      1       1.0%
176      1       1.0%
168      1       1.0%
127      1       1.0%
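
The gap between the mean (67.06) and the median (9.5) suggests that a few large rows dominate facility_cnt. A short sketch that quantifies this from figures already in this report (the ten largest values and the column sum); the variable names are illustrative only:

```python
# The ten largest facility_cnt values and the column sum, copied from this report.
largest = [2027, 878, 553, 345, 334, 272, 206, 176, 168, 127]
total = 6706

share_top1 = largest[0] / total        # share of the single largest row
share_top10 = sum(largest) / total     # share of the ten largest rows

print(f"largest row: {share_top1:.1%}, ten largest rows: {share_top10:.1%}")
```

This prints `largest row: 30.2%, ten largest rows: 75.8%`, so ten of the 100 rows account for roughly three quarters of the total, consistent with the high skewness and kurtosis reported above.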

residnt_cnt_sum
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
162905: 54
18628: 29
68772: 17

Length

Max length: 6
Median length: 6
Mean length: 5.54
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 162905
2nd row: 162905
3rd row: 162905
4th row: 162905
5th row: 162905

Common Values

Value     Count   Frequency (%)
162905    54      54.0%
18628     29      29.0%
68772     17      17.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value     Count   Frequency (%)
162905    54      54.0%
18628     29      29.0%
68772     17      17.0%

FILE_NAME
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
KC_618_LLR_RSTRT_CNBAS_TRND_2019: 100

Length

Max length: 32
Median length: 32
Mean length: 32
Min length: 32

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: KC_618_LLR_RSTRT_CNBAS_TRND_2019
2nd row: KC_618_LLR_RSTRT_CNBAS_TRND_2019
3rd row: KC_618_LLR_RSTRT_CNBAS_TRND_2019
4th row: KC_618_LLR_RSTRT_CNBAS_TRND_2019
5th row: KC_618_LLR_RSTRT_CNBAS_TRND_2019

Common Values

Value                               Count   Frequency (%)
KC_618_LLR_RSTRT_CNBAS_TRND_2019    100     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value                               Count   Frequency (%)
kc_618_llr_rstrt_cnbas_trnd_2019    100     100.0%

base_ymd
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
20200214: 100

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20200214
2nd row: 20200214
3rd row: 20200214
4th row: 20200214
5th row: 20200214

Common Values

Value       Count   Frequency (%)
20200214    100     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
Value       Count   Frequency (%)
20200214    100     100.0%

Interactions

[Figures: 4 interaction plots]

Correlations

[Figure: correlation matrix plot]

                  sgg_nm   hadm_cd  ksic_cd  cate_nm  mcate_cd  facility_cnt  residnt_cnt_sum
sgg_nm            1.000    1.000    0.000    0.409    0.000     0.000         1.000
hadm_cd           1.000    1.000    0.000    0.409    0.000     0.000         1.000
ksic_cd           0.000    0.000    1.000    1.000    1.000     0.000         0.000
cate_nm           0.409    0.409    1.000    1.000    1.000     0.075         0.409
mcate_cd          0.000    0.000    1.000    1.000    1.000     0.000         0.000
facility_cnt      0.000    0.000    0.000    0.075    0.000     1.000         0.000
residnt_cnt_sum   1.000    1.000    0.000    0.409    0.000     0.000         1.000

[Figure: correlation matrix plot]

                  residnt_cnt_sum  hadm_cd  sgg_nm   cate_nm
residnt_cnt_sum   1.000            1.000    1.000    0.277
hadm_cd           1.000            1.000    1.000    0.277
sgg_nm            1.000            1.000    1.000    0.277
cate_nm           0.277            0.277    0.277    1.000

[Figure: correlation matrix plot]

                  ksic_cd  facility_cnt  sgg_nm   hadm_cd  cate_nm  residnt_cnt_sum
ksic_cd           1.000    -0.241        0.160    0.160    0.974    0.160
facility_cnt      -0.241   1.000         0.000    0.000    0.036    0.000
sgg_nm            0.160    0.000         1.000    1.000    0.277    1.000
hadm_cd           0.160    0.000         1.000    1.000    0.277    1.000
cate_nm           0.974    0.036         0.277    0.277    1.000    0.277
residnt_cnt_sum   0.160    0.000         1.000    1.000    0.277    1.000
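
The 1.000 entries among sgg_nm, hadm_cd and residnt_cnt_sum are what a categorical association measure yields when each value of one column maps to exactly one value of the other. The sketch below computes a plain (uncorrected) Cramér's V for sgg_nm against hadm_cd. Two assumptions are made: the one-to-one pairing of the three values is inferred from the matching counts (54 / 29 / 17) and the sample rows, and the report's matrices may use a different or bias-corrected variant of the measure.

```python
import math

# Joint counts of sgg_nm (rows) by hadm_cd (columns). The one-to-one pairing
# is an assumption inferred from the matching counts 54 / 29 / 17.
table = [
    [54, 0, 0],   # 강릉시 with 42150
    [0, 29, 0],   # 고성군 with 42820
    [0, 0, 17],   # 동해시 with 42170
]


def cramers_v(observed):
    """Plain (uncorrected) Cramér's V for a contingency table."""
    n = sum(map(sum, observed))
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (obs - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


print(round(cramers_v(table), 3))
```

A perfectly diagonal table gives a value of 1 (up to floating-point rounding), which is why such columns are redundant with one another: keeping any one of the three preserves the same grouping of rows.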

Missing values

[Figure: nullity by column]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

     sido_nm  sgg_nm  hadm_cd  ksic_cd  cate_nm  mcate_cd      facility_cnt  residnt_cnt_sum  FILE_NAME                         base_ymd
0    강원도   강릉시  42150    56111    한식     벌집삼겹살    2             162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
1    강원도   강릉시  42150    56111    한식     몸보신        38            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
2    강원도   강릉시  42150    56111    한식     한정식        19            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
3    강원도   강릉시  42150    56111    한식     생고기/등심   98            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
4    강원도   강릉시  42150    56111    한식     국수전문      176           162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
5    강원도   강릉시  42150    56111    한식     족발/보쌈     78            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
6    강원도   강릉시  42150    56111    한식     보양식        15            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
7    강원도   강릉시  42150    56111    한식     삼겹살        80            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
8    강원도   강릉시  42150    56111    한식     죽전문        7             162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
9    강원도   강릉시  42150    56111    한식     찌개/탕전문   88            162905           KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214

Last rows

     sido_nm  sgg_nm  hadm_cd  ksic_cd  cate_nm  mcate_cd      facility_cnt  residnt_cnt_sum  FILE_NAME                         base_ymd
90   강원도   동해시  42170    56111    한식     한정식        8             68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
91   강원도   동해시  42170    56111    한식     생고기/등심   32            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
92   강원도   동해시  42170    56111    한식     몸보신        14            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
93   강원도   동해시  42170    56111    한식     족발/보쌈     39            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
94   강원도   동해시  42170    56111    한식     원할머니보쌈  1             68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
95   강원도   동해시  42170    56111    한식     찌개/탕전문   34            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
96   강원도   동해시  42170    56111    한식     곱창/양구이   31            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
97   강원도   동해시  42170    56111    한식     일반한식      878           68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
98   강원도   동해시  42170    56111    한식     국수전문      73            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214
99   강원도   동해시  42170    56111    한식     냉면          11            68772            KC_618_LLR_RSTRT_CNBAS_TRND_2019  20200214