Overview

Dataset statistics

Number of variables: 12
Number of observations: 523
Missing cells: 2092
Missing cells (%): 33.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 53.2 KiB
Average record size in memory: 104.3 B
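
The missing-cell percentage follows directly from the counts in this block. The snippet below is a small arithmetic check, not part of the profiling tool; it assumes that all missing cells belong to columns that are empty in every row, which matches the four 100% missing columns listed under Alerts.

```python
# Figures copied from the Dataset statistics block.
n_variables = 12
n_observations = 523
missing_cells = 2092

total_cells = n_variables * n_observations        # 12 * 523 = 6276
missing_pct = 100 * missing_cells / total_cells

# Assumption: every missing cell sits in a column that is empty for all rows,
# so the number of such columns is missing_cells / n_observations.
fully_missing_columns = missing_cells // n_observations

print(f"{missing_pct:.1f}%")       # 33.3%
print(fully_missing_columns)       # 4
```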

Variable types

Categorical: 6
Numeric: 1
Text: 1
Unsupported: 4

Dataset

Description: Sample
Author: 주식회사 여기어때컴퍼니
URL: https://www.bigdata-finance.kr/dataset/datasetView.do?datastId=SET0400005

Alerts

기준년월 has constant value "202108" (Constant)
항목설명내용 has constant value "Food" (Constant)
언어명 has constant value "Kor" (Constant)
마지막수정일자 has constant value "20210801" (Constant)
대상기준년월 has constant value "202108" (Constant)
음식식당사전ID is highly overall correlated with 항목소분류명1 (High correlation)
항목소분류명1 is highly overall correlated with 음식식당사전ID (High correlation)
항목대분류명 has 523 (100.0%) missing values (Missing)
항목소분류명2 has 523 (100.0%) missing values (Missing)
저화질이미지URL has 523 (100.0%) missing values (Missing)
태깅내용 has 523 (100.0%) missing values (Missing)
음식식당사전ID has unique values (Unique)
항목대분류명 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
항목소분류명2 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
저화질이미지URL is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
태깅내용 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
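
The Constant, Missing, and Unique alerts can be approximated with a few set operations. The function below is a minimal sketch under simplified rules and is not ydata-profiling's own implementation; `classify_columns` is a hypothetical helper, and the three example rows are taken from the Sample table at the end of this report with most columns omitted.

```python
from typing import Any


def classify_columns(rows: list[dict[str, Any]]) -> dict[str, list[str]]:
    """Label each column as Missing, Constant, or Unique (simplified rules)."""
    alerts: dict[str, list[str]] = {}
    n = len(rows)
    for col in rows[0]:
        present = [r[col] for r in rows if r[col] is not None]
        labels: list[str] = []
        if not present:
            labels.append("Missing")        # every value is missing
        elif len(present) == n and len(set(present)) == 1:
            labels.append("Constant")       # one distinct value, no gaps
        elif len(set(present)) == n:
            labels.append("Unique")         # all values distinct
        alerts[col] = labels
    return alerts


# Rows 0, 1 and 8 of the Sample table, reduced to four columns.
rows = [
    {"기준년월": "202108", "음식식당사전ID": 10001, "항목소분류명1": "Cuisine", "태깅내용": None},
    {"기준년월": "202108", "음식식당사전ID": 10002, "항목소분류명1": "Cuisine", "태깅내용": None},
    {"기준년월": "202108", "음식식당사전ID": 10009, "항목소분류명1": "Subcuisine", "태깅내용": None},
]
alerts = classify_columns(rows)
print(alerts)
```

Columns that are constant or entirely missing carry no information for modelling, so a typical next step would be to drop them before further analysis.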

Reproduction

Analysis started: 2023-12-10 13:13:09.851812
Analysis finished: 2023-12-10 13:13:13.322096
Duration: 3.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
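
The reported duration is the difference between the two timestamps. A quick check with the standard library, using only the values printed in this block:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 13:13:09.851812")
finished = datetime.fromisoformat("2023-12-10 13:13:13.322096")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")   # 3.47 seconds
```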

Variables

기준년월
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 202108
2nd row: 202108
3rd row: 202108
4th row: 202108
5th row: 202108

Common Values

Value | Count | Frequency (%)
202108 | 523 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
202108 | 523 | 100.0%

음식식당사전ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 523
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10262
Minimum: 10001
Maximum: 10523
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.7 KiB

Quantile statistics

Minimum: 10001
5-th percentile: 10027.1
Q1: 10131.5
Median: 10262
Q3: 10392.5
95-th percentile: 10496.9
Maximum: 10523
Range: 522
Interquartile range (IQR): 261

Descriptive statistics

Standard deviation: 151.12136
Coefficient of variation (CV): 0.014726307
Kurtosis: -1.2
Mean: 10262
Median Absolute Deviation (MAD): 131
Skewness: 0
Sum: 5367026
Variance: 22837.667
Monotonicity: Strictly increasing
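
With 523 distinct values, a minimum of 10001, a maximum of 10523, and strict monotonicity, the column looks like a plain running index. The sketch below assumes the values are exactly the contiguous integers 10001 to 10523 (the report does not state this outright) and recomputes several of the statistics with Python's `statistics` module.

```python
import statistics

# Assumption: the IDs are the contiguous integers 10001..10523.
ids = list(range(10001, 10524))

total = sum(ids)
mean = statistics.mean(ids)
variance = statistics.variance(ids)      # sample variance (n - 1 denominator)
std = statistics.stdev(ids)
cv = std / mean                          # coefficient of variation

# method="inclusive" interpolates linearly between the min and max.
q1, median, q3 = statistics.quantiles(ids, n=4, method="inclusive")
iqr = q3 - q1
mad = statistics.median(abs(x - median) for x in ids)

print(total, mean, round(variance, 3), round(std, 5))
print(q1, median, q3, iqr, mad)
```

Under that assumption the results agree with the reported sum, mean, variance, standard deviation, quartiles, IQR, and MAD, which supports reading the column as a sequential identifier rather than a measured quantity.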
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
10001 | 1 | 0.2%
10361 | 1 | 0.2%
10359 | 1 | 0.2%
10358 | 1 | 0.2%
10357 | 1 | 0.2%
10356 | 1 | 0.2%
10355 | 1 | 0.2%
10354 | 1 | 0.2%
10353 | 1 | 0.2%
10352 | 1 | 0.2%
Other values (513) | 513 | 98.1%

Minimum 10 values

Value | Count | Frequency (%)
10001 | 1 | 0.2%
10002 | 1 | 0.2%
10003 | 1 | 0.2%
10004 | 1 | 0.2%
10005 | 1 | 0.2%
10006 | 1 | 0.2%
10007 | 1 | 0.2%
10008 | 1 | 0.2%
10009 | 1 | 0.2%
10010 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
10523 | 1 | 0.2%
10522 | 1 | 0.2%
10521 | 1 | 0.2%
10520 | 1 | 0.2%
10519 | 1 | 0.2%
10518 | 1 | 0.2%
10517 | 1 | 0.2%
10516 | 1 | 0.2%
10515 | 1 | 0.2%
10514 | 1 | 0.2%

항목명
Text

Distinct: 522
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 16
Median length: 15
Mean length: 3.6137667
Min length: 2

Characters and Unicode

Total characters: 1890
Distinct characters: 401
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
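
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below uses one value that appears in the Sample table ("국수 / 면 요리") and counts its characters per category; script and block lookups are not available in the standard library and are left out.

```python
import unicodedata
from collections import Counter

text = "국수 / 면 요리"   # a value of the text column, from the Sample table

# General categories: Lo = Other Letter, Zs = Space Separator,
# Po = Other Punctuation.
categories = Counter(unicodedata.category(ch) for ch in text)
print(categories)

# Code point names indicate where each letter comes from.
first_name = unicodedata.name(text[0])
print(first_name)
```

The three categories found here (Other Letter, Space Separator, Other Punctuation) are the same three that the report lists for the whole column.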

Unique

Unique: 521
Unique (%): 99.6%

Sample

1st row: 양식
2nd row: 일식
3rd row: 중식
4th row: 한식
5th row: 세계음식

Value | Count | Frequency (%)
(not rendered) | 24 | 3.9%
요리 | 7 | 1.1%
퓨전 | 6 | 1.0%
음식 | 6 | 1.0%
기타 | 5 | 0.8%
일식 | 5 | 0.8%
중식 | 5 | 0.8%
한식 | 4 | 0.7%
나는 | 3 | 0.5%
정통 | 3 | 0.5%
Other values (512) | 542 | 88.9%

Most occurring characters

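Value | Count | Frequency (%)
(space) | 87 | 4.6%
(not rendered) | 53 | 2.8%
(not rendered) | 47 | 2.5%
(not rendered) | 40 | 2.1%
(not rendered) | 36 | 1.9%
(not rendered) | 33 | 1.7%
(not rendered) | 32 | 1.7%
(not rendered) | 32 | 1.7%
(not rendered) | 25 | 1.3%
(not rendered) | 24 | 1.3%
Other values (391) | 1481 | 78.4%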

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1779 | 94.1%
Space Separator | 87 | 4.6%
Other Punctuation | 24 | 1.3%

Most frequent character per category

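Other Letter

Value | Count | Frequency (%)
(not rendered) | 53 | 3.0%
(not rendered) | 47 | 2.6%
(not rendered) | 40 | 2.2%
(not rendered) | 36 | 2.0%
(not rendered) | 33 | 1.9%
(not rendered) | 32 | 1.8%
(not rendered) | 32 | 1.8%
(not rendered) | 25 | 1.4%
(not rendered) | 24 | 1.3%
(not rendered) | 23 | 1.3%
Other values (389) | 1434 | 80.6%

Space Separator

Value | Count | Frequency (%)
(space) | 87 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
/ | 24 | 100.0%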

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1779 | 94.1%
Common | 111 | 5.9%

Most frequent character per script

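Hangul

Value | Count | Frequency (%)
(not rendered) | 53 | 3.0%
(not rendered) | 47 | 2.6%
(not rendered) | 40 | 2.2%
(not rendered) | 36 | 2.0%
(not rendered) | 33 | 1.9%
(not rendered) | 32 | 1.8%
(not rendered) | 32 | 1.8%
(not rendered) | 25 | 1.4%
(not rendered) | 24 | 1.3%
(not rendered) | 23 | 1.3%
Other values (389) | 1434 | 80.6%

Common

Value | Count | Frequency (%)
(space) | 87 | 78.4%
/ | 24 | 21.6%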

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1779 | 94.1%
ASCII | 111 | 5.9%

Most frequent character per block

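ASCII

Value | Count | Frequency (%)
(space) | 87 | 78.4%
/ | 24 | 21.6%

Hangul

Value | Count | Frequency (%)
(not rendered) | 53 | 3.0%
(not rendered) | 47 | 2.6%
(not rendered) | 40 | 2.2%
(not rendered) | 36 | 2.0%
(not rendered) | 33 | 1.9%
(not rendered) | 32 | 1.8%
(not rendered) | 32 | 1.8%
(not rendered) | 25 | 1.4%
(not rendered) | 24 | 1.3%
(not rendered) | 23 | 1.3%
Other values (389) | 1434 | 80.6%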

항목설명내용
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Food
2nd row: Food
3rd row: Food
4th row: Food
5th row: Food

Common Values

Value | Count | Frequency (%)
Food | 523 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
food | 523 | 100.0%

항목대분류명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 523
Missing (%): 100.0%
Memory size: 4.7 KiB

항목소분류명1
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 10
Median length: 4
Mean length: 4.7992352
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Cuisine
2nd row: Cuisine
3rd row: Cuisine
4th row: Cuisine
5th row: Cuisine

Common Values

Value | Count | Frequency (%)
Menu | 370 | 70.7%
Theme | 48 | 9.2%
Subcuisine | 43 | 8.2%
Taste | 37 | 7.1%
Feature | 17 | 3.3%
Cuisine | 8 | 1.5%
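
The frequency column and the mean label length reported for this variable can both be recomputed from the six counts. This is a small consistency check using only figures from the Common Values table, not a reconstruction of how the profiling tool computes them.

```python
# Counts copied from the Common Values table of 항목소분류명1.
counts = {
    "Menu": 370,
    "Theme": 48,
    "Subcuisine": 43,
    "Taste": 37,
    "Feature": 17,
    "Cuisine": 8,
}

total = sum(counts.values())
freq_pct = {label: round(100 * n / total, 1) for label, n in counts.items()}

# Mean string length of the category labels, weighted by their counts.
mean_length = sum(len(label) * n for label, n in counts.items()) / total

print(total)                    # 523
print(freq_pct)
print(round(mean_length, 7))    # 4.7992352
```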

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
menu | 370 | 70.7%
theme | 48 | 9.2%
subcuisine | 43 | 8.2%
taste | 37 | 7.1%
feature | 17 | 3.3%
cuisine | 8 | 1.5%

항목소분류명2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 523
Missing (%): 100.0%
Memory size: 4.7 KiB

언어명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Kor
2nd row: Kor
3rd row: Kor
4th row: Kor
5th row: Kor

Common Values

Value | Count | Frequency (%)
Kor | 523 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
kor | 523 | 100.0%

마지막수정일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20210801
2nd row: 20210801
3rd row: 20210801
4th row: 20210801
5th row: 20210801

Common Values

Value | Count | Frequency (%)
20210801 | 523 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
20210801 | 523 | 100.0%

저화질이미지URL
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 523
Missing (%): 100.0%
Memory size: 4.7 KiB

태깅내용
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 523
Missing (%): 100.0%
Memory size: 4.7 KiB

대상기준년월
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 202108
2nd row: 202108
3rd row: 202108
4th row: 202108
5th row: 202108

Common Values

Value | Count | Frequency (%)
202108 | 523 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
202108 | 523 | 100.0%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation plot]

Variable | 음식식당사전ID | 항목소분류명1
음식식당사전ID | 1.000 | 0.897
항목소분류명1 | 0.897 | 1.000

[Figure: correlation plot]

Variable | 음식식당사전ID | 항목소분류명1
음식식당사전ID | 1.000 | 0.748
항목소분류명1 | 0.748 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

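First rows

Row | 기준년월 | 음식식당사전ID | 항목명 | 항목설명내용 | 항목대분류명 | 항목소분류명1 | 항목소분류명2 | 언어명 | 마지막수정일자 | 저화질이미지URL | 태깅내용 | 대상기준년월
0 | 202108 | 10001 | 양식 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
1 | 202108 | 10002 | 일식 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
2 | 202108 | 10003 | 중식 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
3 | 202108 | 10004 | 한식 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
4 | 202108 | 10005 | 세계음식 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
5 | 202108 | 10006 | 뷔페 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
6 | 202108 | 10007 | 카페 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
7 | 202108 | 10008 | 주점 | Food | <NA> | Cuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
8 | 202108 | 10009 | 고기 요리 | Food | <NA> | Subcuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
9 | 202108 | 10010 | 국수 / 면 요리 | Food | <NA> | Subcuisine | <NA> | Kor | 20210801 | <NA> | <NA> | 202108

Last rows

Row | 기준년월 | 음식식당사전ID | 항목명 | 항목설명내용 | 항목대분류명 | 항목소분류명1 | 항목소분류명2 | 언어명 | 마지막수정일자 | 저화질이미지URL | 태깅내용 | 대상기준년월
513 | 202108 | 10514 | 원테이블 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
514 | 202108 | 10515 | 좌식 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
515 | 202108 | 10516 | 주차 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
516 | 202108 | 10517 | 콜키지 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
517 | 202108 | 10518 | 콜키지프리 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
518 | 202108 | 10519 | 테라스 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
519 | 202108 | 10520 | 테이크아웃 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
520 | 202108 | 10521 | 한옥집 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
521 | 202108 | 10522 | 현금결제 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108
522 | 202108 | 10523 | 흡연 | Food | <NA> | Feature | <NA> | Kor | 20210801 | <NA> | <NA> | 202108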