Overview

Dataset statistics

Number of variables7
Number of observations418
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.4 KiB
Average record size in memory57.3 B

Variable types

Numeric1
Categorical4
Text2

Dataset

Description소상공인진흥공단에서 제공하는 2020년 8월 기준 영업중인 청년몰 점포리스트(시장명, 상호명, 소재지 등)데이터 입니다.
Author소상공인시장진흥공단
URLhttps://www.data.go.kr/data/15071666/fileData.do

Alerts

시장명(청년몰) is highly overall correlated with 구분 and 2 other fieldsHigh correlation
시도 is highly overall correlated with 구분 and 2 other fieldsHigh correlation
시군구 is highly overall correlated with 구분 and 2 other fieldsHigh correlation
구분 is highly overall correlated with 시도 and 2 other fieldsHigh correlation
구분 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:52:41.794788
Analysis finished2023-12-12 04:52:43.039967
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct418
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean209.5
Minimum1
Maximum418
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.8 KiB
2023-12-12T13:52:43.147238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.85
Q1105.25
median209.5
Q3313.75
95-th percentile397.15
Maximum418
Range417
Interquartile range (IQR)208.5

Descriptive statistics

Standard deviation120.81046
Coefficient of variation (CV)0.5766609
Kurtosis-1.2
Mean209.5
Median Absolute Deviation (MAD)104.5
Skewness0
Sum87571
Variance14595.167
MonotonicityStrictly increasing
2023-12-12T13:52:43.349438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
315 1
 
0.2%
287 1
 
0.2%
286 1
 
0.2%
285 1
 
0.2%
284 1
 
0.2%
283 1
 
0.2%
282 1
 
0.2%
281 1
 
0.2%
280 1
 
0.2%
Other values (408) 408
97.6%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
418 1
0.2%
417 1
0.2%
416 1
0.2%
415 1
0.2%
414 1
0.2%
413 1
0.2%
412 1
0.2%
411 1
0.2%
410 1
0.2%
409 1
0.2%

시도
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
강원
83 
경북
50 
경남
43 
경기
35 
전북
35 
Other values (10)
172 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기
2nd row경기
3rd row경기
4th row경기
5th row경기

Common Values

ValueCountFrequency (%)
강원 83
19.9%
경북 50
12.0%
경남 43
10.3%
경기 35
8.4%
전북 35
8.4%
서울 32
 
7.7%
대구 29
 
6.9%
충남 23
 
5.5%
전남 17
 
4.1%
충북 15
 
3.6%
Other values (5) 56
13.4%

Length

2023-12-12T13:52:43.528221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강원 83
19.9%
경북 50
12.0%
경남 43
10.3%
경기 35
8.4%
전북 35
8.4%
서울 32
 
7.7%
대구 29
 
6.9%
충남 23
 
5.5%
전남 17
 
4.1%
충북 15
 
3.6%
Other values (5) 56
13.4%

시군구
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
진주시
 
29
수원시
 
23
동대문구
 
20
속초시
 
19
삼척시
 
19
Other values (26)
308 

Length

Max length4
Median length3
Mean length3.0263158
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평택시
2nd row평택시
3rd row평택시
4th row평택시
5th row평택시

Common Values

ValueCountFrequency (%)
진주시 29
 
6.9%
수원시 23
 
5.5%
동대문구 20
 
4.8%
속초시 19
 
4.5%
삼척시 19
 
4.5%
달성군 19
 
4.5%
춘천시 18
 
4.3%
원주시 17
 
4.1%
여수시 17
 
4.1%
구미시 16
 
3.8%
Other values (21) 221
52.9%

Length

2023-12-12T13:52:43.734673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
진주시 29
 
6.9%
수원시 23
 
5.5%
동대문구 20
 
4.8%
속초시 19
 
4.5%
삼척시 19
 
4.5%
달성군 19
 
4.5%
춘천시 18
 
4.3%
원주시 17
 
4.1%
여수시 17
 
4.1%
구미시 16
 
3.8%
Other values (21) 221
52.9%

시장명(청년몰)
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)7.9%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
영동시장
 
23
경동시장
 
20
진주중앙지하도상가
 
20
설악로데오상점가
 
19
삼척중앙시장
 
19
Other values (28)
317 

Length

Max length10
Median length9
Mean length6.3421053
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통복시장
2nd row통복시장
3rd row통복시장
4th row통복시장
5th row통복시장

Common Values

ValueCountFrequency (%)
영동시장 23
 
5.5%
경동시장 20
 
4.8%
진주중앙지하도상가 20
 
4.8%
설악로데오상점가 19
 
4.5%
삼척중앙시장 19
 
4.5%
현풍백년도깨비시장 19
 
4.5%
육림고개상점가 18
 
4.3%
원주중앙시장 17
 
4.1%
여수중앙시장 17
 
4.1%
선산봉황시장 16
 
3.8%
Other values (23) 230
55.0%

Length

2023-12-12T13:52:43.976252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
영동시장 23
 
5.5%
진주중앙지하도상가 20
 
4.8%
경동시장 20
 
4.8%
설악로데오상점가 19
 
4.5%
삼척중앙시장 19
 
4.5%
현풍백년도깨비시장 19
 
4.5%
육림고개상점가 18
 
4.3%
원주중앙시장 17
 
4.1%
여수중앙시장 17
 
4.1%
선산봉황시장 16
 
3.8%
Other values (23) 230
55.0%
Distinct414
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-12T13:52:44.443728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length5.6028708
Min length1

Characters and Unicode

Total characters2342
Distinct characters520
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique412 ?
Unique (%)98.6%

Sample

1st row막치삼
2nd row프라온
3rd row지인공간
4th row마크라메스튜디오
5th row떡퍼맨&걸
ValueCountFrequency (%)
아리랑브루어리 4
 
0.8%
2
 
0.4%
better 2
 
0.4%
홍차 2
 
0.4%
어스푼 2
 
0.4%
라피네 2
 
0.4%
아꼬앙 1
 
0.2%
옥야180 1
 
0.2%
정스샌드위치 1
 
0.2%
비슬 1
 
0.2%
Other values (472) 472
96.3%
2023-12-12T13:52:45.140139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
3.2%
60
 
2.6%
49
 
2.1%
41
 
1.8%
( 31
 
1.3%
) 31
 
1.3%
26
 
1.1%
e 23
 
1.0%
23
 
1.0%
22
 
0.9%
Other values (510) 1961
83.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1872
79.9%
Lowercase Letter 162
 
6.9%
Uppercase Letter 117
 
5.0%
Space Separator 75
 
3.2%
Open Punctuation 35
 
1.5%
Close Punctuation 35
 
1.5%
Decimal Number 27
 
1.2%
Other Punctuation 13
 
0.6%
Modifier Symbol 3
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
3.2%
49
 
2.6%
41
 
2.2%
26
 
1.4%
23
 
1.2%
22
 
1.2%
21
 
1.1%
21
 
1.1%
20
 
1.1%
20
 
1.1%
Other values (441) 1569
83.8%
Uppercase Letter
ValueCountFrequency (%)
A 15
 
12.8%
E 10
 
8.5%
S 10
 
8.5%
B 8
 
6.8%
M 8
 
6.8%
T 7
 
6.0%
I 6
 
5.1%
O 5
 
4.3%
U 5
 
4.3%
P 5
 
4.3%
Other values (14) 38
32.5%
Lowercase Letter
ValueCountFrequency (%)
e 23
14.2%
a 18
11.1%
t 17
10.5%
i 13
8.0%
o 13
8.0%
n 12
 
7.4%
u 11
 
6.8%
r 10
 
6.2%
s 8
 
4.9%
h 6
 
3.7%
Other values (12) 31
19.1%
Decimal Number
ValueCountFrequency (%)
1 7
25.9%
9 5
18.5%
3 4
14.8%
0 3
11.1%
8 2
 
7.4%
2 2
 
7.4%
4 2
 
7.4%
7 1
 
3.7%
5 1
 
3.7%
Other Punctuation
ValueCountFrequency (%)
& 4
30.8%
. 3
23.1%
, 3
23.1%
: 1
 
7.7%
# 1
 
7.7%
' 1
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 31
88.6%
4
 
11.4%
Close Punctuation
ValueCountFrequency (%)
) 31
88.6%
4
 
11.4%
Space Separator
ValueCountFrequency (%)
75
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1858
79.3%
Latin 279
 
11.9%
Common 191
 
8.2%
Han 14
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
3.2%
49
 
2.6%
41
 
2.2%
26
 
1.4%
23
 
1.2%
22
 
1.2%
21
 
1.1%
21
 
1.1%
20
 
1.1%
20
 
1.1%
Other values (436) 1555
83.7%
Latin
ValueCountFrequency (%)
e 23
 
8.2%
a 18
 
6.5%
t 17
 
6.1%
A 15
 
5.4%
i 13
 
4.7%
o 13
 
4.7%
n 12
 
4.3%
u 11
 
3.9%
r 10
 
3.6%
E 10
 
3.6%
Other values (36) 137
49.1%
Common
ValueCountFrequency (%)
75
39.3%
( 31
16.2%
) 31
16.2%
1 7
 
3.7%
9 5
 
2.6%
3 4
 
2.1%
4
 
2.1%
& 4
 
2.1%
4
 
2.1%
. 3
 
1.6%
Other values (13) 23
 
12.0%
Han
ValueCountFrequency (%)
10
71.4%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1858
79.3%
ASCII 458
 
19.6%
CJK 14
 
0.6%
None 11
 
0.5%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
75
 
16.4%
( 31
 
6.8%
) 31
 
6.8%
e 23
 
5.0%
a 18
 
3.9%
t 17
 
3.7%
A 15
 
3.3%
i 13
 
2.8%
o 13
 
2.8%
n 12
 
2.6%
Other values (55) 210
45.9%
Hangul
ValueCountFrequency (%)
60
 
3.2%
49
 
2.6%
41
 
2.2%
26
 
1.4%
23
 
1.2%
22
 
1.2%
21
 
1.1%
21
 
1.1%
20
 
1.1%
20
 
1.1%
Other values (436) 1555
83.7%
CJK
ValueCountFrequency (%)
10
71.4%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
None
ValueCountFrequency (%)
4
36.4%
4
36.4%
´ 3
27.3%
Punctuation
ValueCountFrequency (%)
1
100.0%

업종
Categorical

Distinct5
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
음식업
235 
공방
73 
도소매업
57 
서비스업
52 
-
 
1

Length

Max length4
Median length3
Mean length3.0813397
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row음식업
2nd row음식업
3rd row공방
4th row공방
5th row음식업

Common Values

ValueCountFrequency (%)
음식업 235
56.2%
공방 73
 
17.5%
도소매업 57
 
13.6%
서비스업 52
 
12.4%
- 1
 
0.2%

Length

2023-12-12T13:52:45.366789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:52:45.551215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
음식업 235
56.2%
공방 73
 
17.5%
도소매업 57
 
13.6%
서비스업 52
 
12.4%
1
 
0.2%
Distinct401
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-12T13:52:45.939332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length45
Mean length28.272727
Min length15

Characters and Unicode

Total characters11818
Distinct characters150
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique392 ?
Unique (%)93.8%

Sample

1st row경기도 평택시 통복동 95-30
2nd row경기도 평택시 통복동 95-30
3rd row경기도 평택시 통복동 95-85
4th row경기도 평택시 통복동 95-35
5th row경기도 평택시 통복동 95-49
ValueCountFrequency (%)
2층 108
 
4.0%
강원도 65
 
2.4%
청년몰 51
 
1.9%
중앙로 50
 
1.9%
6 49
 
1.8%
진주시 36
 
1.3%
경상남도 36
 
1.3%
경기도 35
 
1.3%
경북 31
 
1.2%
3층 29
 
1.1%
Other values (514) 2203
81.8%
2023-12-12T13:52:46.589059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2295
 
19.4%
1 594
 
5.0%
2 545
 
4.6%
441
 
3.7%
353
 
3.0%
3 328
 
2.8%
308
 
2.6%
, 297
 
2.5%
291
 
2.5%
0 273
 
2.3%
Other values (140) 6093
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6083
51.5%
Decimal Number 2757
23.3%
Space Separator 2295
 
19.4%
Other Punctuation 314
 
2.7%
Dash Punctuation 158
 
1.3%
Open Punctuation 73
 
0.6%
Close Punctuation 72
 
0.6%
Uppercase Letter 61
 
0.5%
Control 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
441
 
7.2%
353
 
5.8%
308
 
5.1%
291
 
4.8%
247
 
4.1%
226
 
3.7%
196
 
3.2%
191
 
3.1%
153
 
2.5%
142
 
2.3%
Other values (118) 3535
58.1%
Decimal Number
ValueCountFrequency (%)
1 594
21.5%
2 545
19.8%
3 328
11.9%
0 273
9.9%
5 253
9.2%
6 223
 
8.1%
4 192
 
7.0%
7 136
 
4.9%
8 120
 
4.4%
9 93
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
A 39
63.9%
D 20
32.8%
B 2
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 297
94.6%
/ 17
 
5.4%
Open Punctuation
ValueCountFrequency (%)
( 51
69.9%
22
30.1%
Close Punctuation
ValueCountFrequency (%)
) 51
70.8%
21
29.2%
Space Separator
ValueCountFrequency (%)
2295
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 158
100.0%
Control
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6083
51.5%
Common 5674
48.0%
Latin 61
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
441
 
7.2%
353
 
5.8%
308
 
5.1%
291
 
4.8%
247
 
4.1%
226
 
3.7%
196
 
3.2%
191
 
3.1%
153
 
2.5%
142
 
2.3%
Other values (118) 3535
58.1%
Common
ValueCountFrequency (%)
2295
40.4%
1 594
 
10.5%
2 545
 
9.6%
3 328
 
5.8%
, 297
 
5.2%
0 273
 
4.8%
5 253
 
4.5%
6 223
 
3.9%
4 192
 
3.4%
- 158
 
2.8%
Other values (9) 516
 
9.1%
Latin
ValueCountFrequency (%)
A 39
63.9%
D 20
32.8%
B 2
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6083
51.5%
ASCII 5692
48.2%
None 43
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2295
40.3%
1 594
 
10.4%
2 545
 
9.6%
3 328
 
5.8%
, 297
 
5.2%
0 273
 
4.8%
5 253
 
4.4%
6 223
 
3.9%
4 192
 
3.4%
- 158
 
2.8%
Other values (10) 534
 
9.4%
Hangul
ValueCountFrequency (%)
441
 
7.2%
353
 
5.8%
308
 
5.1%
291
 
4.8%
247
 
4.1%
226
 
3.7%
196
 
3.2%
191
 
3.1%
153
 
2.5%
142
 
2.3%
Other values (118) 3535
58.1%
None
ValueCountFrequency (%)
22
51.2%
21
48.8%

Interactions

2023-12-12T13:52:42.568744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:52:46.720152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시도시군구시장명(청년몰)업종
구분1.0000.9080.9900.9930.394
시도0.9081.0000.9981.0000.343
시군구0.9900.9981.0001.0000.421
시장명(청년몰)0.9931.0001.0001.0000.462
업종0.3940.3430.4210.4621.000
2023-12-12T13:52:46.859479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종시장명(청년몰)시도시군구
업종1.0000.2300.1510.206
시장명(청년몰)0.2301.0000.9770.997
시도0.1510.9771.0000.951
시군구0.2060.9970.9511.000
2023-12-12T13:52:46.994554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시도시군구시장명(청년몰)업종
구분1.0000.6260.8960.9190.173
시도0.6261.0000.9510.9770.151
시군구0.8960.9511.0000.9970.206
시장명(청년몰)0.9190.9770.9971.0000.230
업종0.1730.1510.2060.2301.000

Missing values

2023-12-12T13:52:42.717063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:52:42.965706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분시도시군구시장명(청년몰)상호명업종소재지
01경기평택시통복시장막치삼음식업경기도 평택시 통복동 95-30
12경기평택시통복시장프라온음식업경기도 평택시 통복동 95-30
23경기평택시통복시장지인공간공방경기도 평택시 통복동 95-85
34경기평택시통복시장마크라메스튜디오공방경기도 평택시 통복동 95-35
45경기평택시통복시장떡퍼맨&걸음식업경기도 평택시 통복동 95-49
56경기평택시통복시장불독스테이크음식업경기도 평택시 통복동 95-31
67경기평택시통복시장예원닭강정음식업경기도 평택시 통복동 95-35
78경기평택시통복시장통복이네 떡갈비음식업경기도 평택시 통복동 95-33
89경기평택시통복시장나우텔레콤도소매업경기도 평택시 통복동 95-87
910경기평택시통복시장오츠카레음식업경기도 평택시 통복동 95-104
구분시도시군구시장명(청년몰)상호명업종소재지
408409제주제주시제주중앙로상점가한땀작업실공방제주시 중앙로 11길 1, 1층 107호(이도일동 1362-1)
409410제주제주시제주중앙로상점가온정떡방음식업제주시 중앙로 11길 1, 2층 201호(이도일동 1362-1)
410411제주제주시제주중앙로상점가치즈식당음식업제주시 중앙로 11길 1, 2층 202호(이도일동 1362-1)
411412제주제주시제주중앙로상점가비스듬히음식업제주시 중앙로 11길 1, 2층 203호(이도일동 1362-1)
412413제주제주시제주중앙로상점가지조스바오음식업제주시 중앙로 11길 1, 2층 204호(이도일동 1362-1)
413414제주제주시제주중앙로상점가착한혼밥음식업제주시 중앙로 11길 1, 2층 205호(이도일동 1362-1)
414415제주제주시제주중앙로상점가아꼬앙음식업제주시 중앙로 11길 1, 2층 206호(이도일동 1362-1)
415416제주제주시제주중앙로상점가떡갈비먹잰음식업제주시 중앙로 11길 1, 2층 207호(이도일동 1362-1)
416417제주제주시제주중앙로상점가제주전유화(花)음식업제주시 중앙로 11길 1, 2층 210호(이도일동 1362-1)
417418제주제주시제주중앙로상점가고르고 고른 제주도소매업제주시 중앙로 11길 1, 3층 302호(이도일동 1362-1)