Overview

Dataset statistics

Number of variables10
Number of observations70
Missing cells61
Missing cells (%)8.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.7 KiB
Average record size in memory82.9 B

Variable types

Text3
Categorical6
Numeric1

Dataset

Description본 데이터는 전통시장 및 상점가 육성을 위한 특별법에 따라 시행되는 실태조사 자료 중 골목형상점가 조사 결과 자료 입니다.
URLhttps://www.data.go.kr/data/15118622/fileData.do

Alerts

기준일자 has constant value ""Constant
개설주기 is highly imbalanced (89.2%)Imbalance
정기 휴일 유무 is highly imbalanced (57.8%)Imbalance
시장전용 고객주차장_보유여부 is highly imbalanced (53.1%)Imbalance
취급상품 has 61 (87.1%) missing valuesMissing
시장명 has unique valuesUnique
주소 has unique valuesUnique
전체 점포 수 has 1 (1.4%) zerosZeros

Reproduction

Analysis started2023-12-12 11:41:22.101192
Analysis finished2023-12-12 11:41:23.503486
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시장명
Text

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-12T20:41:23.832030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13.5
Mean length9.7
Min length4

Characters and Unicode

Total characters679
Distinct characters138
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row의왕예술의거리
2nd row서재소상공인연합골목형상점가
3rd row용봉지구 골목형상점가
4th row전남대후문 골목형상점가
5th row부평테마의거리 상인회
ValueCountFrequency (%)
상점가 13
 
11.6%
골목형 11
 
9.8%
골목형상점가 8
 
7.1%
의왕예술의거리 1
 
0.9%
월드상가 1
 
0.9%
번성상인회 1
 
0.9%
장곡 1
 
0.9%
꿈의숲 1
 
0.9%
가래비중앙로상점가(양주 1
 
0.9%
백운대성시장 1
 
0.9%
Other values (73) 73
65.2%
2023-12-12T20:41:24.530656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
9.6%
59
 
8.7%
52
 
7.7%
52
 
7.7%
50
 
7.4%
48
 
7.1%
42
 
6.2%
14
 
2.1%
12
 
1.8%
12
 
1.8%
Other values (128) 273
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 633
93.2%
Space Separator 42
 
6.2%
Decimal Number 2
 
0.3%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
10.3%
59
 
9.3%
52
 
8.2%
52
 
8.2%
50
 
7.9%
48
 
7.6%
14
 
2.2%
12
 
1.9%
12
 
1.9%
8
 
1.3%
Other values (123) 261
41.2%
Decimal Number
ValueCountFrequency (%)
4 1
50.0%
5 1
50.0%
Space Separator
ValueCountFrequency (%)
42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 633
93.2%
Common 46
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
10.3%
59
 
9.3%
52
 
8.2%
52
 
8.2%
50
 
7.9%
48
 
7.6%
14
 
2.2%
12
 
1.9%
12
 
1.9%
8
 
1.3%
Other values (123) 261
41.2%
Common
ValueCountFrequency (%)
42
91.3%
( 1
 
2.2%
) 1
 
2.2%
4 1
 
2.2%
5 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 633
93.2%
ASCII 46
 
6.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
65
 
10.3%
59
 
9.3%
52
 
8.2%
52
 
8.2%
50
 
7.9%
48
 
7.6%
14
 
2.2%
12
 
1.9%
12
 
1.9%
8
 
1.3%
Other values (123) 261
41.2%
ASCII
ValueCountFrequency (%)
42
91.3%
( 1
 
2.2%
) 1
 
2.2%
4 1
 
2.2%
5 1
 
2.2%

개설주기
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
상설
69 
정기
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.4%

Sample

1st row상설
2nd row상설
3rd row상설
4th row상설
5th row상설

Common Values

ValueCountFrequency (%)
상설 69
98.6%
정기 1
 
1.4%

Length

2023-12-12T20:41:24.768228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:24.925334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상설 69
98.6%
정기 1
 
1.4%

주소
Text

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-12T20:41:25.353799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length19.942857
Min length15

Characters and Unicode

Total characters1396
Distinct characters131
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row경기도 의왕시 내손동746 일원
2nd row대구광역시 달성군 다사읍 서재리 146-7
3rd row광주광역시 북구 저불로39번길 19
4th row광주광역시 북구 우치로 90
5th row인천광역시 부평구 경원대로 103번길 33 부평동 일원
ValueCountFrequency (%)
서울특별시 17
 
5.4%
인천광역시 15
 
4.8%
서구 14
 
4.5%
경기도 10
 
3.2%
일원 9
 
2.9%
대전광역시 8
 
2.6%
광주광역시 6
 
1.9%
중구 5
 
1.6%
동구 4
 
1.3%
4
 
1.3%
Other values (193) 221
70.6%
2023-12-12T20:41:26.088284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
244
 
17.5%
66
 
4.7%
65
 
4.7%
58
 
4.2%
1 56
 
4.0%
- 50
 
3.6%
43
 
3.1%
2 40
 
2.9%
3 37
 
2.7%
35
 
2.5%
Other values (121) 702
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 793
56.8%
Decimal Number 305
 
21.8%
Space Separator 244
 
17.5%
Dash Punctuation 50
 
3.6%
Math Symbol 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
 
8.3%
65
 
8.2%
58
 
7.3%
43
 
5.4%
35
 
4.4%
34
 
4.3%
19
 
2.4%
19
 
2.4%
19
 
2.4%
18
 
2.3%
Other values (108) 417
52.6%
Decimal Number
ValueCountFrequency (%)
1 56
18.4%
2 40
13.1%
3 37
12.1%
7 29
9.5%
4 28
9.2%
5 27
8.9%
6 26
8.5%
8 23
7.5%
0 20
 
6.6%
9 19
 
6.2%
Space Separator
ValueCountFrequency (%)
244
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 793
56.8%
Common 603
43.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
 
8.3%
65
 
8.2%
58
 
7.3%
43
 
5.4%
35
 
4.4%
34
 
4.3%
19
 
2.4%
19
 
2.4%
19
 
2.4%
18
 
2.3%
Other values (108) 417
52.6%
Common
ValueCountFrequency (%)
244
40.5%
1 56
 
9.3%
- 50
 
8.3%
2 40
 
6.6%
3 37
 
6.1%
7 29
 
4.8%
4 28
 
4.6%
5 27
 
4.5%
6 26
 
4.3%
8 23
 
3.8%
Other values (3) 43
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 793
56.8%
ASCII 603
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
244
40.5%
1 56
 
9.3%
- 50
 
8.3%
2 40
 
6.6%
3 37
 
6.1%
7 29
 
4.8%
4 28
 
4.6%
5 27
 
4.5%
6 26
 
4.3%
8 23
 
3.8%
Other values (3) 43
 
7.1%
Hangul
ValueCountFrequency (%)
66
 
8.3%
65
 
8.2%
58
 
7.3%
43
 
5.4%
35
 
4.4%
34
 
4.3%
19
 
2.4%
19
 
2.4%
19
 
2.4%
18
 
2.3%
Other values (108) 417
52.6%

취급상품
Text

MISSING 

Distinct8
Distinct (%)88.9%
Missing61
Missing (%)87.1%
Memory size692.0 B
2023-12-12T20:41:26.341652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.5555556
Min length2

Characters and Unicode

Total characters32
Distinct characters20
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)77.8%

Sample

1st row음식점업
2nd row음식점 노래연습장
3rd row공산품
4th row음식
5th row음식점
ValueCountFrequency (%)
음식 2
20.0%
음식점 2
20.0%
음식점업 1
10.0%
노래연습장 1
10.0%
공산품 1
10.0%
떡볶이 1
10.0%
건어물 1
10.0%
수산물 1
10.0%
2023-12-12T20:41:26.794958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
15.6%
5
15.6%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (10) 10
31.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31
96.9%
Space Separator 1
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
16.1%
5
16.1%
3
 
9.7%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (9) 9
29.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31
96.9%
Common 1
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
16.1%
5
16.1%
3
 
9.7%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (9) 9
29.0%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31
96.9%
ASCII 1
 
3.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
16.1%
5
16.1%
3
 
9.7%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (9) 9
29.0%
ASCII
ValueCountFrequency (%)
1
100.0%

전체 점포 수
Real number (ℝ)

ZEROS 

Distinct57
Distinct (%)81.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.171429
Minimum0
Maximum532
Zeros1
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size762.0 B
2023-12-12T20:41:27.027227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile33.45
Q152.25
median68
Q3102
95-th percentile243.45
Maximum532
Range532
Interquartile range (IQR)49.75

Descriptive statistics

Standard deviation78.540486
Coefficient of variation (CV)0.86145942
Kurtosis15.004238
Mean91.171429
Median Absolute Deviation (MAD)24.5
Skewness3.4063333
Sum6382
Variance6168.6079
MonotonicityDecreasing
2023-12-12T20:41:27.776203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65 3
 
4.3%
62 3
 
4.3%
40 3
 
4.3%
53 3
 
4.3%
70 2
 
2.9%
102 2
 
2.9%
56 2
 
2.9%
112 2
 
2.9%
101 2
 
2.9%
64 1
 
1.4%
Other values (47) 47
67.1%
ValueCountFrequency (%)
0 1
 
1.4%
24 1
 
1.4%
30 1
 
1.4%
33 1
 
1.4%
34 1
 
1.4%
36 1
 
1.4%
38 1
 
1.4%
40 3
4.3%
42 1
 
1.4%
43 1
 
1.4%
ValueCountFrequency (%)
532 1
1.4%
320 1
1.4%
287 1
1.4%
279 1
1.4%
200 1
1.4%
170 1
1.4%
163 1
1.4%
147 1
1.4%
136 1
1.4%
127 1
1.4%

정기 휴일 유무
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
없음
64 
있음
 
6

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row있음
3rd row없음
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 64
91.4%
있음 6
 
8.6%

Length

2023-12-12T20:41:28.017509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:28.200909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 64
91.4%
있음 6
 
8.6%
Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
미보유
49 
보유
21 

Length

Max length3
Median length3
Mean length2.7
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미보유
2nd row미보유
3rd row보유
4th row보유
5th row미보유

Common Values

ValueCountFrequency (%)
미보유 49
70.0%
보유 21
30.0%

Length

2023-12-12T20:41:28.366317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:28.552958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미보유 49
70.0%
보유 21
30.0%
Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
미보유
60 
보유
10 

Length

Max length3
Median length3
Mean length2.8571429
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미보유
2nd row미보유
3rd row미보유
4th row미보유
5th row미보유

Common Values

ValueCountFrequency (%)
미보유 60
85.7%
보유 10
 
14.3%

Length

2023-12-12T20:41:28.816296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:29.030858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미보유 60
85.7%
보유 10
 
14.3%
Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
미보유
63 
보유

Length

Max length3
Median length3
Mean length2.9
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미보유
2nd row미보유
3rd row미보유
4th row보유
5th row미보유

Common Values

ValueCountFrequency (%)
미보유 63
90.0%
보유 7
 
10.0%

Length

2023-12-12T20:41:29.191751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:29.395520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미보유 63
90.0%
보유 7
 
10.0%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-08-14
70 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-14
2nd row2023-08-14
3rd row2023-08-14
4th row2023-08-14
5th row2023-08-14

Common Values

ValueCountFrequency (%)
2023-08-14 70
100.0%

Length

2023-12-12T20:41:29.623474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:29.800133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-14 70
100.0%

Interactions

2023-12-12T20:41:22.890558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:41:29.918860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시장명개설주기주소취급상품전체 점포 수정기 휴일 유무CCTV_보유여부공동화장실_보유여부시장전용 고객주차장_보유여부
시장명1.0001.0001.0001.0001.0001.0001.0001.0001.000
개설주기1.0001.0001.000NaN0.0000.0000.0000.0450.000
주소1.0001.0001.0001.0001.0001.0001.0001.0001.000
취급상품1.000NaN1.0001.0001.0001.0001.0000.0000.000
전체 점포 수1.0000.0001.0001.0001.0000.3390.1270.0000.099
정기 휴일 유무1.0000.0001.0001.0000.3391.0000.0000.0000.151
CCTV_보유여부1.0000.0001.0001.0000.1270.0001.0000.0940.000
공동화장실_보유여부1.0000.0451.0000.0000.0000.0000.0941.0000.483
시장전용 고객주차장_보유여부1.0000.0001.0000.0000.0990.1510.0000.4831.000
2023-12-12T20:41:30.147000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정기 휴일 유무CCTV_보유여부시장전용 고객주차장_보유여부공동화장실_보유여부개설주기
정기 휴일 유무1.0000.0000.0950.0000.000
CCTV_보유여부0.0001.0000.0000.0580.000
시장전용 고객주차장_보유여부0.0950.0001.0000.3210.000
공동화장실_보유여부0.0000.0580.3211.0000.025
개설주기0.0000.0000.0000.0251.000
2023-12-12T20:41:30.329392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전체 점포 수개설주기정기 휴일 유무CCTV_보유여부공동화장실_보유여부시장전용 고객주차장_보유여부
전체 점포 수1.0000.0000.3480.1260.0000.096
개설주기0.0001.0000.0000.0000.0250.000
정기 휴일 유무0.3480.0001.0000.0000.0000.095
CCTV_보유여부0.1260.0000.0001.0000.0580.000
공동화장실_보유여부0.0000.0250.0000.0581.0000.321
시장전용 고객주차장_보유여부0.0960.0000.0950.0000.3211.000

Missing values

2023-12-12T20:41:23.093729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:41:23.388811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시장명개설주기주소취급상품전체 점포 수정기 휴일 유무CCTV_보유여부공동화장실_보유여부시장전용 고객주차장_보유여부기준일자
0의왕예술의거리상설경기도 의왕시 내손동746 일원<NA>532없음미보유미보유미보유2023-08-14
1서재소상공인연합골목형상점가상설대구광역시 달성군 다사읍 서재리 146-7<NA>320있음미보유미보유미보유2023-08-14
2용봉지구 골목형상점가상설광주광역시 북구 저불로39번길 19<NA>287없음보유미보유미보유2023-08-14
3전남대후문 골목형상점가상설광주광역시 북구 우치로 90<NA>279없음보유미보유보유2023-08-14
4부평테마의거리 상인회상설인천광역시 부평구 경원대로 103번길 33 부평동 일원음식점업200없음미보유미보유미보유2023-08-14
5비래동 골목형 상점가상설대전광역시 대덕구 비래동 125-6<NA>170없음미보유미보유미보유2023-08-14
6성수역골목형상점가상설서울특별시 성동구 성수동2가<NA>163있음미보유미보유미보유2023-08-14
7산정상인회상설광주광역시 광산구 산정동 965-7음식점 노래연습장147없음보유미보유미보유2023-08-14
8구월문화로상점가상설인천광역시 남동구 구월동 1368<NA>136없음미보유미보유미보유2023-08-14
9당동로시장상인회상설경기도 군포시 당동 785-40<NA>127없음미보유미보유미보유2023-08-14
시장명개설주기주소취급상품전체 점포 수정기 휴일 유무CCTV_보유여부공동화장실_보유여부시장전용 고객주차장_보유여부기준일자
60연희로 골목형 상점가상설인천광역시 서구 연희동 686-12<NA>40없음미보유미보유미보유2023-08-14
61중부건어물골목형상점가상설대전광역시 동구 중동 30-3건어물40있음미보유미보유미보유2023-08-14
62무거현대시장상설울산광역시 남구 무거동 1546-5<NA>40없음미보유보유미보유2023-08-14
63수암회수산시장상설울산광역시 남구 야음동 700-7수산물38있음보유보유보유2023-08-14
64꿈꾸는 건지골 골목형 상점가상설인천광역시 서구 가좌동 209-15<NA>36없음미보유미보유미보유2023-08-14
65원적로 골목형상점가상설인천광역시 서구 가좌동 78-12 ~ 81-52<NA>34없음미보유미보유미보유2023-08-14
66대림중앙골목형상점가상설영등포구 디지털로 37길 일대<NA>33없음미보유미보유미보유2023-08-14
67탁옥로 골목형 상점가상설인천광역시 서구 심곡동 335-5<NA>30없음미보유미보유미보유2023-08-14
68필동골목형상점가상설서울특별시 중구 필동3가 30-2<NA>24없음보유보유미보유2023-08-14
69모란민속5일장정기경기도 성남시 중원구 성남동 4214<NA>0없음보유보유미보유2023-08-14