Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows17
Duplicate rows (%)0.2%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

Text2
Categorical1

Dataset

Description경기도 광주시 관내 업소정보(음식점, 병원, 약국, 공공시설 등) 현황에 대한 데이터로 업소명, 읍면동명, 도로명주소 등을 제공합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15042402/fileData.do

Alerts

Dataset has 17 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-16 15:16:19.585147
Analysis finished2023-12-16 15:16:24.130392
Duration4.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소
Text

Distinct9807
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T15:16:24.802329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length6.7475
Min length1

Characters and Unicode

Total characters67475
Distinct characters1023
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9670 ?
Unique (%)96.7%

Sample

1st row풍선홀릭
2nd row퍼시픽SA
3rd row주식회사 에이치제이건설
4th row주식회사 승진도시가스
5th row참숯구이 아리아리
ValueCountFrequency (%)
주식회사 440
 
3.7%
111
 
0.9%
어린이집 32
 
0.3%
노래연습장 19
 
0.2%
사무소 18
 
0.1%
광주 17
 
0.1%
농업회사법인 17
 
0.1%
gs25 15
 
0.1%
14
 
0.1%
경기광주점 14
 
0.1%
Other values (10552) 11325
94.2%
2023-12-16T15:16:28.254184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2148
 
3.2%
2030
 
3.0%
1738
 
2.6%
1516
 
2.2%
1480
 
2.2%
( 1287
 
1.9%
) 1286
 
1.9%
950
 
1.4%
794
 
1.2%
765
 
1.1%
Other values (1013) 53481
79.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57346
85.0%
Uppercase Letter 2609
 
3.9%
Space Separator 2032
 
3.0%
Lowercase Letter 1759
 
2.6%
Close Punctuation 1322
 
2.0%
Open Punctuation 1320
 
2.0%
Decimal Number 573
 
0.8%
Other Punctuation 490
 
0.7%
Dash Punctuation 18
 
< 0.1%
Other Symbol 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2148
 
3.7%
1738
 
3.0%
1516
 
2.6%
1480
 
2.6%
950
 
1.7%
794
 
1.4%
765
 
1.3%
709
 
1.2%
688
 
1.2%
670
 
1.2%
Other values (929) 45888
80.0%
Uppercase Letter
ValueCountFrequency (%)
S 231
 
8.9%
C 206
 
7.9%
E 179
 
6.9%
A 160
 
6.1%
O 153
 
5.9%
N 151
 
5.8%
G 135
 
5.2%
T 134
 
5.1%
L 121
 
4.6%
I 116
 
4.4%
Other values (16) 1023
39.2%
Lowercase Letter
ValueCountFrequency (%)
e 221
12.6%
o 176
 
10.0%
a 148
 
8.4%
n 138
 
7.8%
i 137
 
7.8%
r 101
 
5.7%
s 100
 
5.7%
t 92
 
5.2%
l 82
 
4.7%
c 78
 
4.4%
Other values (16) 486
27.6%
Other Punctuation
ValueCountFrequency (%)
* 314
64.1%
. 89
 
18.2%
& 42
 
8.6%
, 24
 
4.9%
' 7
 
1.4%
# 6
 
1.2%
3
 
0.6%
/ 2
 
0.4%
? 2
 
0.4%
· 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 133
23.2%
1 107
18.7%
5 78
13.6%
0 54
9.4%
3 50
 
8.7%
4 46
 
8.0%
9 34
 
5.9%
6 31
 
5.4%
7 24
 
4.2%
8 16
 
2.8%
Open Punctuation
ValueCountFrequency (%)
( 1287
97.5%
32
 
2.4%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1286
97.3%
35
 
2.6%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2030
99.9%
  2
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57332
85.0%
Common 5758
 
8.5%
Latin 4368
 
6.5%
Han 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2148
 
3.7%
1738
 
3.0%
1516
 
2.6%
1480
 
2.6%
950
 
1.7%
794
 
1.4%
765
 
1.3%
709
 
1.2%
688
 
1.2%
670
 
1.2%
Other values (914) 45874
80.0%
Latin
ValueCountFrequency (%)
S 231
 
5.3%
e 221
 
5.1%
C 206
 
4.7%
E 179
 
4.1%
o 176
 
4.0%
A 160
 
3.7%
O 153
 
3.5%
N 151
 
3.5%
a 148
 
3.4%
n 138
 
3.2%
Other values (42) 2605
59.6%
Common
ValueCountFrequency (%)
2030
35.3%
( 1287
22.4%
) 1286
22.3%
* 314
 
5.5%
2 133
 
2.3%
1 107
 
1.9%
. 89
 
1.5%
5 78
 
1.4%
0 54
 
0.9%
3 50
 
0.9%
Other values (21) 330
 
5.7%
Han
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57327
85.0%
ASCII 10053
 
14.9%
None 76
 
0.1%
CJK 16
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2148
 
3.7%
1738
 
3.0%
1516
 
2.6%
1480
 
2.6%
950
 
1.7%
794
 
1.4%
765
 
1.3%
709
 
1.2%
688
 
1.2%
670
 
1.2%
Other values (911) 45869
80.0%
ASCII
ValueCountFrequency (%)
2030
20.2%
( 1287
 
12.8%
) 1286
 
12.8%
* 314
 
3.1%
S 231
 
2.3%
e 221
 
2.2%
C 206
 
2.0%
E 179
 
1.8%
o 176
 
1.8%
A 160
 
1.6%
Other values (68) 3963
39.4%
None
ValueCountFrequency (%)
35
46.1%
32
42.1%
3
 
3.9%
3
 
3.9%
  2
 
2.6%
· 1
 
1.3%
CJK
ValueCountFrequency (%)
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (5) 5
31.2%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

읍면동
Categorical

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
초월읍
1632 
곤지암읍
1111 
오포1동
770 
오포2동
758 
도척면
636 
Other values (18)
5093 

Length

Max length5
Median length3
Mean length3.2252
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row곤지암읍
2nd row삼동
3rd row초월읍
4th row초월읍
5th row오포1동

Common Values

ValueCountFrequency (%)
초월읍 1632
16.3%
곤지암읍 1111
11.1%
오포1동 770
 
7.7%
오포2동 758
 
7.6%
도척면 636
 
6.4%
태전동 596
 
6.0%
경안동 567
 
5.7%
신현동 540
 
5.4%
송정동 537
 
5.4%
능평동 456
 
4.6%
Other values (13) 2397
24.0%

Length

2023-12-16T15:16:29.279666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
초월읍 1632
16.3%
곤지암읍 1111
11.1%
오포1동 770
 
7.7%
오포2동 758
 
7.6%
도척면 636
 
6.4%
태전동 596
 
6.0%
경안동 567
 
5.7%
신현동 540
 
5.4%
송정동 537
 
5.4%
능평동 456
 
4.6%
Other values (13) 2397
24.0%
Distinct7189
Distinct (%)71.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T15:16:30.605742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length26
Mean length18.6824
Min length9

Characters and Unicode

Total characters186824
Distinct characters194
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5592 ?
Unique (%)55.9%

Sample

1st row경기도 광주시 곤지암읍 평촌길 59-16
2nd row경기도 광주시 고불로 452
3rd row경기도 광주시 초월읍 설월길36번길 15-21
4th row경기도 광주시 초월읍 현산로 98
5th row경기도 광주시 오포로 485
ValueCountFrequency (%)
경기도 10000
22.8%
광주시 10000
22.8%
초월읍 1632
 
3.7%
곤지암읍 1111
 
2.5%
도척면 636
 
1.4%
경충대로 456
 
1.0%
퇴촌면 351
 
0.8%
오포로 332
 
0.8%
중앙로 279
 
0.6%
회안대로 247
 
0.6%
Other values (3977) 18834
42.9%
2023-12-16T15:16:32.225204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33882
18.1%
11239
 
6.0%
11052
 
5.9%
10514
 
5.6%
10180
 
5.4%
10038
 
5.4%
10005
 
5.4%
1 7618
 
4.1%
6571
 
3.5%
5943
 
3.2%
Other values (184) 69782
37.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 112196
60.1%
Decimal Number 36728
 
19.7%
Space Separator 33882
 
18.1%
Dash Punctuation 4018
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11239
 
10.0%
11052
 
9.9%
10514
 
9.4%
10180
 
9.1%
10038
 
8.9%
10005
 
8.9%
6571
 
5.9%
5943
 
5.3%
2743
 
2.4%
2509
 
2.2%
Other values (172) 31402
28.0%
Decimal Number
ValueCountFrequency (%)
1 7618
20.7%
2 4910
13.4%
3 4238
11.5%
4 3800
10.3%
5 3229
8.8%
6 2926
 
8.0%
7 2715
 
7.4%
0 2486
 
6.8%
8 2441
 
6.6%
9 2365
 
6.4%
Space Separator
ValueCountFrequency (%)
33882
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4018
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 112196
60.1%
Common 74628
39.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11239
 
10.0%
11052
 
9.9%
10514
 
9.4%
10180
 
9.1%
10038
 
8.9%
10005
 
8.9%
6571
 
5.9%
5943
 
5.3%
2743
 
2.4%
2509
 
2.2%
Other values (172) 31402
28.0%
Common
ValueCountFrequency (%)
33882
45.4%
1 7618
 
10.2%
2 4910
 
6.6%
3 4238
 
5.7%
- 4018
 
5.4%
4 3800
 
5.1%
5 3229
 
4.3%
6 2926
 
3.9%
7 2715
 
3.6%
0 2486
 
3.3%
Other values (2) 4806
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 112196
60.1%
ASCII 74628
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33882
45.4%
1 7618
 
10.2%
2 4910
 
6.6%
3 4238
 
5.7%
- 4018
 
5.4%
4 3800
 
5.1%
5 3229
 
4.3%
6 2926
 
3.9%
7 2715
 
3.6%
0 2486
 
3.3%
Other values (2) 4806
 
6.4%
Hangul
ValueCountFrequency (%)
11239
 
10.0%
11052
 
9.9%
10514
 
9.4%
10180
 
9.1%
10038
 
8.9%
10005
 
8.9%
6571
 
5.9%
5943
 
5.3%
2743
 
2.4%
2509
 
2.2%
Other values (172) 31402
28.0%

Missing values

2023-12-16T15:16:23.132969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:16:23.923433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소읍면동도로명주소
29969풍선홀릭곤지암읍경기도 광주시 곤지암읍 평촌길 59-16
10674퍼시픽SA삼동경기도 광주시 고불로 452
28464주식회사 에이치제이건설초월읍경기도 광주시 초월읍 설월길36번길 15-21
23964주식회사 승진도시가스초월읍경기도 광주시 초월읍 현산로 98
16203참숯구이 아리아리오포1동경기도 광주시 오포로 485
27926진성무역초월읍경기도 광주시 초월읍 현산로361번길 5-11
17250코웨이(주) 오포지국오포1동경기도 광주시 고산길 4
34978스타당구클럽도척면경기도 광주시 도척면 다람로 4
35192(주)제국포장도척면경기도 광주시 도척면 국사봉로 25
12529명랑핫도그 태전점태전동경기도 광주시 태전동로 21
업소읍면동도로명주소
10512오늘도어여쁨중대동경기도 광주시 텃골길47번길 18
3602용해금속회덕동경기도 광주시 회덕길 34-10
29097모세수산퇴촌면경기도 광주시 퇴촌면 정영로 524
31549금金초밥곤지암읍경기도 광주시 곤지암읍 경충대로 691
22607새빛경영컨설팅신현동경기도 광주시 신현로 54
22337최고봉홍보사능평동경기도 광주시 수레실길 143-36
940플랜에이치과의원경안동경기도 광주시 중앙로 107
8280젠타코리아목동경기도 광주시 광남안로 256
916024시전주콩나물국밥태전동경기도 광주시 고불로 68
8065(주) 예스런던 강남 300CC목동경기도 광주시 새말길 353

Duplicate rows

Most frequently occurring

업소읍면동도로명주소# duplicates
7세일통운송정동경기도 광주시 회안대로 98411
3대륙화물퇴촌면경기도 광주시 퇴촌면 천진암로 3967
8소백운수송정동경기도 광주시 회안대로 9847
13조은나라장지동경기도 광주시 포은대로 692-127
10아모레카운셀러경안동경기도 광주시 광주대로 646
0건우특송퇴촌면경기도 광주시 퇴촌면 천진암로 3965
6상우모터스송정동경기도 광주시 회안대로 9845
14한성운수송정동경기도 광주시 회안대로 9845
11알파로지스퇴촌면경기도 광주시 퇴촌면 천진암로 3964
12정성기업송정동경기도 광주시 회안대로 9843