Overview

Dataset statistics

Number of variables4
Number of observations1018
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory31.9 KiB
Average record size in memory32.1 B

Variable types

Categorical2
Text2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15178/S/1/datasetView.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 09:56:48.849974
Analysis finished2023-12-11 09:56:49.460104
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

CU
Categorical

Distinct6
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
GS25
319 
CU
270 
세븐일레븐
258 
미니스톱
158 
씨스페이스
 
12

Length

Max length6
Median length5
Mean length3.7367387
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowCU
2nd rowCU
3rd rowCU
4th rowCU
5th rowCU

Common Values

ValueCountFrequency (%)
GS25 319
31.3%
CU 270
26.5%
세븐일레븐 258
25.3%
미니스톱 158
15.5%
씨스페이스 12
 
1.2%
CU(신규) 1
 
0.1%

Length

2023-12-11T18:56:49.542536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T18:56:49.683984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
gs25 319
31.3%
cu 270
26.5%
세븐일레븐 258
25.3%
미니스톱 158
15.5%
씨스페이스 12
 
1.2%
cu(신규 1
 
0.1%

종로구
Categorical

Distinct26
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
강남구
146 
송파구
 
63
서초구
 
61
동대문구
 
50
강동구
 
47
Other values (21)
651 

Length

Max length4
Median length3
Mean length3.0736739
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
강남구 146
 
14.3%
송파구 63
 
6.2%
서초구 61
 
6.0%
동대문구 50
 
4.9%
강동구 47
 
4.6%
마포구 47
 
4.6%
관악구 42
 
4.1%
성북구 42
 
4.1%
영등포구 39
 
3.8%
강북구 37
 
3.6%
Other values (16) 444
43.6%

Length

2023-12-11T18:56:49.823654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강남구 146
 
14.3%
송파구 63
 
6.2%
서초구 61
 
6.0%
동대문구 50
 
4.9%
강동구 47
 
4.6%
마포구 47
 
4.6%
관악구 42
 
4.1%
성북구 42
 
4.1%
영등포구 39
 
3.8%
강북구 37
 
3.6%
Other values (16) 444
43.6%
Distinct1003
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-11T18:56:50.079479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length5.608055
Min length2

Characters and Unicode

Total characters5709
Distinct characters383
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique988 ?
Unique (%)97.1%

Sample

1st row내자중앙점
2nd row대학로광장점
3rd row동대문역점
4th row동숭아트점
5th row마로니에점
ValueCountFrequency (%)
세븐일레븐 48
 
4.4%
b 17
 
1.6%
보라매점 2
 
0.2%
포이그린 2
 
0.2%
구일역점 2
 
0.2%
문정공원점 2
 
0.2%
수유점 2
 
0.2%
서초염곡점(내곡 2
 
0.2%
남대문점 2
 
0.2%
장안쉐르빌점 2
 
0.2%
Other values (994) 1004
92.5%
2023-12-11T18:56:50.545366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
743
 
13.0%
167
 
2.9%
129
 
2.3%
101
 
1.8%
97
 
1.7%
91
 
1.6%
82
 
1.4%
81
 
1.4%
80
 
1.4%
78
 
1.4%
Other values (373) 4060
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5317
93.1%
Decimal Number 146
 
2.6%
Space Separator 73
 
1.3%
Close Punctuation 62
 
1.1%
Open Punctuation 62
 
1.1%
Uppercase Letter 48
 
0.8%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
743
 
14.0%
167
 
3.1%
129
 
2.4%
101
 
1.9%
97
 
1.8%
91
 
1.7%
82
 
1.5%
81
 
1.5%
80
 
1.5%
78
 
1.5%
Other values (342) 3668
69.0%
Uppercase Letter
ValueCountFrequency (%)
B 18
37.5%
C 4
 
8.3%
K 3
 
6.2%
I 3
 
6.2%
M 2
 
4.2%
L 2
 
4.2%
N 2
 
4.2%
E 2
 
4.2%
G 2
 
4.2%
A 2
 
4.2%
Other values (7) 8
16.7%
Decimal Number
ValueCountFrequency (%)
2 48
32.9%
1 33
22.6%
3 31
21.2%
4 13
 
8.9%
5 6
 
4.1%
6 4
 
2.7%
8 4
 
2.7%
7 3
 
2.1%
9 2
 
1.4%
0 2
 
1.4%
Space Separator
ValueCountFrequency (%)
73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 62
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5317
93.1%
Common 344
 
6.0%
Latin 48
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
743
 
14.0%
167
 
3.1%
129
 
2.4%
101
 
1.9%
97
 
1.8%
91
 
1.7%
82
 
1.5%
81
 
1.5%
80
 
1.5%
78
 
1.5%
Other values (342) 3668
69.0%
Latin
ValueCountFrequency (%)
B 18
37.5%
C 4
 
8.3%
K 3
 
6.2%
I 3
 
6.2%
M 2
 
4.2%
L 2
 
4.2%
N 2
 
4.2%
E 2
 
4.2%
G 2
 
4.2%
A 2
 
4.2%
Other values (7) 8
16.7%
Common
ValueCountFrequency (%)
73
21.2%
) 62
18.0%
( 62
18.0%
2 48
14.0%
1 33
9.6%
3 31
9.0%
4 13
 
3.8%
5 6
 
1.7%
6 4
 
1.2%
8 4
 
1.2%
Other values (4) 8
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5317
93.1%
ASCII 392
 
6.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
743
 
14.0%
167
 
3.1%
129
 
2.4%
101
 
1.9%
97
 
1.8%
91
 
1.7%
82
 
1.5%
81
 
1.5%
80
 
1.5%
78
 
1.5%
Other values (342) 3668
69.0%
ASCII
ValueCountFrequency (%)
73
18.6%
) 62
15.8%
( 62
15.8%
2 48
12.2%
1 33
8.4%
3 31
7.9%
B 18
 
4.6%
4 13
 
3.3%
5 6
 
1.5%
6 4
 
1.0%
Other values (21) 42
10.7%
Distinct1017
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-11T18:56:50.898499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length45
Mean length22.702358
Min length6

Characters and Unicode

Total characters23111
Distinct characters381
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1016 ?
Unique (%)99.8%

Sample

1st row서울특별시 종로구 내자동 61-1번지
2nd row서울특별시 종로구 혜화동 93번지
3rd row서울특별시 종로구 창신동 464-16번지
4th row서울특별시 종로구 동숭동 28번지
5th row서울특별시 종로구 연건동 78-2번지
ValueCountFrequency (%)
서울 353
 
7.9%
서울특별시 319
 
7.1%
강남구 132
 
2.9%
1층 128
 
2.9%
서울시 116
 
2.6%
동대문구 47
 
1.0%
송파구 45
 
1.0%
성북구 40
 
0.9%
강동구 37
 
0.8%
마포구 35
 
0.8%
Other values (2086) 3228
72.1%
2023-12-11T18:56:51.424396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3560
 
15.4%
1 1466
 
6.3%
1202
 
5.2%
899
 
3.9%
882
 
3.8%
788
 
3.4%
2 776
 
3.4%
- 752
 
3.3%
3 652
 
2.8%
4 555
 
2.4%
Other values (371) 11579
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11847
51.3%
Decimal Number 5888
25.5%
Space Separator 3560
 
15.4%
Dash Punctuation 752
 
3.3%
Open Punctuation 368
 
1.6%
Close Punctuation 368
 
1.6%
Other Punctuation 238
 
1.0%
Uppercase Letter 71
 
0.3%
Lowercase Letter 17
 
0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1202
 
10.1%
899
 
7.6%
882
 
7.4%
788
 
6.7%
475
 
4.0%
446
 
3.8%
320
 
2.7%
319
 
2.7%
281
 
2.4%
274
 
2.3%
Other values (320) 5961
50.3%
Uppercase Letter
ValueCountFrequency (%)
B 15
21.1%
A 11
15.5%
S 5
 
7.0%
C 5
 
7.0%
M 5
 
7.0%
E 4
 
5.6%
D 3
 
4.2%
L 3
 
4.2%
I 3
 
4.2%
P 3
 
4.2%
Other values (10) 14
19.7%
Lowercase Letter
ValueCountFrequency (%)
a 3
17.6%
l 2
11.8%
e 2
11.8%
o 2
11.8%
k 2
11.8%
g 1
 
5.9%
j 1
 
5.9%
n 1
 
5.9%
t 1
 
5.9%
s 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 1466
24.9%
2 776
13.2%
3 652
11.1%
4 555
 
9.4%
0 470
 
8.0%
6 448
 
7.6%
5 429
 
7.3%
7 394
 
6.7%
8 378
 
6.4%
9 320
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 223
93.7%
. 11
 
4.6%
@ 3
 
1.3%
/ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
3560
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 752
100.0%
Open Punctuation
ValueCountFrequency (%)
( 368
100.0%
Close Punctuation
ValueCountFrequency (%)
) 368
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11846
51.3%
Common 11176
48.4%
Latin 88
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1202
 
10.1%
899
 
7.6%
882
 
7.4%
788
 
6.7%
475
 
4.0%
446
 
3.8%
320
 
2.7%
319
 
2.7%
281
 
2.4%
274
 
2.3%
Other values (319) 5960
50.3%
Latin
ValueCountFrequency (%)
B 15
17.0%
A 11
 
12.5%
S 5
 
5.7%
C 5
 
5.7%
M 5
 
5.7%
E 4
 
4.5%
D 3
 
3.4%
L 3
 
3.4%
a 3
 
3.4%
I 3
 
3.4%
Other values (21) 31
35.2%
Common
ValueCountFrequency (%)
3560
31.9%
1 1466
13.1%
2 776
 
6.9%
- 752
 
6.7%
3 652
 
5.8%
4 555
 
5.0%
0 470
 
4.2%
6 448
 
4.0%
5 429
 
3.8%
7 394
 
3.5%
Other values (10) 1674
15.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11846
51.3%
ASCII 11264
48.7%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3560
31.6%
1 1466
13.0%
2 776
 
6.9%
- 752
 
6.7%
3 652
 
5.8%
4 555
 
4.9%
0 470
 
4.2%
6 448
 
4.0%
5 429
 
3.8%
7 394
 
3.5%
Other values (41) 1762
15.6%
Hangul
ValueCountFrequency (%)
1202
 
10.1%
899
 
7.6%
882
 
7.4%
788
 
6.7%
475
 
4.0%
446
 
3.8%
320
 
2.7%
319
 
2.7%
281
 
2.4%
274
 
2.3%
Other values (319) 5960
50.3%
CJK
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-11T18:56:51.515611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
CU종로구
CU1.0000.395
종로구0.3951.000
2023-12-11T18:56:51.599390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종로구CU
종로구1.0000.187
CU0.1871.000
2023-12-11T18:56:51.692060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
CU종로구
CU1.0000.187
종로구0.1871.000

Missing values

2023-12-11T18:56:49.336063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T18:56:49.419094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

CU종로구광화문광장점서울특별시 종로구 신문로1가 1-1번지
0CU종로구내자중앙점서울특별시 종로구 내자동 61-1번지
1CU종로구대학로광장점서울특별시 종로구 혜화동 93번지
2CU종로구동대문역점서울특별시 종로구 창신동 464-16번지
3CU종로구동숭아트점서울특별시 종로구 동숭동 28번지
4CU종로구마로니에점서울특별시 종로구 연건동 78-2번지
5CU종로구명륜성대점서울특별시 종로구 명륜4가 187번지
6CU종로구종로공원점서울 종로구 율곡로271,(종로6가) 1층
7CU종로구종로삼청점서울특별시 종로구 삼청동 22번지
8CU종로구종로신교점서울특별시 종로구 신교동 36번지 수만빌리지
9CU종로구종로연지점서울특별시 종로구 종로31길 54 아르젠오피스텔 101호
CU종로구광화문광장점서울특별시 종로구 신문로1가 1-1번지
1008세븐일레븐강동구명일삼익점서울특별시 강동구 명일동 양재대로 128길 47
1009세븐일레븐강동구명일점서울 강동구 명일동 306-5
1010세븐일레븐강동구암사희망점암사동 469-17
1011세븐일레븐강동구천호쌍용점서울 강동구 천호동 432-10
1012세븐일레븐강동구천호역점서울 강동구 천호2동 429-2
1013세븐일레븐강동구세븐일레븐 성내삼성점서울특별시 강동구 성내로9길 351층 (성내동)
1014세븐일레븐강동구세븐일레븐 강동고덕점서울특별시 강동구 동남로75길 13-25
1015세븐일레븐강동구세븐일레븐 길동4호점서울특별시 강동구 명일로210 (길동)
1016씨스페이스강동구강동상일점서울시 강동구 상일동 437-8 1층
1017GS25강동구천호중앙점상암로162

Duplicate rows

Most frequently occurring

CU종로구광화문광장점서울특별시 종로구 신문로1가 1-1번지# duplicates
0CU서초구서초염곡점(내곡)서울특별시 서초구 염곡동 106-7번지2