Overview

Dataset statistics

Number of variables: 4
Number of observations: 1040
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 32.6 KiB
Average record size in memory: 32.1 B
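The average record size can be checked from the total size and the number of observations. A minimal sketch, assuming the report divides total bytes by row count; the variable names are illustrative, and because the report rounds the total size the result is approximate:

```python
# Figures copied from the dataset statistics above.
total_size_kib = 32.6   # "Total size in memory", already rounded by the report
n_observations = 1040   # "Number of observations"

# 1 KiB = 1024 bytes; divide the total by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```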

Variable types

Text: 3
Categorical: 1

Dataset

Description: File download
Author: Gangseo-gu
URL: https://data.seoul.go.kr/dataList/OA-21840/F/1/datasetView.do

Alerts

기준일 (reference date) has constant value "2019-07-02" (Constant)

Reproduction

Analysis started: 2024-04-21 20:46:02.865618
Analysis finished: 2024-04-21 20:46:04.345681
Duration: 1.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)
Text

Distinct: 1008
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Length

Max length: 29
Median length: 21
Mean length: 8.4826923
Min length: 2
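As an illustration of how such length statistics can be derived, here is a small sketch over the first five 업소명 values from the report's sample rows. It assumes "length" means the number of Unicode code points per value, which is what Python's `len` returns for a string; the profiler's exact method is not confirmed here.

```python
import statistics

# First five 업소명 values, copied from the report's sample rows.
names = ["세븐일레븐 화곡메인점", "GS25 염창사랑점", "전자담배", "금원마트", "전자담배"]

# len() counts code points, so each precomposed Hangul syllable counts as one.
lengths = [len(name) for name in names]

summary = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(summary)

# Report-level check: mean length equals total characters / observations.
report_mean = 8822 / 1040
print(round(report_mean, 7))
```

The last two lines use the report's own totals for this variable (8822 characters over 1040 observations) and reproduce the mean length listed above.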

Characters and Unicode

Total characters: 8822
Distinct characters: 474
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
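To make the idea of per-code-point properties concrete, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiler computes its tables.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two 업소명 values copied from the report's sample rows.
sample = ["세븐일레븐 화곡메인점", "GS25 염창사랑점"]
counts = category_counts(sample)

# Lo = Other Letter, Zs = Space Separator,
# Lu = Uppercase Letter, Nd = Decimal Number
print(dict(counts))
```

Hangul syllables fall under Other Letter (`Lo`), which is why that category dominates the tables in this report.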

Unique

Unique: 988
Unique (%): 95.0%
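The report lists both a Distinct and a Unique count. A minimal sketch of the difference, assuming Distinct is the number of different values and Unique is the number of values that occur exactly once (this reading is an assumption about the profiler's definitions):

```python
from collections import Counter

# First five 업소명 values, copied from the report's sample rows.
names = ["세븐일레븐 화곡메인점", "GS25 염창사랑점", "전자담배", "금원마트", "전자담배"]

freq = Counter(names)

# Distinct: how many different values appear at all.
n_distinct = len(freq)

# Unique: how many values appear exactly once.
n_unique = sum(1 for count in freq.values() if count == 1)

print(n_distinct, n_unique)
```

Here 전자담배 appears twice, so it contributes to the distinct count but not to the unique count.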

Sample

1st row: 세븐일레븐 화곡메인점
2nd row: GS25 염창사랑점
3rd row: 전자담배
4th row: 금원마트
5th row: 전자담배

Value | Count | Frequency (%)
씨유 | 125 | 7.7%
gs25 | 98 | 6.0%
세븐일레븐 | 48 | 3.0%
주)코리아세븐 | 40 | 2.5%
이마트24 | 30 | 1.8%
지에스25 | 24 | 1.5%
미니스톱 | 20 | 1.2%
주)비지에프휴먼넷 | 12 | 0.7%
전자담배 | 9 | 0.6%
주식회사 | 9 | 0.6%
Other values (1016) | 1210 | 74.5%

Most occurring characters

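Value | Count | Frequency (%)
(space) | 587 | 6.7%
(glyph lost) | 521 | 5.9%
(glyph lost) | 278 | 3.2%
(glyph lost) | 227 | 2.6%
(glyph lost) | 221 | 2.5%
(glyph lost) | 207 | 2.3%
2 | 201 | 2.3%
5 | 170 | 1.9%
(glyph lost) | 162 | 1.8%
(glyph lost) | 154 | 1.7%
Other values (464) | 6094 | 69.1%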

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7091 | 80.4%
Space Separator | 587 | 6.7%
Decimal Number | 490 | 5.6%
Uppercase Letter | 337 | 3.8%
Close Punctuation | 124 | 1.4%
Open Punctuation | 123 | 1.4%
Lowercase Letter | 54 | 0.6%
Other Punctuation | 10 | 0.1%
Other Symbol | 4 | < 0.1%
Dash Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 521 | 7.3%
(glyph lost) | 278 | 3.9%
(glyph lost) | 227 | 3.2%
(glyph lost) | 221 | 3.1%
(glyph lost) | 207 | 2.9%
(glyph lost) | 162 | 2.3%
(glyph lost) | 154 | 2.2%
(glyph lost) | 147 | 2.1%
(glyph lost) | 129 | 1.8%
(glyph lost) | 128 | 1.8%
Other values (406) | 4917 | 69.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 132 | 39.2%
G | 124 | 36.8%
C | 11 | 3.3%
U | 8 | 2.4%
L | 8 | 2.4%
K | 7 | 2.1%
M | 7 | 2.1%
T | 5 | 1.5%
P | 4 | 1.2%
D | 4 | 1.2%
Other values (13) | 27 | 8.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 12 | 22.2%
r | 5 | 9.3%
a | 5 | 9.3%
k | 4 | 7.4%
i | 4 | 7.4%
t | 3 | 5.6%
n | 3 | 5.6%
m | 3 | 5.6%
s | 3 | 5.6%
y | 3 | 5.6%
Other values (8) | 9 | 16.7%

Decimal Number
Value | Count | Frequency (%)
2 | 201 | 41.0%
5 | 170 | 34.7%
4 | 40 | 8.2%
1 | 25 | 5.1%
9 | 18 | 3.7%
0 | 11 | 2.2%
3 | 10 | 2.0%
8 | 5 | 1.0%
7 | 5 | 1.0%
6 | 5 | 1.0%

Other Punctuation
Value | Count | Frequency (%)
. | 8 | 80.0%
& | 2 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 587 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 124 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 123 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(glyph lost) | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7095 | 80.4%
Common | 1336 | 15.1%
Latin | 391 | 4.4%

Most frequent character per script

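Hangul
Value | Count | Frequency (%)
(glyph lost) | 521 | 7.3%
(glyph lost) | 278 | 3.9%
(glyph lost) | 227 | 3.2%
(glyph lost) | 221 | 3.1%
(glyph lost) | 207 | 2.9%
(glyph lost) | 162 | 2.3%
(glyph lost) | 154 | 2.2%
(glyph lost) | 147 | 2.1%
(glyph lost) | 129 | 1.8%
(glyph lost) | 128 | 1.8%
Other values (407) | 4921 | 69.4%

Latin
Value | Count | Frequency (%)
S | 132 | 33.8%
G | 124 | 31.7%
e | 12 | 3.1%
C | 11 | 2.8%
U | 8 | 2.0%
L | 8 | 2.0%
K | 7 | 1.8%
M | 7 | 1.8%
T | 5 | 1.3%
r | 5 | 1.3%
Other values (31) | 72 | 18.4%

Common
Value | Count | Frequency (%)
(space) | 587 | 43.9%
2 | 201 | 15.0%
5 | 170 | 12.7%
) | 124 | 9.3%
( | 123 | 9.2%
4 | 40 | 3.0%
1 | 25 | 1.9%
9 | 18 | 1.3%
0 | 11 | 0.8%
3 | 10 | 0.7%
Other values (6) | 27 | 2.0%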

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7091 | 80.4%
ASCII | 1727 | 19.6%
None | 4 | < 0.1%

Most frequent character per block

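ASCII
Value | Count | Frequency (%)
(space) | 587 | 34.0%
2 | 201 | 11.6%
5 | 170 | 9.8%
S | 132 | 7.6%
G | 124 | 7.2%
) | 124 | 7.2%
( | 123 | 7.1%
4 | 40 | 2.3%
1 | 25 | 1.4%
9 | 18 | 1.0%
Other values (47) | 183 | 10.6%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 521 | 7.3%
(glyph lost) | 278 | 3.9%
(glyph lost) | 227 | 3.2%
(glyph lost) | 221 | 3.1%
(glyph lost) | 207 | 2.9%
(glyph lost) | 162 | 2.3%
(glyph lost) | 154 | 2.2%
(glyph lost) | 147 | 2.1%
(glyph lost) | 129 | 1.8%
(glyph lost) | 128 | 1.8%
Other values (406) | 4917 | 69.3%

None
Value | Count | Frequency (%)
(glyph lost) | 4 | 100.0%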

업소지번주소 (business lot-number address)
Text

Distinct: 1024
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Length

Max length: 69
Median length: 46
Mean length: 26.746154
Min length: 4

Characters and Unicode

Total characters: 27816
Distinct characters: 312
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1009
Unique (%): 97.0%

Sample

1st row: 서울특별시 강서구 화곡동 465번지 8호
2nd row: 서울특별시 강서구 염창동 293번지 한강동아아파트
3rd row: 서울특별시 강서구 마곡동 799번지 16호
4th row: 서울특별시 강서구 방화동 607번지 153호
5th row: 서울특별시 강서구 마곡동 797번지

Value | Count | Frequency (%)
서울특별시 | 1039 | 17.8%
강서구 | 1039 | 17.8%
화곡동 | 299 | 5.1%
마곡동 | 115 | 2.0%
1층 | 112 | 1.9%
1호 | 105 | 1.8%
방화동 | 102 | 1.7%
등촌동 | 93 | 1.6%
공항동 | 66 | 1.1%
내발산동 | 60 | 1.0%
Other values (1196) | 2800 | 48.0%

Most occurring characters

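Value | Count | Frequency (%)
(space) | 5178 | 18.6%
(glyph lost) | 2098 | 7.5%
1 | 1253 | 4.5%
(glyph lost) | 1119 | 4.0%
(glyph lost) | 1092 | 3.9%
(glyph lost) | 1067 | 3.8%
(glyph lost) | 1057 | 3.8%
(glyph lost) | 1054 | 3.8%
(glyph lost) | 1048 | 3.8%
(glyph lost) | 1042 | 3.7%
Other values (302) | 11808 | 42.5%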

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16977 | 61.0%
Decimal Number | 5510 | 19.8%
Space Separator | 5178 | 18.6%
Uppercase Letter | 57 | 0.2%
Other Punctuation | 28 | 0.1%
Close Punctuation | 18 | 0.1%
Open Punctuation | 18 | 0.1%
Lowercase Letter | 11 | < 0.1%
Math Symbol | 10 | < 0.1%
Dash Punctuation | 8 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 2098 | 12.4%
(glyph lost) | 1119 | 6.6%
(glyph lost) | 1092 | 6.4%
(glyph lost) | 1067 | 6.3%
(glyph lost) | 1057 | 6.2%
(glyph lost) | 1054 | 6.2%
(glyph lost) | 1048 | 6.2%
(glyph lost) | 1042 | 6.1%
(glyph lost) | 1039 | 6.1%
(glyph lost) | 1039 | 6.1%
Other values (264) | 5322 | 31.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 23 | 40.4%
A | 6 | 10.5%
P | 6 | 10.5%
X | 6 | 10.5%
S | 3 | 5.3%
T | 3 | 5.3%
C | 2 | 3.5%
K | 2 | 3.5%
I | 1 | 1.8%
V | 1 | 1.8%
Other values (4) | 4 | 7.0%

Decimal Number
Value | Count | Frequency (%)
1 | 1253 | 22.7%
2 | 575 | 10.4%
7 | 516 | 9.4%
6 | 492 | 8.9%
0 | 492 | 8.9%
3 | 482 | 8.7%
4 | 475 | 8.6%
5 | 436 | 7.9%
8 | 410 | 7.4%
9 | 379 | 6.9%

Lowercase Letter
Value | Count | Frequency (%)
k | 4 | 36.4%
y | 2 | 18.2%
a | 2 | 18.2%
r | 2 | 18.2%
e | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
. | 24 | 85.7%
@ | 3 | 10.7%
& | 1 | 3.6%

Space Separator
Value | Count | Frequency (%)
(space) | 5178 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 18 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 18 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 10 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 8 | 100.0%

Letter Number
Value | Count | Frequency (%)
(glyph lost) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16977 | 61.0%
Common | 10770 | 38.7%
Latin | 69 | 0.2%

Most frequent character per script

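Hangul
Value | Count | Frequency (%)
(glyph lost) | 2098 | 12.4%
(glyph lost) | 1119 | 6.6%
(glyph lost) | 1092 | 6.4%
(glyph lost) | 1067 | 6.3%
(glyph lost) | 1057 | 6.2%
(glyph lost) | 1054 | 6.2%
(glyph lost) | 1048 | 6.2%
(glyph lost) | 1042 | 6.1%
(glyph lost) | 1039 | 6.1%
(glyph lost) | 1039 | 6.1%
Other values (264) | 5322 | 31.3%

Latin
Value | Count | Frequency (%)
B | 23 | 33.3%
A | 6 | 8.7%
P | 6 | 8.7%
X | 6 | 8.7%
k | 4 | 5.8%
S | 3 | 4.3%
T | 3 | 4.3%
y | 2 | 2.9%
a | 2 | 2.9%
r | 2 | 2.9%
Other values (10) | 12 | 17.4%

Common
Value | Count | Frequency (%)
(space) | 5178 | 48.1%
1 | 1253 | 11.6%
2 | 575 | 5.3%
7 | 516 | 4.8%
6 | 492 | 4.6%
0 | 492 | 4.6%
3 | 482 | 4.5%
4 | 475 | 4.4%
5 | 436 | 4.0%
8 | 410 | 3.8%
Other values (8) | 461 | 4.3%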

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16977 | 61.0%
ASCII | 10838 | 39.0%
Number Forms | 1 | < 0.1%

Most frequent character per block

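ASCII
Value | Count | Frequency (%)
(space) | 5178 | 47.8%
1 | 1253 | 11.6%
2 | 575 | 5.3%
7 | 516 | 4.8%
6 | 492 | 4.5%
0 | 492 | 4.5%
3 | 482 | 4.4%
4 | 475 | 4.4%
5 | 436 | 4.0%
8 | 410 | 3.8%
Other values (27) | 529 | 4.9%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 2098 | 12.4%
(glyph lost) | 1119 | 6.6%
(glyph lost) | 1092 | 6.4%
(glyph lost) | 1067 | 6.3%
(glyph lost) | 1057 | 6.2%
(glyph lost) | 1054 | 6.2%
(glyph lost) | 1048 | 6.2%
(glyph lost) | 1042 | 6.1%
(glyph lost) | 1039 | 6.1%
(glyph lost) | 1039 | 6.1%
Other values (264) | 5322 | 31.3%

Number Forms
Value | Count | Frequency (%)
(glyph lost) | 1 | 100.0%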

업소도로명주소 (business road-name address)
Text

Distinct: 1013
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Length

Max length: 72
Median length: 52
Mean length: 31.672115
Min length: 1

Characters and Unicode

Total characters: 32939
Distinct characters: 348
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1010
Unique (%): 97.1%

Sample

1st row: 서울특별시 강서구 곰달래로49길 112. 1층 (화곡동)
2nd row: 서울특별시 강서구 양천로73길 80. 상가동 103호 (염창동. 한강동아아파트)
3rd row: 서울특별시 강서구 마곡중앙2로 11. 108호 (마곡동)
4th row: 서울특별시 강서구 방화동로12길 14. 1층 (방화동)
5th row: 서울특별시 강서구 공항대로 237. 113호 (마곡동)

Value | Count | Frequency (%)
서울특별시 | 1013 | 16.3%
강서구 | 1013 | 16.3%
화곡동 | 352 | 5.7%
1층 | 282 | 4.5%
방화동 | 115 | 1.9%
마곡동 | 114 | 1.8%
등촌동 | 101 | 1.6%
강서로 | 73 | 1.2%
101호 | 70 | 1.1%
양천로 | 61 | 1.0%
Other values (1297) | 3015 | 48.6%

Most occurring characters

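Value | Count | Frequency (%)
(space) | 5508 | 16.7%
(glyph lost) | 2256 | 6.8%
1 | 1614 | 4.9%
(glyph lost) | 1233 | 3.7%
(glyph lost) | 1163 | 3.5%
(glyph lost) | 1035 | 3.1%
( | 1028 | 3.1%
) | 1027 | 3.1%
(glyph lost) | 1023 | 3.1%
(glyph lost) | 1020 | 3.1%
Other values (338) | 16032 | 48.7%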

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 19075 | 57.9%
Space Separator | 5508 | 16.7%
Decimal Number | 5167 | 15.7%
Open Punctuation | 1028 | 3.1%
Close Punctuation | 1027 | 3.1%
Other Punctuation | 940 | 2.9%
Dash Punctuation | 96 | 0.3%
Uppercase Letter | 68 | 0.2%
Math Symbol | 14 | < 0.1%
Lowercase Letter | 11 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 2256 | 11.8%
(glyph lost) | 1233 | 6.5%
(glyph lost) | 1163 | 6.1%
(glyph lost) | 1035 | 5.4%
(glyph lost) | 1023 | 5.4%
(glyph lost) | 1020 | 5.3%
(glyph lost) | 1019 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 774 | 4.1%
Other values (295) | 7526 | 39.5%

Uppercase Letter
Value | Count | Frequency (%)
B | 29 | 42.6%
A | 9 | 13.2%
P | 5 | 7.4%
E | 3 | 4.4%
S | 3 | 4.4%
X | 3 | 4.4%
C | 2 | 2.9%
N | 2 | 2.9%
T | 2 | 2.9%
G | 1 | 1.5%
Other values (9) | 9 | 13.2%

Decimal Number
Value | Count | Frequency (%)
1 | 1614 | 31.2%
2 | 550 | 10.6%
0 | 535 | 10.4%
3 | 480 | 9.3%
4 | 447 | 8.7%
5 | 408 | 7.9%
6 | 352 | 6.8%
7 | 312 | 6.0%
8 | 262 | 5.1%
9 | 207 | 4.0%

Lowercase Letter
Value | Count | Frequency (%)
k | 4 | 36.4%
a | 2 | 18.2%
r | 2 | 18.2%
y | 2 | 18.2%
e | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
. | 938 | 99.8%
@ | 2 | 0.2%

Letter Number
Value | Count | Frequency (%)
(glyph lost) | 3 | 60.0%
(glyph lost) | 2 | 40.0%

Space Separator
Value | Count | Frequency (%)
(space) | 5508 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1028 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1027 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 96 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 14 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 19075 | 57.9%
Common | 13780 | 41.8%
Latin | 84 | 0.3%

Most frequent character per script

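Hangul
Value | Count | Frequency (%)
(glyph lost) | 2256 | 11.8%
(glyph lost) | 1233 | 6.5%
(glyph lost) | 1163 | 6.1%
(glyph lost) | 1035 | 5.4%
(glyph lost) | 1023 | 5.4%
(glyph lost) | 1020 | 5.3%
(glyph lost) | 1019 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 774 | 4.1%
Other values (295) | 7526 | 39.5%

Latin
Value | Count | Frequency (%)
B | 29 | 34.5%
A | 9 | 10.7%
P | 5 | 6.0%
k | 4 | 4.8%
E | 3 | 3.6%
(glyph lost) | 3 | 3.6%
S | 3 | 3.6%
X | 3 | 3.6%
C | 2 | 2.4%
a | 2 | 2.4%
Other values (16) | 21 | 25.0%

Common
Value | Count | Frequency (%)
(space) | 5508 | 40.0%
1 | 1614 | 11.7%
( | 1028 | 7.5%
) | 1027 | 7.5%
. | 938 | 6.8%
2 | 550 | 4.0%
0 | 535 | 3.9%
3 | 480 | 3.5%
4 | 447 | 3.2%
5 | 408 | 3.0%
Other values (7) | 1245 | 9.0%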

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 19075 | 57.9%
ASCII | 13859 | 42.1%
Number Forms | 5 | < 0.1%

Most frequent character per block

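ASCII
Value | Count | Frequency (%)
(space) | 5508 | 39.7%
1 | 1614 | 11.6%
( | 1028 | 7.4%
) | 1027 | 7.4%
. | 938 | 6.8%
2 | 550 | 4.0%
0 | 535 | 3.9%
3 | 480 | 3.5%
4 | 447 | 3.2%
5 | 408 | 2.9%
Other values (31) | 1324 | 9.6%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 2256 | 11.8%
(glyph lost) | 1233 | 6.5%
(glyph lost) | 1163 | 6.1%
(glyph lost) | 1035 | 5.4%
(glyph lost) | 1023 | 5.4%
(glyph lost) | 1020 | 5.3%
(glyph lost) | 1019 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 1013 | 5.3%
(glyph lost) | 774 | 4.1%
Other values (295) | 7526 | 39.5%

Number Forms
Value | Count | Frequency (%)
(glyph lost) | 3 | 60.0%
(glyph lost) | 2 | 40.0%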

기준일 (reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Value | Count
2019-07-02 | 1040

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019-07-02
2nd row: 2019-07-02
3rd row: 2019-07-02
4th row: 2019-07-02
5th row: 2019-07-02

Common Values

Value | Count | Frequency (%)
2019-07-02 | 1040 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2019-07-02 | 1040 | 100.0%

Missing values

Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
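As a rough illustration of what a nullity matrix encodes, the sketch below marks each cell as present or missing and prints one line per row. The second row is hypothetical and included only to show a gap, since the report lists 0 missing cells for this dataset; the rendering is a text stand-in, not the profiler's plot.

```python
columns = ["업소명", "업소지번주소", "업소도로명주소", "기준일"]

rows = [
    # Row 2 from the report's sample.
    ("전자담배", "서울특별시 강서구 마곡동 799번지 16호",
     "서울특별시 강서구 마곡중앙2로 11. 108호 (마곡동)", "2019-07-02"),
    # Hypothetical row with a missing road-name address (illustration only).
    ("(hypothetical)", "(hypothetical)", None, "2019-07-02"),
]

# True where a value is present, False where it is missing.
nullity = [[cell is not None for cell in row] for row in rows]

# Total number of missing cells across the matrix.
missing_cells = sum(not present for row in nullity for present in row)

# Text rendering: '#' for a present cell, '.' for a missing one.
rendered = ["".join("#" if present else "." for present in row) for row in nullity]
for line in rendered:
    print(line)
print("missing cells:", missing_cells)
```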

Sample

First rows

Index | 업소명 | 업소지번주소 | 업소도로명주소 | 기준일
0 | 세븐일레븐 화곡메인점 | 서울특별시 강서구 화곡동 465번지 8호 | 서울특별시 강서구 곰달래로49길 112. 1층 (화곡동) | 2019-07-02
1 | GS25 염창사랑점 | 서울특별시 강서구 염창동 293번지 한강동아아파트 | 서울특별시 강서구 양천로73길 80. 상가동 103호 (염창동. 한강동아아파트) | 2019-07-02
2 | 전자담배 | 서울특별시 강서구 마곡동 799번지 16호 | 서울특별시 강서구 마곡중앙2로 11. 108호 (마곡동) | 2019-07-02
3 | 금원마트 | 서울특별시 강서구 방화동 607번지 153호 | 서울특별시 강서구 방화동로12길 14. 1층 (방화동) | 2019-07-02
4 | 전자담배 | 서울특별시 강서구 마곡동 797번지 | 서울특별시 강서구 공항대로 237. 113호 (마곡동) | 2019-07-02
5 | 미니스톱 발산역점 | 서울특별시 강서구 등촌동 674번지 1호 | 서울특별시 강서구 강서로 378. 1층 103호 (등촌동) | 2019-07-02
6 | 전자담배백화점 | 서울특별시 강서구 내발산동 702번지 4호 | 서울특별시 강서구 강서로 294. 1층 105호 (내발산동) | 2019-07-02
7 | 세븐일레븐 화곡푸르지오점 | 서울특별시 강서구 화곡동 1091번지 화곡푸르지오 | 서울특별시 강서구 화곡로13길 107. 상가동 102-2.103-1호 (화곡동. 화곡푸르지오) | 2019-07-02
8 | 미니스톱 화곡명월점 | 서울특별시 강서구 화곡동 918번지 21호 기룡빌딩 | 서울특별시 강서구 곰달래로 104. 기룡빌딩 1층 (화곡동) | 2019-07-02
9 | 세븐일레븐 마곡대방디엠점 | 서울특별시 강서구 마곡동 776번지 마곡파크뷰대방디엠시티오피스텔 | 서울특별시 강서구 마곡동로10길 23. 마곡파크뷰대방디엠시티오피스텔 127호 (마곡동) | 2019-07-02

Last rows

Index | 업소명 | 업소지번주소 | 업소도로명주소 | 기준일
1030 | 가판점 | 서울특별시 강서구 내발산동 719번지 6호 | 서울특별시 강서구 강서로 267 (내발산동) | 2019-07-02
1031 | 화동슈퍼 | 서울특별시 강서구 화곡동 798번지 20호 | 서울특별시 강서구 곰달래로55길 20 (화곡동) | 2019-07-02
1032 | 현대이발 | 서울특별시 강서구 화곡동 1012번지 19호 | 서울특별시 강서구 강서로45가길 40 (화곡동) | 2019-07-02
1033 | 가정슈퍼 | 서울특별시 강서구 등촌동 641번지 12호 | 서울특별시 강서구 양천로60길 60 (등촌동) | 2019-07-02
1034 | 동산슈퍼 | 서울특별시 강서구 방화동 615번지 54호 | 서울특별시 강서구 개화동로27나길 43 (방화동) | 2019-07-02
1035 | GS25 화곡한빛 | 서울특별시 강서구 화곡동 24번지 552호 | 서울특별시 강서구 까치산로 71 (화곡동) | 2019-07-02
1036 | 낙지골 | 서울특별시 강서구 등촌동 647번지 24 호 | 서울특별시 강서구 공항대로59길 32 (등촌동) | 2019-07-02
1037 | 강남슈퍼 | 서울특별시 강서구 화곡동 156번지 1호 | 서울특별시 강서구 강서로18길 98 (화곡동) | 2019-07-02
1038 | 알뜰미니슈퍼 | 서울특별시 강서구 방화동 604번지 18호 | 서울특별시 강서구 금낭화로1길 16-7 (방화동) | 2019-07-02
1039 | 학생사 | 서울특별시 강서구 공항동 45번지 49호 | 서울특별시 강서구 공항대로3길 17 (공항동) | 2019-07-02