Overview

Dataset statistics

Number of variables5
Number of observations178
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory42.7 B

Variable types

Text2
Categorical1
Numeric2

Dataset

Description파일 다운로드
Author도봉구
URLhttps://data.seoul.go.kr/dataList/OA-13667/F/1/datasetView.do

Alerts

위도 is highly overall correlated with 동구분High correlation
동구분 is highly overall correlated with 위도High correlation
업소명 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-11 09:32:13.226793
Analysis finished2023-12-11 09:32:13.948106
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct178
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T18:32:14.106692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length9.494382
Min length3

Characters and Unicode

Total characters1690
Distinct characters167
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)100.0%

Sample

1st row창동25시편의점
2nd row채널큐24시창동점
3rd row토마토
4th row티엠티24쌍문레미안점
5th row포시즌편의점(도봉점)
ValueCountFrequency (%)
gs25 52
 
15.0%
cu 50
 
14.5%
세븐일레븐 39
 
11.3%
위드미 12
 
3.5%
미니스톱 11
 
3.2%
쌍문중앙점 3
 
0.9%
도봉방학점 3
 
0.9%
쌍문역점 3
 
0.9%
창동중앙점 2
 
0.6%
도봉공원점 2
 
0.6%
Other values (156) 169
48.8%
2023-12-11T18:32:14.458568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
168
 
9.9%
161
 
9.5%
79
 
4.7%
2 59
 
3.5%
5 54
 
3.2%
54
 
3.2%
G 52
 
3.1%
S 52
 
3.1%
U 51
 
3.0%
51
 
3.0%
Other values (157) 909
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1186
70.2%
Uppercase Letter 207
 
12.2%
Space Separator 168
 
9.9%
Decimal Number 125
 
7.4%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
161
 
13.6%
79
 
6.7%
54
 
4.6%
51
 
4.3%
51
 
4.3%
48
 
4.0%
41
 
3.5%
41
 
3.5%
40
 
3.4%
40
 
3.4%
Other values (140) 580
48.9%
Decimal Number
ValueCountFrequency (%)
2 59
47.2%
5 54
43.2%
4 4
 
3.2%
1 3
 
2.4%
3 2
 
1.6%
7 1
 
0.8%
0 1
 
0.8%
8 1
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
G 52
25.1%
S 52
25.1%
U 51
24.6%
C 50
24.2%
Y 1
 
0.5%
B 1
 
0.5%
Space Separator
ValueCountFrequency (%)
168
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1186
70.2%
Common 297
 
17.6%
Latin 207
 
12.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
161
 
13.6%
79
 
6.7%
54
 
4.6%
51
 
4.3%
51
 
4.3%
48
 
4.0%
41
 
3.5%
41
 
3.5%
40
 
3.4%
40
 
3.4%
Other values (140) 580
48.9%
Common
ValueCountFrequency (%)
168
56.6%
2 59
 
19.9%
5 54
 
18.2%
4 4
 
1.3%
1 3
 
1.0%
( 2
 
0.7%
) 2
 
0.7%
3 2
 
0.7%
7 1
 
0.3%
0 1
 
0.3%
Latin
ValueCountFrequency (%)
G 52
25.1%
S 52
25.1%
U 51
24.6%
C 50
24.2%
Y 1
 
0.5%
B 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1186
70.2%
ASCII 504
29.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
168
33.3%
2 59
 
11.7%
5 54
 
10.7%
G 52
 
10.3%
S 52
 
10.3%
U 51
 
10.1%
C 50
 
9.9%
4 4
 
0.8%
1 3
 
0.6%
( 2
 
0.4%
Other values (7) 9
 
1.8%
Hangul
ValueCountFrequency (%)
161
 
13.6%
79
 
6.7%
54
 
4.6%
51
 
4.3%
51
 
4.3%
48
 
4.0%
41
 
3.5%
41
 
3.5%
40
 
3.4%
40
 
3.4%
Other values (140) 580
48.9%

동구분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
창동
63 
쌍문동
43 
방학동
40 
도봉동
30 
창4동
 
1

Length

Max length4
Median length3
Mean length2.6516854
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row창동
2nd row창동
3rd row방학동
4th row쌍문동
5th row도봉동

Common Values

ValueCountFrequency (%)
창동 63
35.4%
쌍문동 43
24.2%
방학동 40
22.5%
도봉동 30
16.9%
창4동 1
 
0.6%
도봉2동 1
 
0.6%

Length

2023-12-11T18:32:14.594514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T18:32:14.712663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
창동 63
35.4%
쌍문동 43
24.2%
방학동 40
22.5%
도봉동 30
16.9%
창4동 1
 
0.6%
도봉2동 1
 
0.6%

도로명주소
Text

UNIQUE 

Distinct178
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T18:32:15.012091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length37
Mean length16.792135
Min length10

Characters and Unicode

Total characters2989
Distinct characters108
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)100.0%

Sample

1st row도봉구 노해로63길 79. 106호 (창동.우림빌딩)
2nd row도봉구 우이천로20길 37
3rd row도봉구 방학로6길 4
4th row도봉구 우이천로 330 (삼성레미안상가 102호. 103호)
5th row도봉구 도봉로169길 202
ValueCountFrequency (%)
도봉구 178
26.9%
1층 41
 
6.2%
도봉로 12
 
1.8%
마들로 9
 
1.4%
해등로 8
 
1.2%
시루봉로 7
 
1.1%
상가동 7
 
1.1%
10 7
 
1.1%
창동 6
 
0.9%
도당로 5
 
0.8%
Other values (273) 382
57.7%
2023-12-11T18:32:15.523388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
490
16.4%
253
 
8.5%
253
 
8.5%
1 246
 
8.2%
178
 
6.0%
173
 
5.8%
116
 
3.9%
2 94
 
3.1%
0 86
 
2.9%
6 83
 
2.8%
Other values (98) 1017
34.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1517
50.8%
Decimal Number 842
28.2%
Space Separator 490
 
16.4%
Other Punctuation 65
 
2.2%
Close Punctuation 29
 
1.0%
Open Punctuation 29
 
1.0%
Dash Punctuation 12
 
0.4%
Uppercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
253
16.7%
253
16.7%
178
11.7%
173
11.4%
116
 
7.6%
42
 
2.8%
34
 
2.2%
33
 
2.2%
32
 
2.1%
20
 
1.3%
Other values (77) 383
25.2%
Decimal Number
ValueCountFrequency (%)
1 246
29.2%
2 94
 
11.2%
0 86
 
10.2%
6 83
 
9.9%
3 75
 
8.9%
4 70
 
8.3%
5 59
 
7.0%
7 48
 
5.7%
8 44
 
5.2%
9 37
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
B 1
20.0%
N 1
20.0%
K 1
20.0%
I 1
20.0%
G 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 64
98.5%
, 1
 
1.5%
Space Separator
ValueCountFrequency (%)
490
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1517
50.8%
Common 1467
49.1%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
253
16.7%
253
16.7%
178
11.7%
173
11.4%
116
 
7.6%
42
 
2.8%
34
 
2.2%
33
 
2.2%
32
 
2.1%
20
 
1.3%
Other values (77) 383
25.2%
Common
ValueCountFrequency (%)
490
33.4%
1 246
16.8%
2 94
 
6.4%
0 86
 
5.9%
6 83
 
5.7%
3 75
 
5.1%
4 70
 
4.8%
. 64
 
4.4%
5 59
 
4.0%
7 48
 
3.3%
Other values (6) 152
 
10.4%
Latin
ValueCountFrequency (%)
B 1
20.0%
N 1
20.0%
K 1
20.0%
I 1
20.0%
G 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1517
50.8%
ASCII 1472
49.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
490
33.3%
1 246
16.7%
2 94
 
6.4%
0 86
 
5.8%
6 83
 
5.6%
3 75
 
5.1%
4 70
 
4.8%
. 64
 
4.3%
5 59
 
4.0%
7 48
 
3.3%
Other values (11) 157
 
10.7%
Hangul
ValueCountFrequency (%)
253
16.7%
253
16.7%
178
11.7%
173
11.4%
116
 
7.6%
42
 
2.8%
34
 
2.2%
33
 
2.2%
32
 
2.1%
20
 
1.3%
Other values (77) 383
25.2%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct176
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.658341
Minimum37.633853
Maximum37.68908
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-11T18:32:15.688827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.633853
5-th percentile37.638584
Q137.648958
median37.656478
Q337.667141
95-th percentile37.682541
Maximum37.68908
Range0.0552274
Interquartile range (IQR)0.01818315

Descriptive statistics

Standard deviation0.012816599
Coefficient of variation (CV)0.00034033891
Kurtosis-0.46935051
Mean37.658341
Median Absolute Deviation (MAD)0.009105
Skewness0.39162063
Sum6703.1847
Variance0.0001642652
MonotonicityNot monotonic
2023-12-11T18:32:15.856572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.66644 2
 
1.1%
37.65329 2
 
1.1%
37.66016 1
 
0.6%
37.65902 1
 
0.6%
37.6566296 1
 
0.6%
37.64567 1
 
0.6%
37.67849 1
 
0.6%
37.65719 1
 
0.6%
37.64488 1
 
0.6%
37.65225 1
 
0.6%
Other values (166) 166
93.3%
ValueCountFrequency (%)
37.6338529 1
0.6%
37.6345765 1
0.6%
37.6346628 1
0.6%
37.6367 1
0.6%
37.63717 1
0.6%
37.63724 1
0.6%
37.63799 1
0.6%
37.6381477 1
0.6%
37.6383237 1
0.6%
37.63863 1
0.6%
ValueCountFrequency (%)
37.6890803 1
0.6%
37.6880522 1
0.6%
37.6869933 1
0.6%
37.6861992 1
0.6%
37.6848277 1
0.6%
37.68402 1
0.6%
37.683731 1
0.6%
37.68364 1
0.6%
37.6829423 1
0.6%
37.68247 1
0.6%

경도
Real number (ℝ)

Distinct172
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.03835
Minimum127.01296
Maximum127.05383
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-11T18:32:16.029423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.01296
5-th percentile127.0242
Q1127.03398
median127.03905
Q3127.04375
95-th percentile127.0494
Maximum127.05383
Range0.04087
Interquartile range (IQR)0.009775125

Descriptive statistics

Standard deviation0.0078691989
Coefficient of variation (CV)6.1943492 × 10-5
Kurtosis0.9580599
Mean127.03835
Median Absolute Deviation (MAD)0.00500145
Skewness-0.77056571
Sum22612.826
Variance6.1924291 × 10-5
MonotonicityNot monotonic
2023-12-11T18:32:16.207654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.03475 2
 
1.1%
127.03234 2
 
1.1%
127.03502 2
 
1.1%
127.04474 2
 
1.1%
127.04101 2
 
1.1%
127.03527 2
 
1.1%
127.04205 1
 
0.6%
127.0390419 1
 
0.6%
127.04932 1
 
0.6%
127.05383 1
 
0.6%
Other values (162) 162
91.0%
ValueCountFrequency (%)
127.01296 1
0.6%
127.01329 1
0.6%
127.01354 1
0.6%
127.01562 1
0.6%
127.016837 1
0.6%
127.0200228 1
0.6%
127.0219908 1
0.6%
127.0234 1
0.6%
127.02379 1
0.6%
127.02427 1
0.6%
ValueCountFrequency (%)
127.05383 1
0.6%
127.05214 1
0.6%
127.0520542 1
0.6%
127.0513911 1
0.6%
127.05093 1
0.6%
127.04982 1
0.6%
127.04972 1
0.6%
127.04967 1
0.6%
127.04954 1
0.6%
127.04938 1
0.6%

Interactions

2023-12-11T18:32:13.628700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T18:32:13.460742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T18:32:13.711828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T18:32:13.536229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T18:32:16.315865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동구분위도경도
동구분1.0000.8050.607
위도0.8051.0000.562
경도0.6070.5621.000
2023-12-11T18:32:16.399115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도동구분
위도1.0000.2850.587
경도0.2851.0000.370
동구분0.5870.3701.000

Missing values

2023-12-11T18:32:13.828326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T18:32:13.912757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명동구분도로명주소위도경도
0창동25시편의점창동도봉구 노해로63길 79. 106호 (창동.우림빌딩)37.65305127.04629
1채널큐24시창동점창동도봉구 우이천로20길 3737.64193127.03502
2토마토방학동도봉구 방학로6길 437.6631127.03744
3티엠티24쌍문레미안점쌍문동도봉구 우이천로 330 (삼성레미안상가 102호. 103호)37.64643127.02679
4포시즌편의점(도봉점)도봉동도봉구 도봉로169길 20237.67783127.03502
5CU 도봉신동아점방학동도봉구 시루봉로 105-2 (방학동)37.667245127.031516
6CU 도봉신창점창동도봉구 덕릉로53길 2637.63863127.03617
7BUY24시창동도봉구 노해로67길 10 104호 (하나빌딩)37.6523127.04967
8GS25 방학효성점방학동도봉구 방학로 17137.66213127.03323
9GS25 도봉북서울점도봉동도봉구 도봉로177길 2637.68181127.04474
업소명동구분도로명주소위도경도
168위드미 도봉산점도봉동도봉구 도봉산길 81. 라동 55호37.686199127.037353
169위드미 도봉해등로점창동도봉구 해등로16길 52. 1층37.655062127.042656
170위드미 마트창동도봉구 노해로63길 67. 상가동 103호 (창동. 창동대림아파트)37.653159127.045654
171위드미 방학우성점방학동도봉구 해등로 307. 110동 B층 05호 (방학동)37.656827127.021991
172위드미 방학학마을방학동도봉구 도당로 115. 1층37.667178127.039218
173위드미 슈퍼마켓방학동도봉구 방학로2길 82. 1층37.666527127.041031
174위드미 쌍문역점창동도봉구 도봉로 지하 486-1. 413-14호 (창동)37.648811127.034866
175위드미 창동점창동도봉구 덕릉로 27637.641639127.04119
176위드미 행복드림도봉동도봉구 도봉로 879. 1층37.682942127.045591
177위드미 현대홈시티창동도봉구 도봉로106길 22. 201동 105호 (창동.북한산현대홈시티)37.644345127.033723