Overview

Dataset statistics

Number of variables5
Number of observations616
Missing cells208
Missing cells (%)6.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.8 KiB
Average record size in memory41.2 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description서울특별시 종로구 내에 음식물류 폐기물 다량배출사업장(업소명, 도로명주소, 업종명, 전화번호)에 대한 데이터를 제공합니다.
Author서울특별시 종로구
URLhttps://www.data.go.kr/data/15075089/fileData.do

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
전화번호 has 208 (33.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:58:38.554970
Analysis finished2023-12-12 00:58:39.139502
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct616
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean308.5
Minimum1
Maximum616
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.5 KiB
2023-12-12T09:58:39.193906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile31.75
Q1154.75
median308.5
Q3462.25
95-th percentile585.25
Maximum616
Range615
Interquartile range (IQR)307.5

Descriptive statistics

Standard deviation177.96816
Coefficient of variation (CV)0.57688221
Kurtosis-1.2
Mean308.5
Median Absolute Deviation (MAD)154
Skewness0
Sum190036
Variance31672.667
MonotonicityStrictly increasing
2023-12-12T09:58:39.307995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
415 1
 
0.2%
408 1
 
0.2%
409 1
 
0.2%
410 1
 
0.2%
411 1
 
0.2%
412 1
 
0.2%
413 1
 
0.2%
414 1
 
0.2%
416 1
 
0.2%
Other values (606) 606
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
616 1
0.2%
615 1
0.2%
614 1
0.2%
613 1
0.2%
612 1
0.2%
611 1
0.2%
610 1
0.2%
609 1
0.2%
608 1
0.2%
607 1
0.2%
Distinct609
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T09:58:39.523165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length28
Mean length8.112013
Min length1

Characters and Unicode

Total characters4997
Distinct characters560
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique604 ?
Unique (%)98.1%

Sample

1st row민들레처럼
2nd row만족 오향족발(관철점)
3rd row두박이
4th row함흥곰보냉면
5th row강호
ValueCountFrequency (%)
광화문점 24
 
2.4%
종로점 17
 
1.7%
대학로점 16
 
1.6%
광화문 10
 
1.0%
호텔 8
 
0.8%
7
 
0.7%
한빛프라자 6
 
0.6%
동대문 6
 
0.6%
종각점 6
 
0.6%
주식회사 5
 
0.5%
Other values (800) 876
89.3%
2023-12-12T09:58:39.861558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
365
 
7.3%
139
 
2.8%
) 101
 
2.0%
( 100
 
2.0%
86
 
1.7%
82
 
1.6%
80
 
1.6%
80
 
1.6%
71
 
1.4%
71
 
1.4%
Other values (550) 3822
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3998
80.0%
Space Separator 365
 
7.3%
Lowercase Letter 188
 
3.8%
Uppercase Letter 153
 
3.1%
Close Punctuation 101
 
2.0%
Open Punctuation 100
 
2.0%
Decimal Number 69
 
1.4%
Other Punctuation 20
 
0.4%
Connector Punctuation 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
 
3.5%
86
 
2.2%
82
 
2.1%
80
 
2.0%
80
 
2.0%
71
 
1.8%
71
 
1.8%
64
 
1.6%
64
 
1.6%
60
 
1.5%
Other values (483) 3201
80.1%
Uppercase Letter
ValueCountFrequency (%)
M 11
 
7.2%
E 11
 
7.2%
C 10
 
6.5%
S 10
 
6.5%
T 10
 
6.5%
A 10
 
6.5%
K 9
 
5.9%
H 9
 
5.9%
I 7
 
4.6%
L 7
 
4.6%
Other values (14) 59
38.6%
Lowercase Letter
ValueCountFrequency (%)
a 24
12.8%
e 23
12.2%
o 17
9.0%
c 15
8.0%
i 13
 
6.9%
h 13
 
6.9%
n 13
 
6.9%
u 12
 
6.4%
r 11
 
5.9%
t 10
 
5.3%
Other values (11) 37
19.7%
Decimal Number
ValueCountFrequency (%)
2 16
23.2%
1 12
17.4%
0 10
14.5%
7 8
11.6%
5 5
 
7.2%
4 5
 
7.2%
9 4
 
5.8%
6 4
 
5.8%
3 3
 
4.3%
8 2
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 6
30.0%
& 5
25.0%
· 4
20.0%
. 3
15.0%
' 1
 
5.0%
1
 
5.0%
Space Separator
ValueCountFrequency (%)
365
100.0%
Close Punctuation
ValueCountFrequency (%)
) 101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3995
79.9%
Common 657
 
13.1%
Latin 342
 
6.8%
Han 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
 
3.5%
86
 
2.2%
82
 
2.1%
80
 
2.0%
80
 
2.0%
71
 
1.8%
71
 
1.8%
64
 
1.6%
64
 
1.6%
60
 
1.5%
Other values (480) 3198
80.1%
Latin
ValueCountFrequency (%)
a 24
 
7.0%
e 23
 
6.7%
o 17
 
5.0%
c 15
 
4.4%
i 13
 
3.8%
h 13
 
3.8%
n 13
 
3.8%
u 12
 
3.5%
M 11
 
3.2%
r 11
 
3.2%
Other values (36) 190
55.6%
Common
ValueCountFrequency (%)
365
55.6%
) 101
 
15.4%
( 100
 
15.2%
2 16
 
2.4%
1 12
 
1.8%
0 10
 
1.5%
7 8
 
1.2%
, 6
 
0.9%
5 5
 
0.8%
4 5
 
0.8%
Other values (11) 29
 
4.4%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3995
79.9%
ASCII 992
 
19.9%
None 5
 
0.1%
CJK 3
 
0.1%
Number Forms 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
365
36.8%
) 101
 
10.2%
( 100
 
10.1%
a 24
 
2.4%
e 23
 
2.3%
o 17
 
1.7%
2 16
 
1.6%
c 15
 
1.5%
i 13
 
1.3%
h 13
 
1.3%
Other values (53) 305
30.7%
Hangul
ValueCountFrequency (%)
139
 
3.5%
86
 
2.2%
82
 
2.1%
80
 
2.0%
80
 
2.0%
71
 
1.8%
71
 
1.8%
64
 
1.6%
64
 
1.6%
60
 
1.5%
Other values (480) 3198
80.1%
None
ValueCountFrequency (%)
· 4
80.0%
1
 
20.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct599
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T09:58:40.138828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length48
Mean length31.050325
Min length20

Characters and Unicode

Total characters19127
Distinct characters299
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique586 ?
Unique (%)95.1%

Sample

1st row서울특별시 종로구 동숭길 98 (동숭동)
2nd row서울특별시 종로구 삼일대로17길 19, 원산빌딩 1층 (관철동)
3rd row서울특별시 종로구 대학로1길 10 (연지동)
4th row서울특별시 종로구 창경궁로 109, 401,405호 (인의동)
5th row서울특별시 종로구 삼일대로32길 22-1, 강호빌딩 지1층,1층 (경운동)
ValueCountFrequency (%)
서울특별시 616
 
16.5%
종로구 616
 
16.5%
지하1층 68
 
1.8%
관철동 62
 
1.7%
2층 52
 
1.4%
종로 50
 
1.3%
1층 50
 
1.3%
동숭동 35
 
0.9%
새문안로 32
 
0.9%
대학로 31
 
0.8%
Other values (779) 2129
56.9%
2023-12-12T09:58:40.528234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3126
 
16.3%
1236
 
6.5%
1 835
 
4.4%
802
 
4.2%
660
 
3.5%
) 647
 
3.4%
( 647
 
3.4%
642
 
3.4%
625
 
3.3%
624
 
3.3%
Other values (289) 9283
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11035
57.7%
Space Separator 3126
 
16.3%
Decimal Number 2796
 
14.6%
Other Punctuation 657
 
3.4%
Close Punctuation 647
 
3.4%
Open Punctuation 647
 
3.4%
Uppercase Letter 78
 
0.4%
Dash Punctuation 68
 
0.4%
Math Symbol 41
 
0.2%
Lowercase Letter 32
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1236
 
11.2%
802
 
7.3%
660
 
6.0%
642
 
5.8%
625
 
5.7%
624
 
5.7%
622
 
5.6%
618
 
5.6%
616
 
5.6%
444
 
4.0%
Other values (241) 4146
37.6%
Uppercase Letter
ValueCountFrequency (%)
B 26
33.3%
D 10
 
12.8%
A 8
 
10.3%
G 6
 
7.7%
L 4
 
5.1%
R 4
 
5.1%
C 3
 
3.8%
M 3
 
3.8%
Y 3
 
3.8%
S 3
 
3.8%
Other values (5) 8
 
10.3%
Lowercase Letter
ValueCountFrequency (%)
l 4
12.5%
e 4
12.5%
t 4
12.5%
a 3
9.4%
i 3
9.4%
o 3
9.4%
c 2
 
6.2%
n 2
 
6.2%
g 1
 
3.1%
b 1
 
3.1%
Other values (5) 5
15.6%
Decimal Number
ValueCountFrequency (%)
1 835
29.9%
2 523
18.7%
3 321
 
11.5%
4 202
 
7.2%
0 193
 
6.9%
5 186
 
6.7%
9 149
 
5.3%
6 145
 
5.2%
7 124
 
4.4%
8 118
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 616
93.8%
. 24
 
3.7%
/ 17
 
2.6%
Space Separator
ValueCountFrequency (%)
3126
100.0%
Close Punctuation
ValueCountFrequency (%)
) 647
100.0%
Open Punctuation
ValueCountFrequency (%)
( 647
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%
Math Symbol
ValueCountFrequency (%)
~ 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11035
57.7%
Common 7982
41.7%
Latin 110
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1236
 
11.2%
802
 
7.3%
660
 
6.0%
642
 
5.8%
625
 
5.7%
624
 
5.7%
622
 
5.6%
618
 
5.6%
616
 
5.6%
444
 
4.0%
Other values (241) 4146
37.6%
Latin
ValueCountFrequency (%)
B 26
23.6%
D 10
 
9.1%
A 8
 
7.3%
G 6
 
5.5%
L 4
 
3.6%
l 4
 
3.6%
R 4
 
3.6%
e 4
 
3.6%
t 4
 
3.6%
a 3
 
2.7%
Other values (20) 37
33.6%
Common
ValueCountFrequency (%)
3126
39.2%
1 835
 
10.5%
) 647
 
8.1%
( 647
 
8.1%
, 616
 
7.7%
2 523
 
6.6%
3 321
 
4.0%
4 202
 
2.5%
0 193
 
2.4%
5 186
 
2.3%
Other values (8) 686
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11035
57.7%
ASCII 8092
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3126
38.6%
1 835
 
10.3%
) 647
 
8.0%
( 647
 
8.0%
, 616
 
7.6%
2 523
 
6.5%
3 321
 
4.0%
4 202
 
2.5%
0 193
 
2.4%
5 186
 
2.3%
Other values (38) 796
 
9.8%
Hangul
ValueCountFrequency (%)
1236
 
11.2%
802
 
7.3%
660
 
6.0%
642
 
5.8%
625
 
5.7%
624
 
5.7%
622
 
5.6%
618
 
5.6%
616
 
5.6%
444
 
4.0%
Other values (241) 4146
37.6%

업종명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
일반음식점
483 
집단급식소
105 
관광숙박업
 
28

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 483
78.4%
집단급식소 105
 
17.0%
관광숙박업 28
 
4.5%

Length

2023-12-12T09:58:40.640219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:58:40.723410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 483
78.4%
집단급식소 105
 
17.0%
관광숙박업 28
 
4.5%

전화번호
Text

MISSING 

Distinct388
Distinct (%)95.1%
Missing208
Missing (%)33.8%
Memory size4.9 KiB
2023-12-12T09:58:40.933112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length11
Mean length11.487745
Min length11

Characters and Unicode

Total characters4687
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique377 ?
Unique (%)92.4%

Sample

1st row02-765-6392
2nd row02-734-1195
3rd row02-763-8824
4th row02-2273-2833
5th row02-732-2919
ValueCountFrequency (%)
02-3774-7472 6
 
1.5%
02-741-2121 6
 
1.5%
070-4173-9778 3
 
0.7%
02-396-2442 2
 
0.5%
02-733-3276 2
 
0.5%
02-509-6000 2
 
0.5%
02-737-2567 2
 
0.5%
02-2265-7707 2
 
0.5%
02-745-0026 2
 
0.5%
02-6730-1131 2
 
0.5%
Other values (378) 379
92.9%
2023-12-12T09:58:41.269848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 826
17.6%
2 789
16.8%
0 765
16.3%
7 527
11.2%
3 405
8.6%
6 276
 
5.9%
1 257
 
5.5%
4 250
 
5.3%
5 220
 
4.7%
9 194
 
4.1%
Other values (2) 178
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3856
82.3%
Dash Punctuation 826
 
17.6%
Other Punctuation 5
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 789
20.5%
0 765
19.8%
7 527
13.7%
3 405
10.5%
6 276
 
7.2%
1 257
 
6.7%
4 250
 
6.5%
5 220
 
5.7%
9 194
 
5.0%
8 173
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 826
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4687
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 826
17.6%
2 789
16.8%
0 765
16.3%
7 527
11.2%
3 405
8.6%
6 276
 
5.9%
1 257
 
5.5%
4 250
 
5.3%
5 220
 
4.7%
9 194
 
4.1%
Other values (2) 178
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4687
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 826
17.6%
2 789
16.8%
0 765
16.3%
7 527
11.2%
3 405
8.6%
6 276
 
5.9%
1 257
 
5.5%
4 250
 
5.3%
5 220
 
4.7%
9 194
 
4.1%
Other values (2) 178
 
3.8%

Interactions

2023-12-12T09:58:38.943694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:58:41.351898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.854
업종명0.8541.000
2023-12-12T09:58:41.415120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.769
업종명0.7691.000

Missing values

2023-12-12T09:58:39.037485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:58:39.111019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명소재지 도로명주소업종명전화번호
01민들레처럼서울특별시 종로구 동숭길 98 (동숭동)일반음식점02-765-6392
12만족 오향족발(관철점)서울특별시 종로구 삼일대로17길 19, 원산빌딩 1층 (관철동)일반음식점02-734-1195
23두박이서울특별시 종로구 대학로1길 10 (연지동)일반음식점02-763-8824
34함흥곰보냉면서울특별시 종로구 창경궁로 109, 401,405호 (인의동)일반음식점02-2273-2833
45강호서울특별시 종로구 삼일대로32길 22-1, 강호빌딩 지1층,1층 (경운동)일반음식점<NA>
56송전서울특별시 종로구 창경궁로11길 3 (예지동)일반음식점<NA>
67두레서울특별시 종로구 삼청로 30, 1층 (소격동)일반음식점02-732-2919
78낙원회관서울특별시 종로구 삼일대로 457, 지하1층 (경운동)일반음식점02-738-5350
89호가양꼬치서울특별시 종로구 율곡로4길 66, 지층.1층 (수송동)일반음식점02-732-5502
910낙산가든서울특별시 종로구 동숭길 145 (동숭동,지상1,2,3층)일반음식점02-742-7470
연번업소명소재지 도로명주소업종명전화번호
606607호스텔 데이서울특별시 종로구 창경궁로 224 / 2 /3층 (명륜4가 / 서울시티빌딩)관광숙박업02-742-7439
607608서울앤호텔 동대문서울특별시 종로구 종로66가길 21 (숭인동)관광숙박업02-6365-0008
608609루미아호텔서울특별시 종로구 난계로29길 19-11 (숭인동)관광숙박업02-2235-1301
609610호텔쿠레타케소서울특별시 종로구 인사동길 20-9 (인사동)관광숙박업02-738-6100
610611JONGRO ALICE서울특별시 종로구 삼일대로32길 46 (익선동)관광숙박업<NA>
611612호텔 썬비서울특별시 종로구 인사동7길 26 / 호텔 썬비 (관훈동)관광숙박업02-730-3455
612613나인트리 프리미어 호텔 인사동서울특별시 종로구 인사동길 49 (관훈동)관광숙박업02-6917-3000
613614글루호텔서울특별시 종로구 율곡로 228 / glue hotel (이화동)관광숙박업02-2024-8400
614615목시 서울 인사동 호텔서울특별시 종로구 돈화문로11길 37 (낙원동)관광숙박업02-758-1731
615616포시즌스 호텔 서울서울특별시 종로구 새문안로 97 (당주동)관광숙박업<NA>