Overview

Dataset statistics

Number of variables4
Number of observations1933
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory62.4 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description이 데이터는 서울특별시 동작구 관내에 있는 환경오염물질배출사업장에 관한 것입니다. 이 데이터에는 사업장 명칭, 도로명주소, 데이터 기준일자가 포함되어 있습니다.
URLhttps://www.data.go.kr/data/15037453/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-13 00:17:22.170735
Analysis finished2023-12-13 00:17:22.610818
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1933
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean967
Minimum1
Maximum1933
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.1 KiB
2023-12-13T09:17:22.667924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile97.6
Q1484
median967
Q31450
95-th percentile1836.4
Maximum1933
Range1932
Interquartile range (IQR)966

Descriptive statistics

Standard deviation558.15335
Coefficient of variation (CV)0.57720099
Kurtosis-1.2
Mean967
Median Absolute Deviation (MAD)483
Skewness0
Sum1869211
Variance311535.17
MonotonicityStrictly increasing
2023-12-13T09:17:22.783247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1330 1
 
0.1%
1298 1
 
0.1%
1297 1
 
0.1%
1296 1
 
0.1%
1295 1
 
0.1%
1294 1
 
0.1%
1293 1
 
0.1%
1292 1
 
0.1%
1291 1
 
0.1%
Other values (1923) 1923
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1933 1
0.1%
1932 1
0.1%
1931 1
0.1%
1930 1
0.1%
1929 1
0.1%
1928 1
0.1%
1927 1
0.1%
1926 1
0.1%
1925 1
0.1%
1924 1
0.1%

상호
Text

Distinct412
Distinct (%)21.3%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-13T09:17:22.941940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length6.9958614
Min length2

Characters and Unicode

Total characters13523
Distinct characters338
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)5.7%

Sample

1st row서울주택도시공사
2nd row서울주택도시공사
3rd row(주)현대원룸
4th row(주)현대원룸
5th row개인
ValueCountFrequency (%)
개인 419
 
20.2%
중앙대학교 194
 
9.3%
동작구청 91
 
4.4%
서울특별시동작관악교육지원청 69
 
3.3%
현대자동차(주)남부서비스센터 35
 
1.7%
숭실대학교 32
 
1.5%
서울동작관악교육지원청 24
 
1.2%
서울특별시 20
 
1.0%
국립서울현충원 19
 
0.9%
주식회사 19
 
0.9%
Other values (441) 1154
55.6%
2023-12-13T09:17:23.208358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
525
 
3.9%
501
 
3.7%
482
 
3.6%
454
 
3.4%
435
 
3.2%
429
 
3.2%
) 406
 
3.0%
( 404
 
3.0%
374
 
2.8%
328
 
2.4%
Other values (328) 9185
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12439
92.0%
Close Punctuation 406
 
3.0%
Open Punctuation 404
 
3.0%
Space Separator 143
 
1.1%
Decimal Number 72
 
0.5%
Uppercase Letter 50
 
0.4%
Dash Punctuation 5
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
525
 
4.2%
501
 
4.0%
482
 
3.9%
454
 
3.6%
435
 
3.5%
429
 
3.4%
374
 
3.0%
328
 
2.6%
306
 
2.5%
283
 
2.3%
Other values (304) 8322
66.9%
Decimal Number
ValueCountFrequency (%)
2 15
20.8%
3 12
16.7%
4 10
13.9%
1 9
12.5%
6 8
11.1%
8 7
9.7%
7 7
9.7%
0 2
 
2.8%
9 1
 
1.4%
5 1
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
U 13
26.0%
L 13
26.0%
B 13
26.0%
H 5
 
10.0%
J 4
 
8.0%
S 1
 
2.0%
V 1
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
e 1
25.0%
i 1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 406
100.0%
Open Punctuation
ValueCountFrequency (%)
( 404
100.0%
Space Separator
ValueCountFrequency (%)
143
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12439
92.0%
Common 1030
 
7.6%
Latin 54
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
525
 
4.2%
501
 
4.0%
482
 
3.9%
454
 
3.6%
435
 
3.5%
429
 
3.4%
374
 
3.0%
328
 
2.6%
306
 
2.5%
283
 
2.3%
Other values (304) 8322
66.9%
Common
ValueCountFrequency (%)
) 406
39.4%
( 404
39.2%
143
 
13.9%
2 15
 
1.5%
3 12
 
1.2%
4 10
 
1.0%
1 9
 
0.9%
6 8
 
0.8%
8 7
 
0.7%
7 7
 
0.7%
Other values (4) 9
 
0.9%
Latin
ValueCountFrequency (%)
U 13
24.1%
L 13
24.1%
B 13
24.1%
H 5
 
9.3%
J 4
 
7.4%
l 2
 
3.7%
e 1
 
1.9%
S 1
 
1.9%
V 1
 
1.9%
i 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12439
92.0%
ASCII 1084
 
8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
525
 
4.2%
501
 
4.0%
482
 
3.9%
454
 
3.6%
435
 
3.5%
429
 
3.4%
374
 
3.0%
328
 
2.6%
306
 
2.5%
283
 
2.3%
Other values (304) 8322
66.9%
ASCII
ValueCountFrequency (%)
) 406
37.5%
( 404
37.3%
143
 
13.2%
2 15
 
1.4%
U 13
 
1.2%
L 13
 
1.2%
B 13
 
1.2%
3 12
 
1.1%
4 10
 
0.9%
1 9
 
0.8%
Other values (14) 46
 
4.2%
Distinct631
Distinct (%)32.6%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-13T09:17:23.430624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length40
Mean length28.185204
Min length1

Characters and Unicode

Total characters54482
Distinct characters276
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)5.5%

Sample

1st row서울특별시 동작구 흑석로3길 20-1 (흑석동)
2nd row서울특별시 동작구 흑석로3길 20-1 (흑석동)
3rd row서울특별시 동작구 상도로37길 39 (상도1동)
4th row서울특별시 동작구 상도로37길 39 (상도1동)
5th row서울특별시 동작구 남부순환로269길 8_ 덕창빌딩 (사당동)
ValueCountFrequency (%)
서울특별시 1798
 
17.4%
동작구 1786
 
17.2%
상도동 381
 
3.7%
흑석동 302
 
2.9%
대방동 284
 
2.7%
사당동 266
 
2.6%
노량진동 214
 
2.1%
흑석로 204
 
2.0%
84 185
 
1.8%
신대방동 166
 
1.6%
Other values (807) 4776
46.1%
2023-12-13T09:17:23.751466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8754
 
16.1%
4058
 
7.4%
2048
 
3.8%
1972
 
3.6%
1905
 
3.5%
1893
 
3.5%
1831
 
3.4%
1818
 
3.3%
) 1814
 
3.3%
1814
 
3.3%
Other values (266) 26575
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34421
63.2%
Space Separator 8754
 
16.1%
Decimal Number 6370
 
11.7%
Close Punctuation 1814
 
3.3%
Open Punctuation 1814
 
3.3%
Connector Punctuation 1004
 
1.8%
Dash Punctuation 202
 
0.4%
Lowercase Letter 51
 
0.1%
Uppercase Letter 32
 
0.1%
Other Punctuation 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4058
 
11.8%
2048
 
5.9%
1972
 
5.7%
1905
 
5.5%
1893
 
5.5%
1831
 
5.3%
1818
 
5.3%
1814
 
5.3%
1709
 
5.0%
1300
 
3.8%
Other values (233) 14073
40.9%
Decimal Number
ValueCountFrequency (%)
1 1422
22.3%
2 941
14.8%
4 739
11.6%
3 621
9.7%
6 586
9.2%
5 519
 
8.1%
8 441
 
6.9%
0 400
 
6.3%
7 370
 
5.8%
9 331
 
5.2%
Lowercase Letter
ValueCountFrequency (%)
a 23
45.1%
o 4
 
7.8%
n 4
 
7.8%
g 4
 
7.8%
j 4
 
7.8%
t 4
 
7.8%
k 4
 
7.8%
s 4
 
7.8%
Uppercase Letter
ValueCountFrequency (%)
A 10
31.2%
S 10
31.2%
D 5
15.6%
T 3
 
9.4%
K 2
 
6.2%
B 1
 
3.1%
V 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
/ 10
55.6%
. 8
44.4%
Space Separator
ValueCountFrequency (%)
8754
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1814
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1814
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1004
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 202
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34421
63.2%
Common 19978
36.7%
Latin 83
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4058
 
11.8%
2048
 
5.9%
1972
 
5.7%
1905
 
5.5%
1893
 
5.5%
1831
 
5.3%
1818
 
5.3%
1814
 
5.3%
1709
 
5.0%
1300
 
3.8%
Other values (233) 14073
40.9%
Common
ValueCountFrequency (%)
8754
43.8%
) 1814
 
9.1%
( 1814
 
9.1%
1 1422
 
7.1%
_ 1004
 
5.0%
2 941
 
4.7%
4 739
 
3.7%
3 621
 
3.1%
6 586
 
2.9%
5 519
 
2.6%
Other values (8) 1764
 
8.8%
Latin
ValueCountFrequency (%)
a 23
27.7%
A 10
12.0%
S 10
12.0%
D 5
 
6.0%
o 4
 
4.8%
n 4
 
4.8%
g 4
 
4.8%
j 4
 
4.8%
t 4
 
4.8%
k 4
 
4.8%
Other values (5) 11
13.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34421
63.2%
ASCII 20061
36.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8754
43.6%
) 1814
 
9.0%
( 1814
 
9.0%
1 1422
 
7.1%
_ 1004
 
5.0%
2 941
 
4.7%
4 739
 
3.7%
3 621
 
3.1%
6 586
 
2.9%
5 519
 
2.6%
Other values (23) 1847
 
9.2%
Hangul
ValueCountFrequency (%)
4058
 
11.8%
2048
 
5.9%
1972
 
5.7%
1905
 
5.5%
1893
 
5.5%
1831
 
5.3%
1818
 
5.3%
1814
 
5.3%
1709
 
5.0%
1300
 
3.8%
Other values (233) 14073
40.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-07-07
1933 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-07
2nd row2023-07-07
3rd row2023-07-07
4th row2023-07-07
5th row2023-07-07

Common Values

ValueCountFrequency (%)
2023-07-07 1933
100.0%

Length

2023-12-13T09:17:24.087187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:17:24.152241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-07 1933
100.0%

Interactions

2023-12-13T09:17:22.426503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T09:17:22.523008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:17:22.584566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호사업장도로명주소데이터기준일자
01서울주택도시공사서울특별시 동작구 흑석로3길 20-1 (흑석동)2023-07-07
12서울주택도시공사서울특별시 동작구 흑석로3길 20-1 (흑석동)2023-07-07
23(주)현대원룸서울특별시 동작구 상도로37길 39 (상도1동)2023-07-07
34(주)현대원룸서울특별시 동작구 상도로37길 39 (상도1동)2023-07-07
45개인서울특별시 동작구 남부순환로269길 8_ 덕창빌딩 (사당동)2023-07-07
56개인서울특별시 동작구 남부순환로269길 8_ 덕창빌딩 (사당동)2023-07-07
67동작구청(전략사업과)서울특별시 동작구 사당로16라길 74 (사당동)2023-07-07
78동작구청(전략사업과)서울특별시 동작구 사당로16라길 74 (사당동)2023-07-07
89김종석서울특별시 동작구 양녕로36길 12 (상도동)2023-07-07
910김종석서울특별시 동작구 양녕로36길 12 (상도동)2023-07-07
연번상호사업장도로명주소데이터기준일자
19231924중앙대학교서울특별시 동작구 흑석로 84 (흑석동_ 중앙대학교 교양학관 안전관리팀)2023-07-07
19241925중앙대학교서울특별시 동작구 흑석로 84 (흑석동_ 중앙대학교 교양학관 안전관리팀)2023-07-07
19251926서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19261927서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19271928서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19281929서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19291930서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19301931서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19311932서울특별시보라매병원서울특별시 동작구 보라매로5길 20 (신대방동_서울특별시보라매병원)2023-07-07
19321933전문건설공제조합서울특별시 동작구 보라매로5길 15 (신대방동)2023-07-07