Overview

Dataset statistics

Number of variables13
Number of observations2550
Missing cells4295
Missing cells (%)13.0%
Duplicate rows4
Duplicate rows (%)0.2%
Total size in memory271.6 KiB
Average record size in memory109.1 B

Variable types

Text4
Numeric2
Categorical5
Boolean1
Unsupported1

Dataset

Description업체(시설)명,인허가번호,업종코드,업종명,지도점검일자,점검기관,점검기관명,지도점검구분,처분대상여부,점검사항,점검결과,소재지도로명주소,소재지주소
Author성동구
URLhttps://data.seoul.go.kr/dataList/OA-10740/S/1/datasetView.do

Alerts

점검기관 has constant value ""Constant
점검기관명 has constant value ""Constant
Dataset has 4 (0.2%) duplicate rowsDuplicates
업종명 is highly overall correlated with 인허가번호 and 1 other fieldsHigh correlation
업종코드 is highly overall correlated with 인허가번호 and 1 other fieldsHigh correlation
인허가번호 is highly overall correlated with 업종코드 and 1 other fieldsHigh correlation
지도점검구분 is highly imbalanced (80.7%)Imbalance
처분대상여부 is highly imbalanced (99.5%)Imbalance
처분대상여부 has 135 (5.3%) missing valuesMissing
점검결과 has 2550 (100.0%) missing valuesMissing
소재지도로명주소 has 1489 (58.4%) missing valuesMissing
소재지주소 has 110 (4.3%) missing valuesMissing
점검결과 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 07:04:32.471948
Analysis finished2024-05-11 07:04:34.732543
Duration2.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct698
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
2024-05-11T16:04:34.986876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length19
Mean length6.2866667
Min length2

Characters and Unicode

Total characters16031
Distinct characters381
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)7.8%

Sample

1st row학교법인 한양대학교
2nd row한양대학병원
3rd row(주)엘앤피에스
4th row홍익점
5th row(주)한미더블유비오
ValueCountFrequency (%)
제일도금 20
 
0.7%
주화섬유 19
 
0.7%
신광도금 19
 
0.7%
빛나금속 17
 
0.6%
신광특수정공사 17
 
0.6%
삼구당 16
 
0.6%
오륜섬유 15
 
0.6%
화성금속 15
 
0.6%
삼성모터스 15
 
0.6%
신덕섬유 15
 
0.6%
Other values (718) 2515
93.7%
2024-05-11T16:04:35.547163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
976
 
6.1%
) 843
 
5.3%
( 837
 
5.2%
591
 
3.7%
453
 
2.8%
373
 
2.3%
337
 
2.1%
335
 
2.1%
325
 
2.0%
307
 
1.9%
Other values (371) 10654
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13749
85.8%
Close Punctuation 843
 
5.3%
Open Punctuation 837
 
5.2%
Uppercase Letter 271
 
1.7%
Space Separator 133
 
0.8%
Lowercase Letter 110
 
0.7%
Dash Punctuation 36
 
0.2%
Other Punctuation 28
 
0.2%
Decimal Number 24
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
976
 
7.1%
591
 
4.3%
453
 
3.3%
373
 
2.7%
337
 
2.5%
335
 
2.4%
325
 
2.4%
307
 
2.2%
302
 
2.2%
291
 
2.1%
Other values (327) 9459
68.8%
Uppercase Letter
ValueCountFrequency (%)
C 31
11.4%
S 26
 
9.6%
D 25
 
9.2%
P 24
 
8.9%
K 23
 
8.5%
G 17
 
6.3%
T 17
 
6.3%
M 17
 
6.3%
N 12
 
4.4%
Y 12
 
4.4%
Other values (11) 67
24.7%
Lowercase Letter
ValueCountFrequency (%)
o 20
18.2%
e 18
16.4%
t 11
10.0%
s 11
10.0%
d 10
9.1%
a 10
9.1%
r 9
8.2%
c 5
 
4.5%
h 5
 
4.5%
p 5
 
4.5%
Other values (2) 6
 
5.5%
Decimal Number
ValueCountFrequency (%)
2 13
54.2%
8 4
 
16.7%
5 4
 
16.7%
3 2
 
8.3%
1 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 19
67.9%
& 9
32.1%
Close Punctuation
ValueCountFrequency (%)
) 843
100.0%
Open Punctuation
ValueCountFrequency (%)
( 837
100.0%
Space Separator
ValueCountFrequency (%)
133
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13749
85.8%
Common 1901
 
11.9%
Latin 381
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
976
 
7.1%
591
 
4.3%
453
 
3.3%
373
 
2.7%
337
 
2.5%
335
 
2.4%
325
 
2.4%
307
 
2.2%
302
 
2.2%
291
 
2.1%
Other values (327) 9459
68.8%
Latin
ValueCountFrequency (%)
C 31
 
8.1%
S 26
 
6.8%
D 25
 
6.6%
P 24
 
6.3%
K 23
 
6.0%
o 20
 
5.2%
e 18
 
4.7%
G 17
 
4.5%
T 17
 
4.5%
M 17
 
4.5%
Other values (23) 163
42.8%
Common
ValueCountFrequency (%)
) 843
44.3%
( 837
44.0%
133
 
7.0%
- 36
 
1.9%
. 19
 
1.0%
2 13
 
0.7%
& 9
 
0.5%
8 4
 
0.2%
5 4
 
0.2%
3 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13749
85.8%
ASCII 2282
 
14.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
976
 
7.1%
591
 
4.3%
453
 
3.3%
373
 
2.7%
337
 
2.5%
335
 
2.4%
325
 
2.4%
307
 
2.2%
302
 
2.2%
291
 
2.1%
Other values (327) 9459
68.8%
ASCII
ValueCountFrequency (%)
) 843
36.9%
( 837
36.7%
133
 
5.8%
- 36
 
1.6%
C 31
 
1.4%
S 26
 
1.1%
D 25
 
1.1%
P 24
 
1.1%
K 23
 
1.0%
o 20
 
0.9%
Other values (34) 284
 
12.4%

인허가번호
Real number (ℝ)

HIGH CORRELATION 

Distinct729
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0300002 × 1017
Minimum3.0300002 × 1017
Maximum3.0300006 × 1017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size22.5 KiB
2024-05-11T16:04:35.736672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0300002 × 1017
5-th percentile3.0300002 × 1017
Q13.0300002 × 1017
median3.0300002 × 1017
Q33.0300002 × 1017
95-th percentile3.0300002 × 1017
Maximum3.0300006 × 1017
Range4.12011 × 1010
Interquartile range (IQR)9.998 × 108

Descriptive statistics

Standard deviation2.017456 × 109
Coefficient of variation (CV)6.6582702 × 10-9
Kurtosis372.21351
Mean3.0300002 × 1017
Median Absolute Deviation (MAD)1200000
Skewness18.746119
Sum-2.1131952 × 1018
Variance4.0701288 × 1018
MonotonicityNot monotonic
2024-05-11T16:04:35.941039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
303000022200200086 19
 
0.7%
303000022200400105 16
 
0.6%
303000022199800031 16
 
0.6%
303000022201000023 15
 
0.6%
303000022200600019 14
 
0.5%
303000022200800012 14
 
0.5%
303000022200000053 14
 
0.5%
303000022200800005 13
 
0.5%
303000022200800006 13
 
0.5%
303000022198200004 12
 
0.5%
Other values (719) 2404
94.3%
ValueCountFrequency (%)
303000021000000001 4
0.2%
303000021197300007 4
0.2%
303000021197500040 2
 
0.1%
303000021197500081 2
 
0.1%
303000021197800008 6
0.2%
303000021197900001 4
0.2%
303000021198000039 1
 
< 0.1%
303000021198200091 1
 
< 0.1%
303000021198300085 2
 
0.1%
303000021198400099 2
 
0.1%
ValueCountFrequency (%)
303000062201100001 1
< 0.1%
303000062201000003 2
0.1%
303000062201000002 2
0.1%
303000062201000001 1
< 0.1%
303000025201200003 1
< 0.1%
303000022201700001 1
< 0.1%
303000022201600012 1
< 0.1%
303000022201600006 1
< 0.1%
303000022201600003 2
0.1%
303000022201600002 2
0.1%

업종코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
22
1575 
21
974 
25
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row22
2nd row22
3rd row22
4th row22
5th row22

Common Values

ValueCountFrequency (%)
22 1575
61.8%
21 974
38.2%
25 1
 
< 0.1%

Length

2024-05-11T16:04:36.378188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:36.494231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
22 1575
61.8%
21 974
38.2%
25 1
 
< 0.1%

업종명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
폐수배출업소관리
1574 
대기배출업소관리
843 
<NA>
 
132
기타수질오염원관리
 
1

Length

Max length9
Median length8
Mean length7.7933333
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row폐수배출업소관리
2nd row폐수배출업소관리
3rd row폐수배출업소관리
4th row폐수배출업소관리
5th row폐수배출업소관리

Common Values

ValueCountFrequency (%)
폐수배출업소관리 1574
61.7%
대기배출업소관리 843
33.1%
<NA> 132
 
5.2%
기타수질오염원관리 1
 
< 0.1%

Length

2024-05-11T16:04:36.638889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:36.751664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐수배출업소관리 1574
61.7%
대기배출업소관리 843
33.1%
na 132
 
5.2%
기타수질오염원관리 1
 
< 0.1%

지도점검일자
Real number (ℝ)

Distinct774
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20130359
Minimum20100108
Maximum20170629
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size22.5 KiB
2024-05-11T16:04:36.922795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20100108
5-th percentile20100514
Q120110747
median20130628
Q320150413
95-th percentile20161123
Maximum20170629
Range70521
Interquartile range (IQR)39666.25

Descriptive statistics

Standard deviation21182.399
Coefficient of variation (CV)0.0010522614
Kurtosis-1.1680308
Mean20130359
Median Absolute Deviation (MAD)19801.5
Skewness0.16470128
Sum5.1332416 × 1010
Variance4.4869402 × 108
MonotonicityDecreasing
2024-05-11T16:04:37.116216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20131213 56
 
2.2%
20111115 42
 
1.6%
20120522 40
 
1.6%
20121126 25
 
1.0%
20121120 18
 
0.7%
20100701 16
 
0.6%
20100623 16
 
0.6%
20120420 15
 
0.6%
20110531 15
 
0.6%
20100315 11
 
0.4%
Other values (764) 2296
90.0%
ValueCountFrequency (%)
20100108 3
0.1%
20100111 2
 
0.1%
20100113 1
 
< 0.1%
20100122 1
 
< 0.1%
20100125 1
 
< 0.1%
20100129 1
 
< 0.1%
20100205 5
0.2%
20100223 3
0.1%
20100224 6
0.2%
20100305 1
 
< 0.1%
ValueCountFrequency (%)
20170629 2
 
0.1%
20170614 2
 
0.1%
20170609 3
0.1%
20170608 1
 
< 0.1%
20170602 1
 
< 0.1%
20170530 6
0.2%
20170529 2
 
0.1%
20170526 1
 
< 0.1%
20170524 2
 
0.1%
20170523 3
0.1%

점검기관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
3030000
2550 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3030000
2nd row3030000
3rd row3030000
4th row3030000
5th row3030000

Common Values

ValueCountFrequency (%)
3030000 2550
100.0%

Length

2024-05-11T16:04:37.284980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:37.393486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3030000 2550
100.0%

점검기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
서울특별시 성동구
2550 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 성동구
2nd row서울특별시 성동구
3rd row서울특별시 성동구
4th row서울특별시 성동구
5th row서울특별시 성동구

Common Values

ValueCountFrequency (%)
서울특별시 성동구 2550
100.0%

Length

2024-05-11T16:04:37.506380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:37.613305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 2550
50.0%
성동구 2550
50.0%

지도점검구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
정기
2370 
수시
 
94
기타
 
82
<NA>
 
3
합동
 
1

Length

Max length4
Median length2
Mean length2.0023529
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row정기
2nd row정기
3rd row정기
4th row정기
5th row정기

Common Values

ValueCountFrequency (%)
정기 2370
92.9%
수시 94
 
3.7%
기타 82
 
3.2%
<NA> 3
 
0.1%
합동 1
 
< 0.1%

Length

2024-05-11T16:04:37.739495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:37.879307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기 2370
92.9%
수시 94
 
3.7%
기타 82
 
3.2%
na 3
 
0.1%
합동 1
 
< 0.1%

처분대상여부
Boolean

IMBALANCE  MISSING 

Distinct2
Distinct (%)0.1%
Missing135
Missing (%)5.3%
Memory size5.1 KiB
False
2414 
True
 
1
(Missing)
 
135
ValueCountFrequency (%)
False 2414
94.7%
True 1
 
< 0.1%
(Missing) 135
 
5.3%
2024-05-11T16:04:38.012131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct208
Distinct (%)8.2%
Missing11
Missing (%)0.4%
Memory size20.1 KiB
2024-05-11T16:04:38.295186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length42
Mean length21.332808
Min length6

Characters and Unicode

Total characters54164
Distinct characters118
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)4.3%

Sample

1st row폐수배출시설 및 수질오염방지시설 적정운영 여부
2nd row폐수배출시설 및 수질오염방지시설 적정운영 여부
3rd row폐수배출시설 및 수질오염방지시설 적정운영 여부
4th row폐수배출시설 및 수질오염방지시설 적정운영 여부
5th row폐수배출시설 및 수질오염방지시설 적정운영 여부
ValueCountFrequency (%)
2146
17.7%
여부 1608
13.2%
폐수배출시설 1300
10.7%
방지시설 1091
9.0%
적정운영 928
7.6%
적정 783
 
6.5%
수질오염방지시설 698
 
5.8%
배출시설 519
 
4.3%
대기배출시설 466
 
3.8%
운영 393
 
3.2%
Other values (152) 2207
18.2%
2024-05-11T16:04:38.782896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9604
17.7%
4687
 
8.7%
4650
 
8.6%
2565
 
4.7%
2458
 
4.5%
2455
 
4.5%
2357
 
4.4%
2262
 
4.2%
2198
 
4.1%
2102
 
3.9%
Other values (108) 18826
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44532
82.2%
Space Separator 9604
 
17.7%
Close Punctuation 9
 
< 0.1%
Open Punctuation 9
 
< 0.1%
Other Punctuation 7
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4687
 
10.5%
4650
 
10.4%
2565
 
5.8%
2458
 
5.5%
2455
 
5.5%
2357
 
5.3%
2262
 
5.1%
2198
 
4.9%
2102
 
4.7%
2099
 
4.7%
Other values (100) 16699
37.5%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
M 1
33.3%
T 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 6
85.7%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
9604
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44532
82.2%
Common 9629
 
17.8%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4687
 
10.5%
4650
 
10.4%
2565
 
5.8%
2458
 
5.5%
2455
 
5.5%
2357
 
5.3%
2262
 
5.1%
2198
 
4.9%
2102
 
4.7%
2099
 
4.7%
Other values (100) 16699
37.5%
Common
ValueCountFrequency (%)
9604
99.7%
) 9
 
0.1%
( 9
 
0.1%
, 6
 
0.1%
1
 
< 0.1%
Latin
ValueCountFrequency (%)
S 1
33.3%
M 1
33.3%
T 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44532
82.2%
ASCII 9631
 
17.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9604
99.7%
) 9
 
0.1%
( 9
 
0.1%
, 6
 
0.1%
S 1
 
< 0.1%
M 1
 
< 0.1%
T 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
4687
 
10.5%
4650
 
10.4%
2565
 
5.8%
2458
 
5.5%
2455
 
5.5%
2357
 
5.3%
2262
 
5.1%
2198
 
4.9%
2102
 
4.7%
2099
 
4.7%
Other values (100) 16699
37.5%
None
ValueCountFrequency (%)
1
100.0%

점검결과
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2550
Missing (%)100.0%
Memory size22.5 KiB
Distinct538
Distinct (%)50.7%
Missing1489
Missing (%)58.4%
Memory size20.1 KiB
2024-05-11T16:04:39.110971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length44
Mean length30.02262
Min length22

Characters and Unicode

Total characters31854
Distinct characters187
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique262 ?
Unique (%)24.7%

Sample

1st row서울특별시 성동구 왕십리로 222 (행당동)
2nd row서울특별시 성동구 왕십리로 222 (행당동)
3rd row서울특별시 성동구 동일로 143 (성수동2가, 성수1차 대우아파트)
4th row서울특별시 성동구 마장로 188 (홍익동)
5th row서울특별시 성동구 아차산로17길 33 (성수동2가)
ValueCountFrequency (%)
서울특별시 1061
18.4%
성동구 1061
18.4%
성수동2가 660
 
11.5%
성수동1가 143
 
2.5%
1층 107
 
1.9%
용답동 66
 
1.1%
아차산로5길 52
 
0.9%
연무장길 52
 
0.9%
자동차시장1길 51
 
0.9%
성수이로18길 49
 
0.9%
Other values (443) 2456
42.7%
2024-05-11T16:04:39.715560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4938
 
15.5%
2402
 
7.5%
2231
 
7.0%
1 1515
 
4.8%
2 1385
 
4.3%
1338
 
4.2%
1119
 
3.5%
( 1105
 
3.5%
) 1104
 
3.5%
1073
 
3.4%
Other values (177) 13644
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18721
58.8%
Decimal Number 5211
 
16.4%
Space Separator 4938
 
15.5%
Open Punctuation 1105
 
3.5%
Close Punctuation 1104
 
3.5%
Other Punctuation 522
 
1.6%
Dash Punctuation 217
 
0.7%
Uppercase Letter 24
 
0.1%
Math Symbol 9
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2402
12.8%
2231
11.9%
1338
 
7.1%
1119
 
6.0%
1073
 
5.7%
1069
 
5.7%
1063
 
5.7%
1061
 
5.7%
1061
 
5.7%
1004
 
5.4%
Other values (152) 5300
28.3%
Decimal Number
ValueCountFrequency (%)
1 1515
29.1%
2 1385
26.6%
3 458
 
8.8%
4 366
 
7.0%
6 293
 
5.6%
5 279
 
5.4%
8 241
 
4.6%
0 233
 
4.5%
7 223
 
4.3%
9 218
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
B 9
37.5%
A 3
 
12.5%
G 3
 
12.5%
F 3
 
12.5%
N 3
 
12.5%
E 3
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 518
99.2%
. 4
 
0.8%
Space Separator
ValueCountFrequency (%)
4938
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1105
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1104
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 217
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18721
58.8%
Common 13107
41.1%
Latin 26
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2402
12.8%
2231
11.9%
1338
 
7.1%
1119
 
6.0%
1073
 
5.7%
1069
 
5.7%
1063
 
5.7%
1061
 
5.7%
1061
 
5.7%
1004
 
5.4%
Other values (152) 5300
28.3%
Common
ValueCountFrequency (%)
4938
37.7%
1 1515
 
11.6%
2 1385
 
10.6%
( 1105
 
8.4%
) 1104
 
8.4%
, 518
 
4.0%
3 458
 
3.5%
4 366
 
2.8%
6 293
 
2.2%
5 279
 
2.1%
Other values (8) 1146
 
8.7%
Latin
ValueCountFrequency (%)
B 9
34.6%
A 3
 
11.5%
G 3
 
11.5%
F 3
 
11.5%
N 3
 
11.5%
E 3
 
11.5%
b 2
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18721
58.8%
ASCII 13132
41.2%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4938
37.6%
1 1515
 
11.5%
2 1385
 
10.5%
( 1105
 
8.4%
) 1104
 
8.4%
, 518
 
3.9%
3 458
 
3.5%
4 366
 
2.8%
6 293
 
2.2%
5 279
 
2.1%
Other values (14) 1171
 
8.9%
Hangul
ValueCountFrequency (%)
2402
12.8%
2231
11.9%
1338
 
7.1%
1119
 
6.0%
1073
 
5.7%
1069
 
5.7%
1063
 
5.7%
1061
 
5.7%
1061
 
5.7%
1004
 
5.4%
Other values (152) 5300
28.3%
CJK Compat
ValueCountFrequency (%)
1
100.0%

소재지주소
Text

MISSING 

Distinct681
Distinct (%)27.9%
Missing110
Missing (%)4.3%
Memory size20.1 KiB
2024-05-11T16:04:40.330614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length45
Mean length27.045492
Min length16

Characters and Unicode

Total characters65991
Distinct characters136
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)7.4%

Sample

1st row서울특별시 성동구 행당동 17번지
2nd row서울특별시 성동구 행당동 17
3rd row서울특별시 성동구 성수동2가 279-50번지
4th row서울특별시 성동구 홍익동 410번지
5th row서울특별시 성동구 성수동2가 279-15번지
ValueCountFrequency (%)
성동구 2443
21.9%
서울특별시 2440
21.8%
성수동2가 1743
15.6%
성수동1가 432
 
3.9%
1층 369
 
3.3%
용답동 158
 
1.4%
2층 129
 
1.2%
2가3동 93
 
0.8%
일부 71
 
0.6%
지하1층 59
 
0.5%
Other values (680) 3240
29.0%
2024-05-11T16:04:40.886890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11102
16.8%
5157
 
7.8%
2 4926
 
7.5%
4627
 
7.0%
1 2911
 
4.4%
2448
 
3.7%
2443
 
3.7%
2443
 
3.7%
2440
 
3.7%
2440
 
3.7%
Other values (126) 25054
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35768
54.2%
Decimal Number 15843
24.0%
Space Separator 11102
 
16.8%
Dash Punctuation 2424
 
3.7%
Open Punctuation 354
 
0.5%
Close Punctuation 348
 
0.5%
Other Punctuation 120
 
0.2%
Uppercase Letter 17
 
< 0.1%
Math Symbol 9
 
< 0.1%
Lowercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5157
14.4%
4627
12.9%
2448
 
6.8%
2443
 
6.8%
2443
 
6.8%
2440
 
6.8%
2440
 
6.8%
2440
 
6.8%
2372
 
6.6%
2300
 
6.4%
Other values (106) 6658
18.6%
Decimal Number
ValueCountFrequency (%)
2 4926
31.1%
1 2911
18.4%
7 1592
 
10.0%
3 1524
 
9.6%
6 938
 
5.9%
9 888
 
5.6%
5 872
 
5.5%
8 822
 
5.2%
4 750
 
4.7%
0 620
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 110
91.7%
. 10
 
8.3%
Space Separator
ValueCountFrequency (%)
11102
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2424
100.0%
Open Punctuation
ValueCountFrequency (%)
( 354
100.0%
Close Punctuation
ValueCountFrequency (%)
) 348
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35768
54.2%
Common 30201
45.8%
Latin 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5157
14.4%
4627
12.9%
2448
 
6.8%
2443
 
6.8%
2443
 
6.8%
2440
 
6.8%
2440
 
6.8%
2440
 
6.8%
2372
 
6.6%
2300
 
6.4%
Other values (106) 6658
18.6%
Common
ValueCountFrequency (%)
11102
36.8%
2 4926
16.3%
1 2911
 
9.6%
- 2424
 
8.0%
7 1592
 
5.3%
3 1524
 
5.0%
6 938
 
3.1%
9 888
 
2.9%
5 872
 
2.9%
8 822
 
2.7%
Other values (8) 2202
 
7.3%
Latin
ValueCountFrequency (%)
B 17
77.3%
b 5
 
22.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35768
54.2%
ASCII 30222
45.8%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11102
36.7%
2 4926
16.3%
1 2911
 
9.6%
- 2424
 
8.0%
7 1592
 
5.3%
3 1524
 
5.0%
6 938
 
3.1%
9 888
 
2.9%
5 872
 
2.9%
8 822
 
2.7%
Other values (9) 2223
 
7.4%
Hangul
ValueCountFrequency (%)
5157
14.4%
4627
12.9%
2448
 
6.8%
2443
 
6.8%
2443
 
6.8%
2440
 
6.8%
2440
 
6.8%
2440
 
6.8%
2372
 
6.6%
2300
 
6.4%
Other values (106) 6658
18.6%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Interactions

2024-05-11T16:04:33.893500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:04:33.545664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:04:34.020760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:04:33.720414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T16:04:40.987865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호업종코드업종명지도점검일자지도점검구분처분대상여부
인허가번호1.0000.9430.9430.0830.1210.000
업종코드0.9431.0001.0000.0880.0590.000
업종명0.9431.0001.0000.2540.0780.000
지도점검일자0.0830.0880.2541.0000.2480.000
지도점검구분0.1210.0590.0780.2481.0000.155
처분대상여부0.0000.0000.0000.0000.1551.000
2024-05-11T16:04:41.090467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지도점검구분처분대상여부업종명업종코드
지도점검구분1.0000.1030.0740.055
처분대상여부0.1031.0000.0000.000
업종명0.0740.0001.0001.000
업종코드0.0550.0001.0001.000
2024-05-11T16:04:41.182907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호지도점검일자업종코드업종명지도점검구분처분대상여부
인허가번호1.0000.0820.7070.7070.1140.000
지도점검일자0.0821.0000.0540.1660.1130.000
업종코드0.7070.0541.0001.0000.0550.000
업종명0.7070.1661.0001.0000.0740.000
지도점검구분0.1140.1130.0550.0741.0000.103
처분대상여부0.0000.0000.0000.0000.1031.000

Missing values

2024-05-11T16:04:34.213566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T16:04:34.446303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T16:04:34.621949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체(시설)명인허가번호업종코드업종명지도점검일자점검기관점검기관명지도점검구분처분대상여부점검사항점검결과소재지도로명주소소재지주소
0학교법인 한양대학교30300002219790002222폐수배출업소관리201706293030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 왕십리로 222 (행당동)서울특별시 성동구 행당동 17번지
1한양대학병원30300002219730001822폐수배출업소관리201706293030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 왕십리로 222 (행당동)서울특별시 성동구 행당동 17
2(주)엘앤피에스30300002220160000122폐수배출업소관리201706143030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 동일로 143 (성수동2가, 성수1차 대우아파트)서울특별시 성동구 성수동2가 279-50번지
3홍익점30300002220170000122폐수배출업소관리201706143030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 마장로 188 (홍익동)서울특별시 성동구 홍익동 410번지
4(주)한미더블유비오30300002220000005322폐수배출업소관리201706093030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 아차산로17길 33 (성수동2가)서울특별시 성동구 성수동2가 279-15번지
5하이텍스테크30300002219970006022폐수배출업소관리201706093030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 성수이로26길 55, 3층 (성수동2가)서울특별시 성동구 성수동2가 279-30번지
6하이텍스테크30300002219970006022폐수배출업소관리201706093030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 성수이로26길 55, 3층 (성수동2가)서울특별시 성동구 성수동2가 279-30번지
7이레제경사30300002220050003122폐수배출업소관리201706083030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 아차산로9길 21 (성수동2가,(지하1층))서울특별시 성동구 성수동2가 277-175번지 (지하1층)
8세진테크30300002220130000122폐수배출업소관리201706023030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 연무장길 28-10 (성수동2가)서울특별시 성동구 성수동2가 310-53번지
9(주)애드피앤씨30300002220110000922폐수배출업소관리201705303030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부<NA>서울특별시 성동구 성수이로 144-29 (성수동2가, 지하 201호)서울특별시 성동구 성수동2가 277-7번지 지하 201호
업체(시설)명인허가번호업종코드업종명지도점검일자점검기관점검기관명지도점검구분처분대상여부점검사항점검결과소재지도로명주소소재지주소
2540주-삼표성수공장30300002120040003421대기배출업소관리201002053030000서울특별시 성동구정기N대기배출시설 및 방지시설 정상운영 여부<NA><NA>서울특별시 성동구 성수동1가 683번지
2541현대섬유30300002220080002522폐수배출업소관리201001293030000서울특별시 성동구정기N폐수배출시설 지도점검<NA><NA>서울특별시 성동구 성수동2가 273-52번지 주1(1층)
2542에이아이모터스(주)30300002120080000821대기배출업소관리201001253030000서울특별시 성동구기타N대기배출시설 및 방지시설 정상운영 여부<NA><NA>서울특별시 성동구 성수동2가 315-3번지
2543삼정자동차공업사30300002119910012221대기배출업소관리201001223030000서울특별시 성동구정기N대기배출시설 및 방지시설 정상가동 여부<NA><NA>서울특별시 성동구 용답동 238-1번지
2544(주)동진에벤에셀30300002119820009121대기배출업소관리201001133030000서울특별시 성동구정기N대기배출시설 및 방지시설 정상운영 여부<NA><NA>서울특별시 성동구 성수동2가 3동 273-12호
2545주-보배모터스30300002120090001021대기배출업소관리201001113030000서울특별시 성동구정기N대기배출시설 및 방지시설 정상운영 여부<NA><NA>서울특별시 성동구 성수동2가 273-12번지
2546M-Tech30300002120090002821대기배출업소관리201001113030000서울특별시 성동구정기N대기배출시설 및 방지시설 정상운영 여부<NA><NA>서울특별시 성동구 성수동2가 271-20번지
2547주화섬유30300002220020008622폐수배출업소관리201001083030000서울특별시 성동구정기N폐수배출시설 지도점검<NA><NA>서울특별시 성동구 성수동2가 269-6번지 1층 일부
2548(주)광성모터스30300002220090002022폐수배출업소관리201001083030000서울특별시 성동구정기N폐수배출시설 지도점검<NA><NA>서울특별시 성동구 성수동2가 280-14번지 1층
2549광훈실업30300002220000006522폐수배출업소관리201001083030000서울특별시 성동구정기N폐수배출시설 지도점검<NA><NA>서울특별시 성동구 성수동2가 269-20 (2가1동)

Duplicate rows

Most frequently occurring

업체(시설)명인허가번호업종코드업종명지도점검일자점검기관점검기관명지도점검구분처분대상여부점검사항소재지도로명주소소재지주소# duplicates
0성빈산업30300002220020064622폐수배출업소관리201204183030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정 운영여부<NA>서울특별시 성동구 성수동2가 322-3번지 1층 일부2
1신광도금30300002120120000321대기배출업소관리201501303030000서울특별시 성동구정기N배출시설 및 방지시설 적정 여부서울특별시 성동구 성수이로16길 31 (성수동2가)<NA>2
2조이실업30300002220050000422폐수배출업소관리201203223030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설의 적정운영 여부<NA>서울특별시 성동구 성수동2가 278-44번지 (1층)2
3하이텍스테크30300002219970006022폐수배출업소관리201706093030000서울특별시 성동구정기N폐수배출시설 및 수질오염방지시설 적정운영 여부서울특별시 성동구 성수이로26길 55, 3층 (성수동2가)서울특별시 성동구 성수동2가 279-30번지2