Overview

Dataset statistics

Number of variables48
Number of observations101
Missing cells2185
Missing cells (%)45.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.9 KiB
Average record size in memory414.3 B

Variable types

Categorical12
Numeric12
DateTime4
Unsupported10
Text10

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),실험실면적,사업장구분명,영업소면적,위탁업체명,실험실지역코드,실험실우편번호,실험실산,실험실번지,실험실호,실험실통,실험실반,실험실특수주소,실험실특수주소동,실험실특수주소호,실험실도로명주소시군구코드,실험실도로명주소읍면동코드,실험실도로명주소읍면동구분,실험실도로명주소코드,실험실도로명특수주소,실험실도로명주소건물층구분,실험실도로명주소건물본번호,실험실도로명주소건물부번호,실험실도로명주소우편번호
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-19524/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
실험실면적 is highly imbalanced (88.0%)Imbalance
실험실도로명주소건물부번호 is highly imbalanced (86.1%)Imbalance
인허가취소일자 has 101 (100.0%) missing valuesMissing
폐업일자 has 66 (65.3%) missing valuesMissing
휴업시작일자 has 101 (100.0%) missing valuesMissing
휴업종료일자 has 101 (100.0%) missing valuesMissing
재개업일자 has 101 (100.0%) missing valuesMissing
전화번호 has 5 (5.0%) missing valuesMissing
소재지면적 has 101 (100.0%) missing valuesMissing
소재지우편번호 has 99 (98.0%) missing valuesMissing
지번주소 has 3 (3.0%) missing valuesMissing
도로명주소 has 8 (7.9%) missing valuesMissing
도로명우편번호 has 57 (56.4%) missing valuesMissing
업태구분명 has 97 (96.0%) missing valuesMissing
좌표정보(X) has 4 (4.0%) missing valuesMissing
좌표정보(Y) has 4 (4.0%) missing valuesMissing
영업소면적 has 94 (93.1%) missing valuesMissing
위탁업체명 has 74 (73.3%) missing valuesMissing
실험실지역코드 has 63 (62.4%) missing valuesMissing
실험실우편번호 has 65 (64.4%) missing valuesMissing
실험실번지 has 65 (64.4%) missing valuesMissing
실험실호 has 73 (72.3%) missing valuesMissing
실험실통 has 101 (100.0%) missing valuesMissing
실험실반 has 101 (100.0%) missing valuesMissing
실험실특수주소 has 78 (77.2%) missing valuesMissing
실험실특수주소동 has 101 (100.0%) missing valuesMissing
실험실특수주소호 has 101 (100.0%) missing valuesMissing
실험실도로명주소시군구코드 has 64 (63.4%) missing valuesMissing
실험실도로명주소읍면동코드 has 64 (63.4%) missing valuesMissing
실험실도로명주소코드 has 64 (63.4%) missing valuesMissing
실험실도로명특수주소 has 64 (63.4%) missing valuesMissing
실험실도로명주소건물본번호 has 64 (63.4%) missing valuesMissing
실험실도로명주소우편번호 has 101 (100.0%) missing valuesMissing
관리번호 has unique valuesUnique
최종수정일자 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업시작일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업종료일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
실험실통 is an unsupported type, check if it needs cleaning or further analysisUnsupported
실험실반 is an unsupported type, check if it needs cleaning or further analysisUnsupported
실험실특수주소동 is an unsupported type, check if it needs cleaning or further analysisUnsupported
실험실특수주소호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
실험실도로명주소우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-17 18:13:13.085000
Analysis finished2024-04-17 18:13:13.937340
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
3220000
101 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3220000
2nd row3220000
3rd row3220000
4th row3220000
5th row3220000

Common Values

ValueCountFrequency (%)
3220000 101
100.0%

Length

2024-04-18T03:13:13.986437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:14.053197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3220000 101
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct101
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2200007 × 1017
Minimum3.2200007 × 1017
Maximum3.2200007 × 1017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:14.132658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.2200007 × 1017
5-th percentile3.2200007 × 1017
Q13.2200007 × 1017
median3.2200007 × 1017
Q33.2200007 × 1017
95-th percentile3.2200007 × 1017
Maximum3.2200007 × 1017
Range4299977
Interquartile range (IQR)300032

Descriptive statistics

Standard deviation787298.75
Coefficient of variation (CV)2.4450267 × 10-12
Kurtosis6.6224448
Mean3.2200007 × 1017
Median Absolute Deviation (MAD)99584
Skewness-2.0086413
Sum-4.3714814 × 1018
Variance6.1983932 × 1011
MonotonicityStrictly increasing
2024-04-18T03:13:14.234743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
322000067198100026 1
 
1.0%
322000067201200001 1
 
1.0%
322000067201300006 1
 
1.0%
322000067201300005 1
 
1.0%
322000067201300004 1
 
1.0%
322000067201300003 1
 
1.0%
322000067201300002 1
 
1.0%
322000067201300001 1
 
1.0%
322000067201200005 1
 
1.0%
322000067201200004 1
 
1.0%
Other values (91) 91
90.1%
ValueCountFrequency (%)
322000067198100026 1
1.0%
322000067198100032 1
1.0%
322000067198100040 1
1.0%
322000067198200001 1
1.0%
322000067199200288 1
1.0%
322000067199700463 1
1.0%
322000067200200210 1
1.0%
322000067200500772 1
1.0%
322000067200600986 1
1.0%
322000067200900001 1
1.0%
ValueCountFrequency (%)
322000067202400003 1
1.0%
322000067202400002 1
1.0%
322000067202400001 1
1.0%
322000067202300001 1
1.0%
322000067202200003 1
1.0%
322000067202200002 1
1.0%
322000067202200001 1
1.0%
322000067202100003 1
1.0%
322000067202100002 1
1.0%
322000067202100001 1
1.0%
Distinct72
Distinct (%)71.3%
Missing0
Missing (%)0.0%
Memory size940.0 B
Minimum1981-09-30 00:00:00
Maximum2024-02-26 00:00:00
2024-04-18T03:13:14.336761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:13:14.442142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB
Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
3
50 
1
45 
5
 
4
4
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
3 50
49.5%
1 45
44.6%
5 4
 
4.0%
4 2
 
2.0%

Length

2024-04-18T03:13:14.536883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:14.608605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 50
49.5%
1 45
44.6%
5 4
 
4.0%
4 2
 
2.0%

영업상태명
Categorical

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
폐업
50 
영업/정상
45 
제외/삭제/전출
 
4
취소/말소/만료/정지/중지
 
2

Length

Max length14
Median length8
Mean length3.8118812
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row폐업
3rd row폐업
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
폐업 50
49.5%
영업/정상 45
44.6%
제외/삭제/전출 4
 
4.0%
취소/말소/만료/정지/중지 2
 
2.0%

Length

2024-04-18T03:13:14.689039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:14.769493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 50
49.5%
영업/정상 45
44.6%
제외/삭제/전출 4
 
4.0%
취소/말소/만료/정지/중지 2
 
2.0%
Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
2
50 
BBBB
45 
5
 
4
4
 
2

Length

Max length4
Median length1
Mean length2.3366337
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th rowBBBB
5th rowBBBB

Common Values

ValueCountFrequency (%)
2 50
49.5%
BBBB 45
44.6%
5 4
 
4.0%
4 2
 
2.0%

Length

2024-04-18T03:13:14.854046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:14.948053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 50
49.5%
bbbb 45
44.6%
5 4
 
4.0%
4 2
 
2.0%
Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
폐업
50 
영업
45 
제외사항
 
4
폐쇄
 
2

Length

Max length4
Median length2
Mean length2.0792079
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row폐업
3rd row폐업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
폐업 50
49.5%
영업 45
44.6%
제외사항 4
 
4.0%
폐쇄 2
 
2.0%

Length

2024-04-18T03:13:15.041528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:15.120920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 50
49.5%
영업 45
44.6%
제외사항 4
 
4.0%
폐쇄 2
 
2.0%

폐업일자
Date

MISSING 

Distinct29
Distinct (%)82.9%
Missing66
Missing (%)65.3%
Memory size940.0 B
Minimum2010-05-26 00:00:00
Maximum2024-02-26 00:00:00
2024-04-18T03:13:15.204537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:13:15.290195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

휴업종료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

전화번호
Text

MISSING 

Distinct84
Distinct (%)87.5%
Missing5
Missing (%)5.0%
Memory size940.0 B
2024-04-18T03:13:15.498341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.78125
Min length7

Characters and Unicode

Total characters843
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)77.1%

Sample

1st row20089812
2nd row20089812
3rd row20089812
4th row027402434
5th row62020901
ValueCountFrequency (%)
02-6960-6114 3
 
3.1%
20089812 3
 
3.1%
0522699825 2
 
2.1%
5105691 2
 
2.1%
62020901 2
 
2.1%
02-6420-3821 2
 
2.1%
6202-0114 2
 
2.1%
22260003 2
 
2.1%
02-2008-0557 2
 
2.1%
34482114 2
 
2.1%
Other values (74) 74
77.1%
2024-04-18T03:13:15.807966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 134
15.9%
1 109
12.9%
2 103
12.2%
5 89
10.6%
3 72
8.5%
6 64
7.6%
4 63
7.5%
- 58
6.9%
8 54
6.4%
9 50
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 785
93.1%
Dash Punctuation 58
 
6.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 134
17.1%
1 109
13.9%
2 103
13.1%
5 89
11.3%
3 72
9.2%
6 64
8.2%
4 63
8.0%
8 54
6.9%
9 50
 
6.4%
7 47
 
6.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 843
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 134
15.9%
1 109
12.9%
2 103
12.2%
5 89
10.6%
3 72
8.5%
6 64
7.6%
4 63
7.5%
- 58
6.9%
8 54
6.4%
9 50
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 843
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 134
15.9%
1 109
12.9%
2 103
12.2%
5 89
10.6%
3 72
8.5%
6 64
7.6%
4 63
7.5%
- 58
6.9%
8 54
6.4%
9 50
 
5.9%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

소재지우편번호
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing99
Missing (%)98.0%
Memory size940.0 B
2024-04-18T03:13:15.915092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters12
Distinct characters9
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row135889
2nd rownull
ValueCountFrequency (%)
135889 1
50.0%
null 1
50.0%
2024-04-18T03:13:16.090947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 2
16.7%
l 2
16.7%
2
16.7%
1 1
8.3%
3 1
8.3%
5 1
8.3%
9 1
8.3%
n 1
8.3%
u 1
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6
50.0%
Lowercase Letter 4
33.3%
Space Separator 2
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 2
33.3%
1 1
16.7%
3 1
16.7%
5 1
16.7%
9 1
16.7%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
n 1
25.0%
u 1
25.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8
66.7%
Latin 4
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
8 2
25.0%
2
25.0%
1 1
12.5%
3 1
12.5%
5 1
12.5%
9 1
12.5%
Latin
ValueCountFrequency (%)
l 2
50.0%
n 1
25.0%
u 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 2
16.7%
l 2
16.7%
2
16.7%
1 1
8.3%
3 1
8.3%
5 1
8.3%
9 1
8.3%
n 1
8.3%
u 1
8.3%

지번주소
Text

MISSING 

Distinct81
Distinct (%)82.7%
Missing3
Missing (%)3.0%
Memory size940.0 B
2024-04-18T03:13:16.323956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length32
Mean length24.357143
Min length14

Characters and Unicode

Total characters2387
Distinct characters132
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)71.4%

Sample

1st row서울특별시 강남구 삼성동 160
2nd row서울특별시 강남구 삼성동 160
3rd row서울특별시 강남구 삼성동 160
4th row서울특별시 강남구 역삼동 662-7
5th row서울특별시 강남구 역삼동 832-40
ValueCountFrequency (%)
서울특별시 97
20.0%
강남구 97
20.0%
역삼동 33
 
6.8%
대치동 12
 
2.5%
삼성동 12
 
2.5%
논현동 11
 
2.3%
수서동 8
 
1.6%
도곡동 6
 
1.2%
5층 5
 
1.0%
신사동 5
 
1.0%
Other values (148) 199
41.0%
2024-04-18T03:13:16.671177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
454
19.0%
107
 
4.5%
102
 
4.3%
99
 
4.1%
99
 
4.1%
98
 
4.1%
98
 
4.1%
97
 
4.1%
97
 
4.1%
97
 
4.1%
Other values (122) 1039
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1405
58.9%
Space Separator 454
 
19.0%
Decimal Number 448
 
18.8%
Dash Punctuation 73
 
3.1%
Other Punctuation 4
 
0.2%
Open Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
 
7.6%
102
 
7.3%
99
 
7.0%
99
 
7.0%
98
 
7.0%
98
 
7.0%
97
 
6.9%
97
 
6.9%
97
 
6.9%
49
 
3.5%
Other values (106) 462
32.9%
Decimal Number
ValueCountFrequency (%)
1 74
16.5%
2 63
14.1%
3 55
12.3%
4 51
11.4%
6 42
9.4%
8 38
8.5%
0 36
8.0%
7 34
7.6%
5 33
7.4%
9 22
 
4.9%
Space Separator
ValueCountFrequency (%)
454
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1405
58.9%
Common 981
41.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
 
7.6%
102
 
7.3%
99
 
7.0%
99
 
7.0%
98
 
7.0%
98
 
7.0%
97
 
6.9%
97
 
6.9%
97
 
6.9%
49
 
3.5%
Other values (106) 462
32.9%
Common
ValueCountFrequency (%)
454
46.3%
1 74
 
7.5%
- 73
 
7.4%
2 63
 
6.4%
3 55
 
5.6%
4 51
 
5.2%
6 42
 
4.3%
8 38
 
3.9%
0 36
 
3.7%
7 34
 
3.5%
Other values (5) 61
 
6.2%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1405
58.9%
ASCII 982
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
454
46.2%
1 74
 
7.5%
- 73
 
7.4%
2 63
 
6.4%
3 55
 
5.6%
4 51
 
5.2%
6 42
 
4.3%
8 38
 
3.9%
0 36
 
3.7%
7 34
 
3.5%
Other values (6) 62
 
6.3%
Hangul
ValueCountFrequency (%)
107
 
7.6%
102
 
7.3%
99
 
7.0%
99
 
7.0%
98
 
7.0%
98
 
7.0%
97
 
6.9%
97
 
6.9%
97
 
6.9%
49
 
3.5%
Other values (106) 462
32.9%

도로명주소
Text

MISSING 

Distinct82
Distinct (%)88.2%
Missing8
Missing (%)7.9%
Memory size940.0 B
2024-04-18T03:13:16.896318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length38
Mean length31.88172
Min length23

Characters and Unicode

Total characters2965
Distinct characters149
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)79.6%

Sample

1st row서울특별시 강남구 영동대로 520 (삼성동)
2nd row서울특별시 강남구 영동대로 520 (삼성동)
3rd row서울특별시 강남구 영동대로 520 (삼성동)
4th row서울특별시 강남구 언주로 547, 13층 (역삼동)
5th row서울특별시 강남구 도산대로 139 (신사동)
ValueCountFrequency (%)
서울특별시 92
 
16.5%
강남구 92
 
16.5%
역삼동 19
 
3.4%
삼성동 10
 
1.8%
대치동 9
 
1.6%
논현동 9
 
1.6%
언주로 8
 
1.4%
영동대로 8
 
1.4%
5층 7
 
1.3%
테헤란로 7
 
1.3%
Other values (203) 297
53.2%
2024-04-18T03:13:17.218751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
496
 
16.7%
109
 
3.7%
101
 
3.4%
100
 
3.4%
98
 
3.3%
97
 
3.3%
) 94
 
3.2%
( 94
 
3.2%
94
 
3.2%
93
 
3.1%
Other values (139) 1589
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1754
59.2%
Space Separator 496
 
16.7%
Decimal Number 435
 
14.7%
Close Punctuation 94
 
3.2%
Open Punctuation 94
 
3.2%
Other Punctuation 83
 
2.8%
Dash Punctuation 7
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
6.2%
101
 
5.8%
100
 
5.7%
98
 
5.6%
97
 
5.5%
94
 
5.4%
93
 
5.3%
93
 
5.3%
92
 
5.2%
92
 
5.2%
Other values (122) 785
44.8%
Decimal Number
ValueCountFrequency (%)
1 91
20.9%
2 62
14.3%
3 59
13.6%
0 52
12.0%
5 45
10.3%
4 38
8.7%
6 33
 
7.6%
7 27
 
6.2%
8 19
 
4.4%
9 9
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
496
100.0%
Close Punctuation
ValueCountFrequency (%)
) 94
100.0%
Open Punctuation
ValueCountFrequency (%)
( 94
100.0%
Other Punctuation
ValueCountFrequency (%)
, 83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1754
59.2%
Common 1209
40.8%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
6.2%
101
 
5.8%
100
 
5.7%
98
 
5.6%
97
 
5.5%
94
 
5.4%
93
 
5.3%
93
 
5.3%
92
 
5.2%
92
 
5.2%
Other values (122) 785
44.8%
Common
ValueCountFrequency (%)
496
41.0%
) 94
 
7.8%
( 94
 
7.8%
1 91
 
7.5%
, 83
 
6.9%
2 62
 
5.1%
3 59
 
4.9%
0 52
 
4.3%
5 45
 
3.7%
4 38
 
3.1%
Other values (5) 95
 
7.9%
Latin
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1754
59.2%
ASCII 1211
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
496
41.0%
) 94
 
7.8%
( 94
 
7.8%
1 91
 
7.5%
, 83
 
6.9%
2 62
 
5.1%
3 59
 
4.9%
0 52
 
4.3%
5 45
 
3.7%
4 38
 
3.1%
Other values (7) 97
 
8.0%
Hangul
ValueCountFrequency (%)
109
 
6.2%
101
 
5.8%
100
 
5.7%
98
 
5.6%
97
 
5.5%
94
 
5.4%
93
 
5.3%
93
 
5.3%
92
 
5.2%
92
 
5.2%
Other values (122) 785
44.8%

도로명우편번호
Text

MISSING 

Distinct33
Distinct (%)75.0%
Missing57
Missing (%)56.4%
Memory size940.0 B
2024-04-18T03:13:17.393822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.25
Min length5

Characters and Unicode

Total characters231
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)54.5%

Sample

1st row06138
2nd row06112
3rd row06057
4th row06012
5th row06097
ValueCountFrequency (%)
06188 3
 
6.8%
06138 3
 
6.8%
135912 2
 
4.5%
06112 2
 
4.5%
06057 2
 
4.5%
06129 2
 
4.5%
06044 2
 
4.5%
06252 2
 
4.5%
06132 2
 
4.5%
135081 1
 
2.3%
Other values (23) 23
52.3%
2024-04-18T03:13:17.652789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 43
18.6%
1 39
16.9%
6 35
15.2%
5 25
10.8%
3 24
10.4%
8 18
7.8%
2 18
7.8%
9 10
 
4.3%
7 10
 
4.3%
4 8
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 230
99.6%
Dash Punctuation 1
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 43
18.7%
1 39
17.0%
6 35
15.2%
5 25
10.9%
3 24
10.4%
8 18
7.8%
2 18
7.8%
9 10
 
4.3%
7 10
 
4.3%
4 8
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 231
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 43
18.6%
1 39
16.9%
6 35
15.2%
5 25
10.8%
3 24
10.4%
8 18
7.8%
2 18
7.8%
9 10
 
4.3%
7 10
 
4.3%
4 8
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 231
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 43
18.6%
1 39
16.9%
6 35
15.2%
5 25
10.8%
3 24
10.4%
8 18
7.8%
2 18
7.8%
9 10
 
4.3%
7 10
 
4.3%
4 8
 
3.5%
Distinct86
Distinct (%)85.1%
Missing0
Missing (%)0.0%
Memory size940.0 B
2024-04-18T03:13:17.846720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length8.6237624
Min length5

Characters and Unicode

Total characters871
Distinct characters163
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)72.3%

Sample

1st row현대산업개발(주)[소음진동]
2nd row현대산업개발(주)[수질]
3rd row현대산업개발(주)[대기]
4th row삼환기업(주)
5th row(주)유신
ValueCountFrequency (%)
주)유신 4
 
3.8%
세아stx엔테크 3
 
2.9%
한일건설(주 2
 
1.9%
주)고도엔바 2
 
1.9%
주)에스코알티에스 2
 
1.9%
두산건설(주 2
 
1.9%
도요엔지니어링코리아(주 2
 
1.9%
주)이엔쓰리환경 2
 
1.9%
주)에팩 2
 
1.9%
주)이테크건설[대기 2
 
1.9%
Other values (79) 82
78.1%
2024-04-18T03:13:18.154010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
 
10.3%
) 89
 
10.2%
( 89
 
10.2%
33
 
3.8%
28
 
3.2%
19
 
2.2%
18
 
2.1%
17
 
2.0%
16
 
1.8%
15
 
1.7%
Other values (153) 457
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 657
75.4%
Close Punctuation 95
 
10.9%
Open Punctuation 95
 
10.9%
Uppercase Letter 15
 
1.7%
Space Separator 4
 
0.5%
Decimal Number 3
 
0.3%
Dash Punctuation 1
 
0.1%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
13.7%
33
 
5.0%
28
 
4.3%
19
 
2.9%
18
 
2.7%
17
 
2.6%
16
 
2.4%
15
 
2.3%
15
 
2.3%
12
 
1.8%
Other values (135) 394
60.0%
Uppercase Letter
ValueCountFrequency (%)
T 3
20.0%
S 3
20.0%
X 3
20.0%
G 2
13.3%
E 1
 
6.7%
I 1
 
6.7%
L 1
 
6.7%
N 1
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
9 1
33.3%
2 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 89
93.7%
] 6
 
6.3%
Open Punctuation
ValueCountFrequency (%)
( 89
93.7%
[ 6
 
6.3%
Space Separator
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 658
75.5%
Common 198
 
22.7%
Latin 15
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
13.7%
33
 
5.0%
28
 
4.3%
19
 
2.9%
18
 
2.7%
17
 
2.6%
16
 
2.4%
15
 
2.3%
15
 
2.3%
12
 
1.8%
Other values (136) 395
60.0%
Common
ValueCountFrequency (%)
) 89
44.9%
( 89
44.9%
[ 6
 
3.0%
] 6
 
3.0%
4
 
2.0%
- 1
 
0.5%
1 1
 
0.5%
9 1
 
0.5%
2 1
 
0.5%
Latin
ValueCountFrequency (%)
T 3
20.0%
S 3
20.0%
X 3
20.0%
G 2
13.3%
E 1
 
6.7%
I 1
 
6.7%
L 1
 
6.7%
N 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 657
75.4%
ASCII 213
 
24.5%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
90
 
13.7%
33
 
5.0%
28
 
4.3%
19
 
2.9%
18
 
2.7%
17
 
2.6%
16
 
2.4%
15
 
2.3%
15
 
2.3%
12
 
1.8%
Other values (135) 394
60.0%
ASCII
ValueCountFrequency (%)
) 89
41.8%
( 89
41.8%
[ 6
 
2.8%
] 6
 
2.8%
4
 
1.9%
T 3
 
1.4%
S 3
 
1.4%
X 3
 
1.4%
G 2
 
0.9%
E 1
 
0.5%
Other values (7) 7
 
3.3%
None
ValueCountFrequency (%)
1
100.0%

최종수정일자
Date

UNIQUE 

Distinct101
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
Minimum2010-04-26 15:36:59
Maximum2024-02-26 15:47:22
2024-04-18T03:13:18.261432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:13:18.358151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
I
67 
U
34 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowU
5th rowI

Common Values

ValueCountFrequency (%)
I 67
66.3%
U 34
33.7%

Length

2024-04-18T03:13:18.447556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:18.515560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 67
66.3%
u 34
33.7%
Distinct34
Distinct (%)33.7%
Missing0
Missing (%)0.0%
Memory size940.0 B
Minimum2019-03-30 02:20:09
Maximum2023-12-02 00:04:00
2024-04-18T03:13:18.583805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:13:18.674788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

업태구분명
Text

MISSING 

Distinct4
Distinct (%)100.0%
Missing97
Missing (%)96.0%
Memory size940.0 B
2024-04-18T03:13:18.796379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length5
Mean length6.75
Min length2

Characters and Unicode

Total characters27
Distinct characters20
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)100.0%

Sample

1st row부동산업
2nd row시설물 축조 관련 전문공사업
3rd row종합 건설업
4th row농업
ValueCountFrequency (%)
부동산업 1
12.5%
시설물 1
12.5%
축조 1
12.5%
관련 1
12.5%
전문공사업 1
12.5%
종합 1
12.5%
건설업 1
12.5%
농업 1
12.5%
2024-04-18T03:13:19.004128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
14.8%
4
 
14.8%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (10) 10
37.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23
85.2%
Space Separator 4
 
14.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
17.4%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (9) 9
39.1%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23
85.2%
Common 4
 
14.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
17.4%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (9) 9
39.1%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23
85.2%
ASCII 4
 
14.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
17.4%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (9) 9
39.1%
ASCII
ValueCountFrequency (%)
4
100.0%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct70
Distinct (%)72.2%
Missing4
Missing (%)4.0%
Infinite0
Infinite (%)0.0%
Mean206404
Minimum201830.49
Maximum415265.49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:19.106646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201830.49
5-th percentile202239.07
Q1202818.08
median203611.97
Q3205106.58
95-th percentile208960.62
Maximum415265.49
Range213435
Interquartile range (IQR)2288.5033

Descriptive statistics

Standard deviation21506.975
Coefficient of variation (CV)0.10419844
Kurtosis95.530374
Mean206404
Median Absolute Deviation (MAD)1007.0099
Skewness9.7386786
Sum20021188
Variance4.6254995 × 108
MonotonicityNot monotonic
2024-04-18T03:13:19.206031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205330.313614472 4
 
4.0%
202741.379034107 4
 
4.0%
202025.925476246 3
 
3.0%
208937.760652081 3
 
3.0%
205677.522476464 3
 
3.0%
203468.913964504 3
 
3.0%
201830.491067264 2
 
2.0%
203591.134030174 2
 
2.0%
202628.580226205 2
 
2.0%
202292.352706149 2
 
2.0%
Other values (60) 69
68.3%
(Missing) 4
 
4.0%
ValueCountFrequency (%)
201830.491067264 2
2.0%
202025.925476246 3
3.0%
202292.352706149 2
2.0%
202399.87424403 1
 
1.0%
202491.418046122 1
 
1.0%
202499.846384414 2
2.0%
202545.777216075 1
 
1.0%
202606.298810258 1
 
1.0%
202628.580226205 2
2.0%
202641.61 2
2.0%
ValueCountFrequency (%)
415265.487045081 1
 
1.0%
209166.05037089 1
 
1.0%
209133.785598961 1
 
1.0%
209052.072465426 2
2.0%
208937.760652081 3
3.0%
207435.42749441 1
 
1.0%
207254.144850327 1
 
1.0%
206970.919291372 2
2.0%
205711.214362101 1
 
1.0%
205707.361024519 1
 
1.0%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct70
Distinct (%)72.2%
Missing4
Missing (%)4.0%
Infinite0
Infinite (%)0.0%
Mean442187.46
Minimum226387.31
Maximum447263.61
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:19.306851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum226387.31
5-th percentile442672.9
Q1443413.45
median444525.36
Q3445497.96
95-th percentile446316.51
Maximum447263.61
Range220876.3
Interquartile range (IQR)2084.51

Descriptive statistics

Standard deviation22174.544
Coefficient of variation (CV)0.050147383
Kurtosis96.367663
Mean442187.46
Median Absolute Deviation (MAD)1095.7874
Skewness-9.8011769
Sum42892184
Variance4.917104 × 108
MonotonicityNot monotonic
2024-04-18T03:13:19.407485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
445725.236017407 4
 
4.0%
443413.449624806 4
 
4.0%
446222.990609811 3
 
3.0%
442873.588039887 3
 
3.0%
444555.644453973 3
 
3.0%
444937.661021993 3
 
3.0%
445497.959632683 2
 
2.0%
445726.743185756 2
 
2.0%
444157.369413239 2
 
2.0%
444285.071455487 2
 
2.0%
Other values (60) 69
68.3%
(Missing) 4
 
4.0%
ValueCountFrequency (%)
226387.310870846 1
 
1.0%
441499.805 1
 
1.0%
441525.151880808 1
 
1.0%
441599.660870096 1
 
1.0%
442550.169073112 1
 
1.0%
442703.584823804 1
 
1.0%
442797.138150563 2
2.0%
442797.361773867 1
 
1.0%
442873.588039887 3
3.0%
442941.362378204 2
2.0%
ValueCountFrequency (%)
447263.606582656 1
 
1.0%
446955.050194808 1
 
1.0%
446683.837587169 1
 
1.0%
446479.893605702 1
 
1.0%
446316.514036122 2
2.0%
446222.990609811 3
3.0%
445937.744254557 1
 
1.0%
445916.661258728 1
 
1.0%
445726.743185756 2
2.0%
445725.236017407 4
4.0%

실험실면적
Categorical

IMBALANCE 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
<NA>
98 
1
 
1
6313
 
1
0
 
1

Length

Max length4
Median length4
Mean length3.9405941
Min length1

Unique

Unique3 ?
Unique (%)3.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 98
97.0%
1 1
 
1.0%
6313 1
 
1.0%
0 1
 
1.0%

Length

2024-04-18T03:13:19.499586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:19.795066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 98
97.0%
1 1
 
1.0%
6313 1
 
1.0%
0 1
 
1.0%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
환경전문공사업
71 
<NA>
30 

Length

Max length7
Median length7
Mean length6.1089109
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환경전문공사업
2nd row환경전문공사업
3rd row환경전문공사업
4th row<NA>
5th row환경전문공사업

Common Values

ValueCountFrequency (%)
환경전문공사업 71
70.3%
<NA> 30
29.7%

Length

2024-04-18T03:13:19.891334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:19.969572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경전문공사업 71
70.3%
na 30
29.7%

영업소면적
Real number (ℝ)

MISSING 

Distinct7
Distinct (%)100.0%
Missing94
Missing (%)93.1%
Infinite0
Infinite (%)0.0%
Mean4209.4286
Minimum0
Maximum27993
Zeros1
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:20.029583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.3
Q119.5
median163
Q3635.5
95-th percentile19862.7
Maximum27993
Range27993
Interquartile range (IQR)616

Descriptive statistics

Standard deviation10492.381
Coefficient of variation (CV)2.4925903
Kurtosis6.9793326
Mean4209.4286
Median Absolute Deviation (MAD)163
Skewness2.6407052
Sum29466
Variance1.1009006 × 108
MonotonicityNot monotonic
2024-04-18T03:13:20.102162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 1
 
1.0%
27993 1
 
1.0%
38 1
 
1.0%
379 1
 
1.0%
892 1
 
1.0%
0 1
 
1.0%
163 1
 
1.0%
(Missing) 94
93.1%
ValueCountFrequency (%)
0 1
1.0%
1 1
1.0%
38 1
1.0%
163 1
1.0%
379 1
1.0%
892 1
1.0%
27993 1
1.0%
ValueCountFrequency (%)
27993 1
1.0%
892 1
1.0%
379 1
1.0%
163 1
1.0%
38 1
1.0%
1 1
1.0%
0 1
1.0%

위탁업체명
Text

MISSING 

Distinct19
Distinct (%)70.4%
Missing74
Missing (%)73.3%
Memory size940.0 B
2024-04-18T03:13:20.228889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length14
Mean length8.6296296
Min length4

Characters and Unicode

Total characters233
Distinct characters64
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)48.1%

Sample

1st row용산구로 이전
2nd row용산구로 이전
3rd row강동구로 이전
4th row강동구로 이전
5th row타구(영등포) 소재지 이전
ValueCountFrequency (%)
이전 7
18.9%
강동구로 3
 
8.1%
용산구로 3
 
8.1%
주)청룡환경 2
 
5.4%
주)혜성환경 2
 
5.4%
청룡환경 2
 
5.4%
유앤아이환경기술(주 2
 
5.4%
영등포 1
 
2.7%
주)산업공해연구소 1
 
2.7%
주)청명기연환경 1
 
2.7%
Other values (13) 13
35.1%
2024-04-18T03:13:20.490569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 18
 
7.7%
) 18
 
7.7%
16
 
6.9%
16
 
6.9%
16
 
6.9%
11
 
4.7%
10
 
4.3%
10
 
4.3%
7
 
3.0%
7
 
3.0%
Other values (54) 104
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 182
78.1%
Open Punctuation 18
 
7.7%
Close Punctuation 18
 
7.7%
Space Separator 10
 
4.3%
Uppercase Letter 4
 
1.7%
Decimal Number 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
8.8%
16
 
8.8%
16
 
8.8%
11
 
6.0%
10
 
5.5%
7
 
3.8%
7
 
3.8%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (47) 84
46.2%
Uppercase Letter
ValueCountFrequency (%)
I 2
50.0%
T 1
25.0%
F 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 182
78.1%
Common 47
 
20.2%
Latin 4
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
8.8%
16
 
8.8%
16
 
8.8%
11
 
6.0%
10
 
5.5%
7
 
3.8%
7
 
3.8%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (47) 84
46.2%
Common
ValueCountFrequency (%)
( 18
38.3%
) 18
38.3%
10
21.3%
2 1
 
2.1%
Latin
ValueCountFrequency (%)
I 2
50.0%
T 1
25.0%
F 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 182
78.1%
ASCII 51
 
21.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 18
35.3%
) 18
35.3%
10
19.6%
I 2
 
3.9%
T 1
 
2.0%
2 1
 
2.0%
F 1
 
2.0%
Hangul
ValueCountFrequency (%)
16
 
8.8%
16
 
8.8%
16
 
8.8%
11
 
6.0%
10
 
5.5%
7
 
3.8%
7
 
3.8%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (47) 84
46.2%

실험실지역코드
Real number (ℝ)

MISSING 

Distinct18
Distinct (%)47.4%
Missing63
Missing (%)62.4%
Infinite0
Infinite (%)0.0%
Mean1.6718026 × 109
Minimum1.1230105 × 109
Maximum4.1273101 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:20.582363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1230105 × 109
5-th percentile1.1485102 × 109
Q11.1548858 × 109
median1.1680102 × 109
Q31.1710105 × 109
95-th percentile4.1210101 × 109
Maximum4.1273101 × 109
Range3.0042996 × 109
Interquartile range (IQR)16124750

Descriptive statistics

Standard deviation1.1063863 × 109
Coefficient of variation (CV)0.66179245
Kurtosis1.3773159
Mean1.6718026 × 109
Median Absolute Deviation (MAD)3000300
Skewness1.7941078
Sum6.3528499 × 1010
Variance1.2240907 × 1018
MonotonicityNot monotonic
2024-04-18T03:13:20.669018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1168010100 6
 
5.9%
1154510100 5
 
5.0%
1171010500 5
 
5.0%
1153010200 3
 
3.0%
1168010500 3
 
3.0%
1168010300 2
 
2.0%
1123010500 2
 
2.0%
1165010800 2
 
2.0%
1168011500 1
 
1.0%
4121010200 1
 
1.0%
Other values (8) 8
 
7.9%
(Missing) 63
62.4%
ValueCountFrequency (%)
1123010500 2
 
2.0%
1153010200 3
3.0%
1154510100 5
5.0%
1156012700 1
 
1.0%
1165010800 2
 
2.0%
1168010100 6
5.9%
1168010300 2
 
2.0%
1168010500 3
3.0%
1168011400 1
 
1.0%
1168011500 1
 
1.0%
ValueCountFrequency (%)
4127310100 1
 
1.0%
4121010200 1
 
1.0%
4121010100 1
 
1.0%
4113310500 1
 
1.0%
4111710200 1
 
1.0%
4111312800 1
 
1.0%
2820010300 1
 
1.0%
1171010500 5
5.0%
1168011500 1
 
1.0%
1168011400 1
 
1.0%

실험실우편번호
Real number (ℝ)

MISSING 

Distinct22
Distinct (%)61.1%
Missing65
Missing (%)64.4%
Infinite0
Infinite (%)0.0%
Mean188971.69
Minimum130805
Maximum462807
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:20.753132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum130805
5-th percentile134011.25
Q1135881
median138190
Q3153782
95-th percentile429720.75
Maximum462807
Range332002
Interquartile range (IQR)17901

Descriptive statistics

Standard deviation110104.3
Coefficient of variation (CV)0.58264967
Kurtosis1.6553039
Mean188971.69
Median Absolute Deviation (MAD)3029
Skewness1.8694462
Sum6802981
Variance1.2122956 × 1010
MonotonicityNot monotonic
2024-04-18T03:13:20.833862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
153782 5
 
5.0%
152766 3
 
3.0%
138190 3
 
3.0%
135881 3
 
3.0%
137070 2
 
2.0%
138845 2
 
2.0%
135936 2
 
2.0%
130805 2
 
2.0%
425020 1
 
1.0%
135943 1
 
1.0%
Other values (12) 12
 
11.9%
(Missing) 65
64.4%
ValueCountFrequency (%)
130805 2
2.0%
135080 1
 
1.0%
135081 1
 
1.0%
135241 1
 
1.0%
135569 1
 
1.0%
135744 1
 
1.0%
135881 3
3.0%
135913 1
 
1.0%
135936 2
2.0%
135939 1
 
1.0%
ValueCountFrequency (%)
462807 1
 
1.0%
443823 1
 
1.0%
425020 1
 
1.0%
423812 1
 
1.0%
423030 1
 
1.0%
405246 1
 
1.0%
153782 5
5.0%
152766 3
3.0%
138845 2
 
2.0%
138190 3
3.0%

실험실산
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
<NA>
63 
1
38 

Length

Max length4
Median length4
Mean length2.8712871
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row<NA>
5th row1

Common Values

ValueCountFrequency (%)
<NA> 63
62.4%
1 38
37.6%

Length

2024-04-18T03:13:20.927878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:21.000558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 63
62.4%
1 38
37.6%

실험실번지
Real number (ℝ)

MISSING 

Distinct24
Distinct (%)66.7%
Missing65
Missing (%)64.4%
Infinite0
Infinite (%)0.0%
Mean519.02778
Minimum77
Maximum1364
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:21.073074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum77
5-th percentile159
Q1232.75
median351.5
Q3724.75
95-th percentile1272.5
Maximum1364
Range1287
Interquartile range (IQR)492

Descriptive statistics

Standard deviation372.69102
Coefficient of variation (CV)0.71805603
Kurtosis0.0096117906
Mean519.02778
Median Absolute Deviation (MAD)191.5
Skewness1.0088188
Sum18685
Variance138898.6
MonotonicityNot monotonic
2024-04-18T03:13:21.163278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
345 4
 
4.0%
235 3
 
3.0%
197 3
 
3.0%
160 3
 
3.0%
832 2
 
2.0%
497 2
 
2.0%
1364 2
 
2.0%
1242 1
 
1.0%
288 1
 
1.0%
77 1
 
1.0%
Other values (14) 14
 
13.9%
(Missing) 65
64.4%
ValueCountFrequency (%)
77 1
 
1.0%
156 1
 
1.0%
160 3
3.0%
197 3
3.0%
226 1
 
1.0%
235 3
3.0%
249 1
 
1.0%
288 1
 
1.0%
345 4
4.0%
358 1
 
1.0%
ValueCountFrequency (%)
1364 2
2.0%
1242 1
1.0%
1230 1
1.0%
1094 1
1.0%
834 1
1.0%
832 2
2.0%
727 1
1.0%
724 1
1.0%
677 1
1.0%
662 1
1.0%

실험실호
Real number (ℝ)

MISSING 

Distinct18
Distinct (%)64.3%
Missing73
Missing (%)72.3%
Infinite0
Infinite (%)0.0%
Mean27
Minimum1
Maximum80
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:21.250515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.7
Q114.75
median22
Q335.5
95-th percentile66
Maximum80
Range79
Interquartile range (IQR)20.75

Descriptive statistics

Standard deviation19.684549
Coefficient of variation (CV)0.72905738
Kurtosis1.0546056
Mean27
Median Absolute Deviation (MAD)11.5
Skewness1.0637674
Sum756
Variance387.48148
MonotonicityNot monotonic
2024-04-18T03:13:21.339146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
30 4
 
4.0%
18 3
 
3.0%
22 3
 
3.0%
40 2
 
2.0%
66 2
 
2.0%
42 2
 
2.0%
17 1
 
1.0%
34 1
 
1.0%
1 1
 
1.0%
4 1
 
1.0%
Other values (8) 8
 
7.9%
(Missing) 73
72.3%
ValueCountFrequency (%)
1 1
 
1.0%
2 1
 
1.0%
4 1
 
1.0%
5 1
 
1.0%
9 1
 
1.0%
10 1
 
1.0%
11 1
 
1.0%
16 1
 
1.0%
17 1
 
1.0%
18 3
3.0%
ValueCountFrequency (%)
80 1
 
1.0%
66 2
2.0%
42 2
2.0%
40 2
2.0%
34 1
 
1.0%
31 1
 
1.0%
30 4
4.0%
22 3
3.0%
18 3
3.0%
17 1
 
1.0%

실험실통
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

실험실반
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

실험실특수주소
Text

MISSING 

Distinct14
Distinct (%)60.9%
Missing78
Missing (%)77.2%
Memory size940.0 B
2024-04-18T03:13:21.491765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length10.086957
Min length2

Characters and Unicode

Total characters232
Distinct characters68
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)39.1%

Sample

1st row환경보전협회
2nd row환경보전협회
3rd row삼성엔지니어링(주) 기술연구소
4th row로즈데일오피스텔
5th row남성프라자(에이스9차) 10층
ValueCountFrequency (%)
남성프라자(에이스9차 5
12.2%
10층 5
12.2%
5층 4
 
9.8%
에이스테크노5차 3
 
7.3%
209호 3
 
7.3%
세광빌딩 3
 
7.3%
환경보전협회 2
 
4.9%
포스코빌딩 2
 
4.9%
401 2
 
4.9%
무영빌딩 1
 
2.4%
Other values (11) 11
26.8%
2024-04-18T03:13:21.763544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
8.2%
12
 
5.2%
11
 
4.7%
0 11
 
4.7%
9
 
3.9%
9
 
3.9%
8
 
3.4%
8
 
3.4%
9 8
 
3.4%
8
 
3.4%
Other values (58) 129
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 156
67.2%
Decimal Number 44
 
19.0%
Space Separator 19
 
8.2%
Open Punctuation 6
 
2.6%
Close Punctuation 6
 
2.6%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.7%
11
 
7.1%
9
 
5.8%
9
 
5.8%
8
 
5.1%
8
 
5.1%
8
 
5.1%
6
 
3.8%
5
 
3.2%
5
 
3.2%
Other values (47) 75
48.1%
Decimal Number
ValueCountFrequency (%)
0 11
25.0%
9 8
18.2%
5 7
15.9%
1 7
15.9%
4 5
11.4%
2 4
 
9.1%
3 2
 
4.5%
Space Separator
ValueCountFrequency (%)
19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 156
67.2%
Common 76
32.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
7.7%
11
 
7.1%
9
 
5.8%
9
 
5.8%
8
 
5.1%
8
 
5.1%
8
 
5.1%
6
 
3.8%
5
 
3.2%
5
 
3.2%
Other values (47) 75
48.1%
Common
ValueCountFrequency (%)
19
25.0%
0 11
14.5%
9 8
10.5%
5 7
 
9.2%
1 7
 
9.2%
( 6
 
7.9%
) 6
 
7.9%
4 5
 
6.6%
2 4
 
5.3%
3 2
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 156
67.2%
ASCII 76
32.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19
25.0%
0 11
14.5%
9 8
10.5%
5 7
 
9.2%
1 7
 
9.2%
( 6
 
7.9%
) 6
 
7.9%
4 5
 
6.6%
2 4
 
5.3%
3 2
 
2.6%
Hangul
ValueCountFrequency (%)
12
 
7.7%
11
 
7.1%
9
 
5.8%
9
 
5.8%
8
 
5.1%
8
 
5.1%
8
 
5.1%
6
 
3.8%
5
 
3.2%
5
 
3.2%
Other values (47) 75
48.1%

실험실특수주소동
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

실험실특수주소호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

실험실도로명주소시군구코드
Real number (ℝ)

MISSING 

Distinct13
Distinct (%)35.1%
Missing64
Missing (%)63.4%
Infinite0
Infinite (%)0.0%
Mean16857.73
Minimum11230
Maximum41273
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:21.852052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11230
5-th percentile11470
Q111560
median11680
Q311710
95-th percentile41210
Maximum41273
Range30043
Interquartile range (IQR)150

Descriptive statistics

Standard deviation11182.393
Coefficient of variation (CV)0.6633392
Kurtosis1.2196804
Mean16857.73
Median Absolute Deviation (MAD)30
Skewness1.751867
Sum623736
Variance1.2504591 × 108
MonotonicityNot monotonic
2024-04-18T03:13:21.932537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
11680 13
 
12.9%
11710 5
 
5.0%
11545 4
 
4.0%
11530 3
 
3.0%
11230 2
 
2.0%
41210 2
 
2.0%
11650 2
 
2.0%
41117 1
 
1.0%
41273 1
 
1.0%
28200 1
 
1.0%
Other values (3) 3
 
3.0%
(Missing) 64
63.4%
ValueCountFrequency (%)
11230 2
 
2.0%
11530 3
 
3.0%
11545 4
 
4.0%
11560 1
 
1.0%
11650 2
 
2.0%
11680 13
12.9%
11710 5
 
5.0%
28200 1
 
1.0%
41113 1
 
1.0%
41117 1
 
1.0%
ValueCountFrequency (%)
41273 1
 
1.0%
41210 2
 
2.0%
41133 1
 
1.0%
41117 1
 
1.0%
41113 1
 
1.0%
28200 1
 
1.0%
11710 5
 
5.0%
11680 13
12.9%
11650 2
 
2.0%
11560 1
 
1.0%

실험실도로명주소읍면동코드
Real number (ℝ)

MISSING 

Distinct18
Distinct (%)48.6%
Missing64
Missing (%)63.4%
Infinite0
Infinite (%)0.0%
Mean1.6857835 × 109
Minimum1.1230105 × 109
Maximum4.1273101 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:22.015408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1230105 × 109
5-th percentile1.1470103 × 109
Q11.1560127 × 109
median1.1680103 × 109
Q31.1710105 × 109
95-th percentile4.1210101 × 109
Maximum4.1273101 × 109
Range3.0042996 × 109
Interquartile range (IQR)14997800

Descriptive statistics

Standard deviation1.1182394 × 109
Coefficient of variation (CV)0.6633351
Kurtosis1.2196805
Mean1.6857835 × 109
Median Absolute Deviation (MAD)3000200
Skewness1.751867
Sum6.2373989 × 1010
Variance1.2504593 × 1018
MonotonicityNot monotonic
2024-04-18T03:13:22.103206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1168010100 6
 
5.9%
1171010500 5
 
5.0%
1154510100 4
 
4.0%
1153010200 3
 
3.0%
1168010500 3
 
3.0%
1168010300 2
 
2.0%
1123010500 2
 
2.0%
1165010800 2
 
2.0%
1168011500 1
 
1.0%
4121010200 1
 
1.0%
Other values (8) 8
 
7.9%
(Missing) 64
63.4%
ValueCountFrequency (%)
1123010500 2
 
2.0%
1153010200 3
3.0%
1154510100 4
4.0%
1156012700 1
 
1.0%
1165010800 2
 
2.0%
1168010100 6
5.9%
1168010300 2
 
2.0%
1168010500 3
3.0%
1168011400 1
 
1.0%
1168011500 1
 
1.0%
ValueCountFrequency (%)
4127310100 1
 
1.0%
4121010200 1
 
1.0%
4121010100 1
 
1.0%
4113310500 1
 
1.0%
4111710200 1
 
1.0%
4111312800 1
 
1.0%
2820010300 1
 
1.0%
1171010500 5
5.0%
1168011500 1
 
1.0%
1168011400 1
 
1.0%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
<NA>
64 
1
37 

Length

Max length4
Median length4
Mean length2.9009901
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row<NA>
5th row1

Common Values

ValueCountFrequency (%)
<NA> 64
63.4%
1 37
36.6%

Length

2024-04-18T03:13:22.201573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:22.285922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 64
63.4%
1 37
36.6%

실험실도로명주소코드
Real number (ℝ)

MISSING 

Distinct25
Distinct (%)67.6%
Missing64
Missing (%)63.4%
Infinite0
Infinite (%)0.0%
Mean3520053.5
Minimum2000008
Maximum4361034
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:22.358746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000008
5-th percentile2004008
Q13000026
median4148335
Q34166670
95-th percentile4332283.6
Maximum4361034
Range2361026
Interquartile range (IQR)1166644

Descriptive statistics

Standard deviation820874.69
Coefficient of variation (CV)0.23319949
Kurtosis-0.92330162
Mean3520053.5
Median Absolute Deviation (MAD)188871
Skewness-0.70357056
Sum1.3024198 × 108
Variance6.7383526 × 1011
MonotonicityNot monotonic
2024-04-18T03:13:22.446703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
3000026 4
 
4.0%
4169186 3
 
3.0%
4148335 3
 
3.0%
2122002 3
 
3.0%
4166649 2
 
2.0%
4163084 2
 
2.0%
2000008 2
 
2.0%
4166051 1
 
1.0%
3123001 1
 
1.0%
2005008 1
 
1.0%
Other values (15) 15
 
14.9%
(Missing) 64
63.4%
ValueCountFrequency (%)
2000008 2
2.0%
2005008 1
 
1.0%
2122002 3
3.0%
3000026 4
4.0%
3005086 1
 
1.0%
3122001 1
 
1.0%
3122002 1
 
1.0%
3123001 1
 
1.0%
3153019 1
 
1.0%
3187055 1
 
1.0%
ValueCountFrequency (%)
4361034 1
 
1.0%
4337206 1
 
1.0%
4331053 1
 
1.0%
4325389 1
 
1.0%
4169244 1
 
1.0%
4169186 3
3.0%
4166736 1
 
1.0%
4166670 1
 
1.0%
4166649 2
2.0%
4166158 1
 
1.0%
Distinct30
Distinct (%)81.1%
Missing64
Missing (%)63.4%
Memory size940.0 B
2024-04-18T03:13:22.612032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length12.540541
Min length5

Characters and Unicode

Total characters464
Distinct characters96
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)64.9%

Sample

1st row (삼성동)
2nd row (삼성동)
3rd row (삼성동)
4th row(역삼동)
5th row(역삼동)
ValueCountFrequency (%)
5층 4
 
7.1%
가산동,남성프라자(에이스9차 4
 
7.1%
10층 4
 
7.1%
삼성동 3
 
5.4%
석촌동,세광빌딩 3
 
5.4%
구로동,에이스테크노5차 3
 
5.4%
209호 3
 
5.4%
역삼동 3
 
5.4%
답십리동,환경보전협회 2
 
3.6%
석촌동 2
 
3.6%
Other values (23) 25
44.6%
2024-04-18T03:13:22.897175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43
 
9.3%
( 42
 
9.1%
) 42
 
9.1%
37
 
8.0%
, 24
 
5.2%
11
 
2.4%
0 10
 
2.2%
10
 
2.2%
10
 
2.2%
9
 
1.9%
Other values (86) 226
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 268
57.8%
Decimal Number 45
 
9.7%
Space Separator 43
 
9.3%
Open Punctuation 42
 
9.1%
Close Punctuation 42
 
9.1%
Other Punctuation 24
 
5.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
13.8%
11
 
4.1%
10
 
3.7%
10
 
3.7%
9
 
3.4%
9
 
3.4%
8
 
3.0%
7
 
2.6%
7
 
2.6%
7
 
2.6%
Other values (74) 153
57.1%
Decimal Number
ValueCountFrequency (%)
0 10
22.2%
5 7
15.6%
1 7
15.6%
9 7
15.6%
4 5
11.1%
2 4
 
8.9%
3 4
 
8.9%
6 1
 
2.2%
Space Separator
ValueCountFrequency (%)
43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Other Punctuation
ValueCountFrequency (%)
, 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 268
57.8%
Common 196
42.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
13.8%
11
 
4.1%
10
 
3.7%
10
 
3.7%
9
 
3.4%
9
 
3.4%
8
 
3.0%
7
 
2.6%
7
 
2.6%
7
 
2.6%
Other values (74) 153
57.1%
Common
ValueCountFrequency (%)
43
21.9%
( 42
21.4%
) 42
21.4%
, 24
12.2%
0 10
 
5.1%
5 7
 
3.6%
1 7
 
3.6%
9 7
 
3.6%
4 5
 
2.6%
2 4
 
2.0%
Other values (2) 5
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 268
57.8%
ASCII 196
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43
21.9%
( 42
21.4%
) 42
21.4%
, 24
12.2%
0 10
 
5.1%
5 7
 
3.6%
1 7
 
3.6%
9 7
 
3.6%
4 5
 
2.6%
2 4
 
2.0%
Other values (2) 5
 
2.6%
Hangul
ValueCountFrequency (%)
37
 
13.8%
11
 
4.1%
10
 
3.7%
10
 
3.7%
9
 
3.4%
9
 
3.4%
8
 
3.0%
7
 
2.6%
7
 
2.6%
7
 
2.6%
Other values (74) 153
57.1%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
<NA>
64 
0
37 

Length

Max length4
Median length4
Mean length2.9009901
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 64
63.4%
0 37
36.6%

Length

2024-04-18T03:13:23.003433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:23.078658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 64
63.4%
0 37
36.6%

실험실도로명주소건물본번호
Real number (ℝ)

MISSING 

Distinct21
Distinct (%)56.8%
Missing64
Missing (%)63.4%
Infinite0
Infinite (%)0.0%
Mean140.81081
Minimum4
Maximum625
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-18T03:13:23.144828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile8
Q112
median40
Q3130
95-th percentile531.4
Maximum625
Range621
Interquartile range (IQR)118

Descriptive statistics

Standard deviation193.66865
Coefficient of variation (CV)1.375382
Kurtosis0.72182624
Mean140.81081
Median Absolute Deviation (MAD)29
Skewness1.4797695
Sum5210
Variance37507.547
MonotonicityNot monotonic
2024-04-18T03:13:23.230850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
130 4
 
4.0%
11 4
 
4.0%
12 3
 
3.0%
20 3
 
3.0%
520 3
 
3.0%
23 3
 
3.0%
8 2
 
2.0%
259 2
 
2.0%
280 1
 
1.0%
32 1
 
1.0%
Other values (11) 11
 
10.9%
(Missing) 64
63.4%
ValueCountFrequency (%)
4 1
 
1.0%
8 2
2.0%
11 4
4.0%
12 3
3.0%
15 1
 
1.0%
20 3
3.0%
23 3
3.0%
32 1
 
1.0%
40 1
 
1.0%
41 1
 
1.0%
ValueCountFrequency (%)
625 1
 
1.0%
541 1
 
1.0%
529 1
 
1.0%
520 3
3.0%
280 1
 
1.0%
259 2
2.0%
130 4
4.0%
99 1
 
1.0%
72 1
 
1.0%
61 1
 
1.0%
Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
<NA>
98 
1
 
2
11
 
1

Length

Max length4
Median length4
Mean length3.9207921
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 98
97.0%
1 2
 
2.0%
11 1
 
1.0%

Length

2024-04-18T03:13:23.325771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:13:23.400570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 98
97.0%
1 2
 
2.0%
11 1
 
1.0%

실험실도로명주소우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing101
Missing (%)100.0%
Memory size1.0 KiB

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)실험실면적사업장구분명영업소면적위탁업체명실험실지역코드실험실우편번호실험실산실험실번지실험실호실험실통실험실반실험실특수주소실험실특수주소동실험실특수주소호실험실도로명주소시군구코드실험실도로명주소읍면동코드실험실도로명주소읍면동구분실험실도로명주소코드실험실도로명특수주소실험실도로명주소건물층구분실험실도로명주소건물본번호실험실도로명주소건물부번호실험실도로명주소우편번호
0322000032200006719810002619810930<NA>3폐업2폐업20120105<NA><NA><NA>20089812<NA><NA>서울특별시 강남구 삼성동 160서울특별시 강남구 영동대로 520 (삼성동)<NA>현대산업개발(주)[소음진동]2012-06-21 13:28:56I2019-03-30 02:20:09.0<NA>205330.313614445725.236017<NA>환경전문공사업<NA>용산구로 이전11680105001358811160<NA><NA><NA><NA><NA><NA>11680116801050012122002(삼성동)0520<NA><NA>
1322000032200006719810003219810930<NA>3폐업2폐업20120105<NA><NA><NA>20089812<NA><NA>서울특별시 강남구 삼성동 160서울특별시 강남구 영동대로 520 (삼성동)<NA>현대산업개발(주)[수질]2012-06-21 13:29:37I2019-03-30 02:20:09.0<NA>205330.313614445725.236017<NA>환경전문공사업<NA><NA>11680105001358811160<NA><NA><NA><NA><NA><NA>11680116801050012122002(삼성동)0520<NA><NA>
2322000032200006719810004019810930<NA>3폐업2폐업20120105<NA><NA><NA>20089812<NA><NA>서울특별시 강남구 삼성동 160서울특별시 강남구 영동대로 520 (삼성동)<NA>현대산업개발(주)[대기]2012-06-21 13:30:18I2019-03-30 02:20:09.0<NA>205330.313614445725.236017<NA>환경전문공사업<NA>용산구로 이전11680105001358811160<NA><NA><NA><NA><NA><NA>11680116801050012122002(삼성동)0520<NA><NA>
332200003220000671982000011982-02-13<NA>1영업/정상BBBB영업<NA><NA><NA><NA>027402434<NA><NA>서울특별시 강남구 역삼동 662-7서울특별시 강남구 언주로 547, 13층 (역삼동)06138삼환기업(주)2023-03-06 09:41:41U2022-12-03 00:08:00.0<NA>203468.913965444937.661022<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
4322000032200006719920028819920428<NA>1영업/정상BBBB영업<NA><NA><NA><NA>62020901<NA><NA>서울특별시 강남구 역삼동 832-40<NA><NA>(주)유신2018-07-26 18:00:28I2019-03-30 02:20:09.0<NA>202741.379034443413.449625<NA>환경전문공사업1<NA>1168010100135936183240<NA><NA><NA><NA><NA>11680116801010014166649(역삼동)08<NA><NA>
5322000032200006719970046319970429<NA>3폐업2폐업20141028<NA><NA><NA>5105691<NA><NA>서울특별시 강남구 신사동 538서울특별시 강남구 도산대로 139 (신사동)<NA>(주)이테크건설[대기]2014-10-29 09:20:07I2019-03-30 02:20:09.0<NA>202025.925476446222.99061<NA>환경전문공사업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
6322000032200006720020021020020326<NA>1영업/정상BBBB영업<NA><NA><NA><NA>62020901<NA><NA>서울특별시 강남구 역삼동 832-40<NA><NA>(주)유신 [소음진동]2019-11-11 13:40:06U2019-11-13 02:40:00.0<NA>202741.379034443413.4496251환경전문공사업<NA><NA>1168010100135936183240<NA><NA><NA><NA><NA>11680116801010014166649(역삼동)08<NA><NA>
732200003220000672005007722005-09-14<NA>1영업/정상BBBB영업<NA><NA><NA><NA>6323-3000<NA><NA>서울특별시 강남구 대치동 942-1서울특별시 강남구 삼성로 438 (대치동)<NA>(주)도화엔지니어링2023-04-13 11:30:22U2022-12-03 23:05:00.0부동산업205010.852416444887.348385<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
832200003220000672006009862006-07-25<NA>5제외/삭제/전출5제외사항<NA><NA><NA><NA>5533850<NA><NA>서울특별시 강남구 개포동 13-3 대청타워 2031호서울특별시 강남구 개포로 623, 2031호 (개포동,대청타워)<NA>(주)고도엔바2023-06-08 15:21:58U2022-12-05 23:00:00.0<NA>206970.919291443574.411626<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9322000032200006720090000120090121<NA>4취소/말소/만료/정지/중지4폐쇄20120727<NA><NA><NA>02-3147-1593<NA><NA>서울특별시 강남구 역삼동 662-9 에프앤에프빌딩 3층서울특별시 강남구 언주로 541 (역삼동,에프앤에프빌딩 3층)<NA>베올리아워터솔루션스앤테크놀로지스코리아주식회사2012-07-31 13:49:09I2019-03-30 02:20:09.0<NA>203496.053652444902.934533<NA>환경전문공사업<NA><NA>116801010013591316629<NA><NA><NA><NA><NA>11680116801010013005086(역삼동)0541<NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)실험실면적사업장구분명영업소면적위탁업체명실험실지역코드실험실우편번호실험실산실험실번지실험실호실험실통실험실반실험실특수주소실험실특수주소동실험실특수주소호실험실도로명주소시군구코드실험실도로명주소읍면동코드실험실도로명주소읍면동구분실험실도로명주소코드실험실도로명특수주소실험실도로명주소건물층구분실험실도로명주소건물본번호실험실도로명주소건물부번호실험실도로명주소우편번호
9132200003220000672021000012021-02-22<NA>3폐업2폐업2024-01-12<NA><NA><NA>2186-6276<NA><NA>서울특별시 강남구 역삼동 643-11 금화빌딩서울특별시 강남구 테헤란로25길 15-4, 금화빌딩 (역삼동)06131(주)금화피에스시2024-01-16 10:03:17U2023-11-30 23:09:00.0<NA>203026.275697444411.841874<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9232200003220000672021000022021-06-01<NA>3폐업2폐업2023-05-22<NA><NA><NA>572-1345<NA><NA>서울특별시 강남구 개포동 1228 삼우빌딩서울특별시 강남구 논현로 64, 삼우빌딩 6층 (개포동)06312보국에너텍(주)2023-05-22 15:10:48U2022-12-04 22:04:00.0<NA>204077.082955441525.151881<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9332200003220000672021000032021-12-15<NA>1영업/정상BBBB영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 50-1 논현빌딩서울특별시 강남구 강남대로 556, 논현빌딩 6층 (논현동)06044한성크렌텍주식회사2023-12-08 17:19:04U2022-11-01 23:00:00.0<NA>201830.491067445497.959633<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9432200003220000672022000012022-01-13<NA>5제외/삭제/전출5제외사항<NA><NA><NA><NA>02-6420-3821<NA><NA>서울특별시 강남구 역삼동 662-7서울특별시 강남구 언주로 547 (역삼동)06138동아건설산업(주)2023-10-20 16:42:12U2022-10-30 22:02:00.0<NA>203468.913965444937.661022<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9532200003220000672022000022022-02-03<NA>5제외/삭제/전출5제외사항<NA><NA><NA><NA>558-1001<NA><NA>서울특별시 강남구 도곡동 467-19 현대비전21서울특별시 강남구 언주로30길 10, 현대비전21 522호 (도곡동)06295(주)천일2023-10-23 09:27:04U2022-10-30 22:05:00.0<NA>204469.514468442797.361774<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9632200003220000672022000032022-06-03<NA>1영업/정상BBBB영업<NA><NA><NA><NA>539-7700<NA><NA>서울특별시 강남구 논현동 50-1 논현빌딩서울특별시 강남구 강남대로 556, 논현빌딩 5층 (논현동)06044대양엔바이오(주)2023-08-03 09:05:40U2022-12-08 00:05:00.0<NA>201830.491067445497.959633<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9732200003220000672023000012023-09-11<NA>1영업/정상BBBB영업<NA><NA><NA><NA>02-6420-3821<NA><NA>서울특별시 강남구 역삼동 662-7서울특별시 강남구 언주로 547 (역삼동)06138동아건설산업(주)2023-09-11 17:30:47I2022-12-08 23:03:00.0<NA>203468.913965444937.661022<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9832200003220000672024000012024-01-15<NA>3폐업2폐업2024-02-26<NA><NA><NA><NA><NA><NA>서울특별시 강남구 역삼동 840-5 로얄빌딩서울특별시 강남구 논현로63길 63, 로얄빌딩 (역삼동)06255(주)리트코2024-02-26 10:02:08U2023-12-01 22:08:00.0<NA>203103.509174443236.368338<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9932200003220000672024000022024-02-26<NA>1영업/정상BBBB영업<NA><NA><NA><NA>0522699825<NA><NA>서울특별시 강남구 청담동 133-3 화천회관빌딩서울특별시 강남구 영동대로 702, 화천회관빌딩 501호 (청담동)06075(주)에팩2024-02-26 10:24:00I2023-12-01 22:08:00.0<NA>205026.123358446479.893606<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
10032200003220000672024000032024-02-26<NA>1영업/정상BBBB영업<NA><NA><NA><NA>0522699825<NA><NA>울산광역시 남구 매암동 360 (주)삼양사울산광역시 남구 장생포로 285 (주)삼양사 (매암동)44778(주)에팩2024-02-26 15:47:22I2023-12-01 22:08:00.0농업415265.487045226387.310871<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>