Overview

Dataset statistics

Number of variables: 9
Number of observations: 1975
Missing cells: 776
Missing cells (%): 4.4%
Duplicate rows: 2
Duplicate rows (%): 0.1%
Total size in memory: 140.9 KiB
Average record size in memory: 73.1 B
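
The percentages and the average record size follow from the raw counts. The sketch below recomputes them from the figures listed in this overview; it assumes "Missing cells (%)" is taken over all cells (variables × observations), which is consistent with the reported 4.4%.

```python
# Figures copied from the dataset statistics in this overview.
n_variables = 9
n_observations = 1975
missing_cells = 776
duplicate_rows = 2
total_size_kib = 140.9

# Assumption: the missing-cell share is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```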

Variable types

Text: 5
Categorical: 3
Numeric: 1

Dataset

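Description: Data showing the registration status of tree hospitals by year under the Forest Protection Act. Fields include the tree hospital's corporate name, representative's name, business type, address, operating/closed status, and registering province (시도), among others.
Author: Korea Forest Service (산림청)
URL: https://www.data.go.kr/data/15091330/fileData.do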

Alerts

Dataset has 2 (0.1%) duplicate rows [Duplicates]
우편번호 is highly overall correlated with 시도 [High correlation]
시도 is highly overall correlated with 우편번호 [High correlation]
영 폐업 여부 is highly imbalanced (63.5%) [Imbalance]
상세주소 has 776 (39.3%) missing values [Missing]
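
The 63.5% imbalance figure can be reproduced from the class counts reported for 영 폐업 여부 (1772, 121 and 82). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value; that assumption matches the reported number, but the function name is hypothetical and this is not presented as the profiler's own code.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Class counts for 영 폐업 여부 as listed in the report: 영업, 폐업, <NA>.
score = imbalance_score([1772, 121, 82])
print(f"{score:.1%}")
```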

Reproduction

Analysis started: 2023-12-12 07:41:42.214530
Analysis finished: 2023-12-12 07:41:43.733192
Duration: 1.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
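
As a quick check, the duration can be derived from the two timestamps above. This is a minimal sketch using only the standard library; the format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the Reproduction section.
FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 07:41:42.214530", FORMAT)
finished = datetime.strptime("2023-12-12 07:41:43.733192", FORMAT)

# Elapsed wall-clock time between start and finish, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```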

Variables

법인명
Text

Distinct: 1899
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB

Length

Max length: 19
Median length: 18
Mean length: 8.5478481
Min length: 3

Characters and Unicode

Total characters: 16882
Distinct characters: 402
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
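
A minimal sketch of how such per-category character counts can be obtained with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the two input strings are sample values of this variable. `unicodedata.category` returns two-letter codes: `Lo` corresponds to "Other Letter", `Zs` to "Space Separator", `Ps` and `Pe` to "Open Punctuation" and "Close Punctuation".

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two sample values of this variable, used only for illustration.
sample = ["주식회사 대동나무병원", "주식회사 목원"]
counts = category_counts(sample)
print(counts.most_common())
```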

Unique

Unique: 1828
Unique (%): 92.6%

Sample

1st row: 주식회사 대동나무병원
2nd row: 주식회사 나무병원건강한숲
3rd row: 주식회사 강원나무병원
4th row: 호반방역청소용역 주식회사
5th row: 주식회사 목원
ValueCountFrequency (%)
주식회사 691
 
25.1%
유한회사 31
 
1.1%
산림조합 18
 
0.7%
14
 
0.5%
농업회사법인 6
 
0.2%
합자회사 5
 
0.2%
방지거조경(주 3
 
0.1%
나무병원 3
 
0.1%
우진조경(주 3
 
0.1%
상원조경 3
 
0.1%
Other values (1895) 1981
71.8%

Most occurring characters

ValueCountFrequency (%)
1735
 
10.3%
) 1055
 
6.2%
( 1044
 
6.2%
864
 
5.1%
811
 
4.8%
790
 
4.7%
783
 
4.6%
763
 
4.5%
717
 
4.2%
365
 
2.2%
Other values (392) 7955
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13891
82.3%
Close Punctuation 1055
 
6.2%
Open Punctuation 1044
 
6.2%
Space Separator 783
 
4.6%
Other Symbol 97
 
0.6%
Uppercase Letter 5
 
< 0.1%
Decimal Number 5
 
< 0.1%
Other Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1735
 
12.5%
864
 
6.2%
811
 
5.8%
790
 
5.7%
763
 
5.5%
717
 
5.2%
365
 
2.6%
263
 
1.9%
258
 
1.9%
251
 
1.8%
Other values (380) 7074
50.9%
Uppercase Letter
ValueCountFrequency (%)
C 3
60.0%
O 1
 
20.0%
E 1
 
20.0%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
2 1
 
20.0%
9 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 1055
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1044
100.0%
Space Separator
ValueCountFrequency (%)
783
100.0%
Other Symbol
ValueCountFrequency (%)
97
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13988
82.9%
Common 2889
 
17.1%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1735
 
12.4%
864
 
6.2%
811
 
5.8%
790
 
5.6%
763
 
5.5%
717
 
5.1%
365
 
2.6%
263
 
1.9%
258
 
1.8%
251
 
1.8%
Other values (381) 7171
51.3%
Common
ValueCountFrequency (%)
) 1055
36.5%
( 1044
36.1%
783
27.1%
1 3
 
0.1%
2 1
 
< 0.1%
9 1
 
< 0.1%
. 1
 
< 0.1%
- 1
 
< 0.1%
Latin
ValueCountFrequency (%)
C 3
60.0%
O 1
 
20.0%
E 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13891
82.3%
ASCII 2894
 
17.1%
None 97
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1735
 
12.5%
864
 
6.2%
811
 
5.8%
790
 
5.7%
763
 
5.5%
717
 
5.2%
365
 
2.6%
263
 
1.9%
258
 
1.9%
251
 
1.8%
Other values (380) 7074
50.9%
ASCII
ValueCountFrequency (%)
) 1055
36.5%
( 1044
36.1%
783
27.1%
C 3
 
0.1%
1 3
 
0.1%
2 1
 
< 0.1%
9 1
 
< 0.1%
. 1
 
< 0.1%
O 1
 
< 0.1%
- 1
 
< 0.1%
None
ValueCountFrequency (%)
97
100.0%

대표자
Text

Distinct: 1798
Distinct (%): 91.0%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB

Length

Max length: 11
Median length: 3
Mean length: 3.0744304
Min length: 2

Characters and Unicode

Total characters: 6072
Distinct characters: 234
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 1648
Unique (%): 83.4%

Sample

1st row: 이경민
2nd row: 김규헌
3rd row: 이광세
4th row: 박인용
5th row: 한재국
ValueCountFrequency (%)
정영희 5
 
0.2%
김진희 4
 
0.2%
이옥준 4
 
0.2%
윤창일 4
 
0.2%
강언모 4
 
0.2%
김혜경 4
 
0.2%
이은희 3
 
0.1%
전경애 3
 
0.1%
3
 
0.1%
황태연 3
 
0.1%
Other values (1815) 1980
98.2%

Most occurring characters

ValueCountFrequency (%)
425
 
7.0%
327
 
5.4%
225
 
3.7%
200
 
3.3%
189
 
3.1%
129
 
2.1%
113
 
1.9%
105
 
1.7%
105
 
1.7%
100
 
1.6%
Other values (224) 4154
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6029
99.3%
Space Separator 42
 
0.7%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
425
 
7.0%
327
 
5.4%
225
 
3.7%
200
 
3.3%
189
 
3.1%
129
 
2.1%
113
 
1.9%
105
 
1.7%
105
 
1.7%
100
 
1.7%
Other values (222) 4111
68.2%
Space Separator
ValueCountFrequency (%)
42
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6029
99.3%
Common 43
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
425
 
7.0%
327
 
5.4%
225
 
3.7%
200
 
3.3%
189
 
3.1%
129
 
2.1%
113
 
1.9%
105
 
1.7%
105
 
1.7%
100
 
1.7%
Other values (222) 4111
68.2%
Common
ValueCountFrequency (%)
42
97.7%
. 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6029
99.3%
ASCII 43
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
425
 
7.0%
327
 
5.4%
225
 
3.7%
200
 
3.3%
189
 
3.1%
129
 
2.1%
113
 
1.9%
105
 
1.7%
105
 
1.7%
100
 
1.7%
Other values (222) 4111
68.2%
ASCII
ValueCountFrequency (%)
42
97.7%
. 1
 
2.3%

사업종류
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB
2종 나무병원: 1398
1종 나무병원: 577

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1종 나무병원
2nd row: 1종 나무병원
3rd row: 1종 나무병원
4th row: 1종 나무병원
5th row: 1종 나무병원

Common Values

Value | Count | Frequency (%)
2종 나무병원 | 1398 | 70.8%
1종 나무병원 | 577 | 29.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
나무병원 | 1975 | 50.0%
2종 | 1398 | 35.4%
1종 | 577 | 14.6%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1508
Distinct (%): 76.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28068.94
Minimum: 1014
Maximum: 63600
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.5 KiB

Quantile statistics

Minimum: 1014
5th percentile: 5790.2
Q1: 14029
Median: 24307
Q3: 41855
95th percentile: 58526.3
Maximum: 63600
Range: 62586
Interquartile range (IQR): 27826

Descriptive statistics

Standard deviation: 16769.025
Coefficient of variation (CV): 0.59742279
Kurtosis: -0.91722227
Mean: 28068.94
Median Absolute Deviation (MAD): 11106
Skewness: 0.48429107
Sum: 55436157
Variance: 2.8120019 × 10^8
Monotonicity: Not monotonic
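
Several of the statistics above are derived from one another, which gives a cheap consistency check. The sketch below recomputes the range, IQR, mean, coefficient of variation and variance from the reported minimum, maximum, quartiles, sum and standard deviation; the observation count of 1975 comes from the dataset overview.

```python
# Values copied from the quantile and descriptive statistics of 우편번호.
minimum, maximum = 1014, 63600
q1, q3 = 14029, 41855
total, n_observations = 55436157, 1975
std = 16769.025

value_range = maximum - minimum      # Range
iqr = q3 - q1                        # Interquartile range
mean = total / n_observations        # Mean = Sum / number of observations
cv = std / mean                      # Coefficient of variation
variance = std ** 2                  # Variance is the squared standard deviation

print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"Mean: {mean:.2f}")
print(f"CV: {cv:.5f}")
print(f"Variance: {variance:.4e}")
```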
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13631 18
 
0.9%
21673 8
 
0.4%
10523 7
 
0.4%
31212 7
 
0.4%
24210 7
 
0.4%
10073 6
 
0.3%
34186 6
 
0.3%
42936 5
 
0.3%
13837 5
 
0.3%
10071 5
 
0.3%
Other values (1498) 1901
96.3%
ValueCountFrequency (%)
1014 1
0.1%
1042 1
0.1%
1073 1
0.1%
1177 1
0.1%
1235 2
0.1%
1305 2
0.1%
1306 1
0.1%
1327 1
0.1%
1344 1
0.1%
1402 1
0.1%
ValueCountFrequency (%)
63600 1
0.1%
63584 1
0.1%
63569 1
0.1%
63333 1
0.1%
63186 1
0.1%
63147 1
0.1%
63125 1
0.1%
63082 1
0.1%
63081 1
0.1%
63067 1
0.1%
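
Because 우편번호 was profiled as a real number, the smallest values appear with four digits (for example 1014). Assuming these are five-digit Korean postal codes whose leading zero was lost when the column was parsed as numeric, a sketch of restoring the textual form could look like this; the helper name is hypothetical.

```python
def format_postal_code(value: int) -> str:
    """Render a postal code as a zero-padded five-character string.

    Assumption: every value is a five-digit postal code that may have lost
    its leading zero when the column was read as a number.
    """
    if not 0 <= value <= 99999:
        raise ValueError(f"not a five-digit postal code: {value}")
    return f"{value:05d}"


# A few values that appear in the report.
for code in (1014, 6584, 24383, 63600):
    print(code, "->", format_postal_code(code))
```

Reading the column as text in the first place avoids the problem and also keeps the profiler from computing means and variances that have no meaning for postal codes.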

주소
Text

Distinct: 1894
Distinct (%): 95.9%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB

Length

Max length: 53
Median length: 42
Mean length: 28.329114
Min length: 10

Characters and Unicode

Total characters: 55950
Distinct characters: 505
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 1825
Unique (%): 92.4%

Sample

1st row: 강원도 춘천시 벌말길47번길 30 (석사동)
2nd row: 강원도 춘천시 후석로420번길 49-11 (후평동)
3rd row: 강원도 춘천시 충열로 45 옥산빌딩 4층 (우두동)
4th row: 강원도 춘천시 후만로126번길 24 (후평동)
5th row: 강원도 원주시 호저면 운동들2길 29
ValueCountFrequency (%)
경기도 621
 
5.2%
서울특별시 195
 
1.6%
2층 160
 
1.4%
인천광역시 154
 
1.3%
강원도 104
 
0.9%
충남 97
 
0.8%
대전광역시 93
 
0.8%
1층 89
 
0.8%
울산광역시 86
 
0.7%
성남시 80
 
0.7%
Other values (4127) 10154
85.8%

Most occurring characters

ValueCountFrequency (%)
11367
 
20.3%
1 2193
 
3.9%
1913
 
3.4%
1842
 
3.3%
1639
 
2.9%
2 1521
 
2.7%
( 1412
 
2.5%
) 1411
 
2.5%
1204
 
2.2%
3 1121
 
2.0%
Other values (495) 30327
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31498
56.3%
Space Separator 11367
 
20.3%
Decimal Number 9724
 
17.4%
Open Punctuation 1412
 
2.5%
Close Punctuation 1411
 
2.5%
Dash Punctuation 480
 
0.9%
Uppercase Letter 41
 
0.1%
Lowercase Letter 12
 
< 0.1%
Math Symbol 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1913
 
6.1%
1842
 
5.8%
1639
 
5.2%
1204
 
3.8%
973
 
3.1%
930
 
3.0%
892
 
2.8%
791
 
2.5%
731
 
2.3%
685
 
2.2%
Other values (463) 19898
63.2%
Uppercase Letter
ValueCountFrequency (%)
B 16
39.0%
A 9
22.0%
C 4
 
9.8%
F 3
 
7.3%
S 3
 
7.3%
T 1
 
2.4%
R 1
 
2.4%
H 1
 
2.4%
D 1
 
2.4%
P 1
 
2.4%
Decimal Number
ValueCountFrequency (%)
1 2193
22.6%
2 1521
15.6%
3 1121
11.5%
0 1092
11.2%
4 829
 
8.5%
5 739
 
7.6%
6 667
 
6.9%
7 625
 
6.4%
8 532
 
5.5%
9 405
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
a 4
33.3%
c 3
25.0%
b 3
25.0%
d 1
 
8.3%
e 1
 
8.3%
Space Separator
ValueCountFrequency (%)
11367
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1412
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1411
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 480
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31498
56.3%
Common 24399
43.6%
Latin 53
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1913
 
6.1%
1842
 
5.8%
1639
 
5.2%
1204
 
3.8%
973
 
3.1%
930
 
3.0%
892
 
2.8%
791
 
2.5%
731
 
2.3%
685
 
2.2%
Other values (463) 19898
63.2%
Common
ValueCountFrequency (%)
11367
46.6%
1 2193
 
9.0%
2 1521
 
6.2%
( 1412
 
5.8%
) 1411
 
5.8%
3 1121
 
4.6%
0 1092
 
4.5%
4 829
 
3.4%
5 739
 
3.0%
6 667
 
2.7%
Other values (6) 2047
 
8.4%
Latin
ValueCountFrequency (%)
B 16
30.2%
A 9
17.0%
a 4
 
7.5%
C 4
 
7.5%
F 3
 
5.7%
S 3
 
5.7%
c 3
 
5.7%
b 3
 
5.7%
d 1
 
1.9%
T 1
 
1.9%
Other values (6) 6
 
11.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31498
56.3%
ASCII 24450
43.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11367
46.5%
1 2193
 
9.0%
2 1521
 
6.2%
( 1412
 
5.8%
) 1411
 
5.8%
3 1121
 
4.6%
0 1092
 
4.5%
4 829
 
3.4%
5 739
 
3.0%
6 667
 
2.7%
Other values (21) 2098
 
8.6%
Hangul
ValueCountFrequency (%)
1913
 
6.1%
1842
 
5.8%
1639
 
5.2%
1204
 
3.8%
973
 
3.1%
930
 
3.0%
892
 
2.8%
791
 
2.5%
731
 
2.3%
685
 
2.2%
Other values (463) 19898
63.2%
None
ValueCountFrequency (%)
· 2
100.0%

상세주소
Text

MISSING 

Distinct: 690
Distinct (%): 57.5%
Missing: 776
Missing (%): 39.3%
Memory size: 15.6 KiB

Length

Max length: 20
Median length: 18
Mean length: 5.4278565
Min length: 1

Characters and Unicode

Total characters: 6508
Distinct characters: 296
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 601
Unique (%): 50.1%

Sample

1st row 
2nd row 
3rd row 
4th row 
5th row 
ValueCountFrequency (%)
2층 142
 
9.3%
1층 77
 
5.0%
3층 59
 
3.8%
201호 36
 
2.3%
101호 29
 
1.9%
4층 27
 
1.8%
상가동 23
 
1.5%
202호 20
 
1.3%
302호 20
 
1.3%
0 19
 
1.2%
Other values (712) 1083
70.6%

Most occurring characters

ValueCountFrequency (%)
610
 
9.4%
1 595
 
9.1%
0 541
 
8.3%
2 517
 
7.9%
408
 
6.3%
375
 
5.8%
290
 
4.5%
3 279
 
4.3%
( 206
 
3.2%
) 204
 
3.1%
Other values (286) 2483
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3013
46.3%
Decimal Number 2546
39.1%
Space Separator 425
 
6.5%
Open Punctuation 206
 
3.2%
Close Punctuation 204
 
3.1%
Dash Punctuation 66
 
1.0%
Uppercase Letter 36
 
0.6%
Lowercase Letter 9
 
0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
610
20.2%
375
 
12.4%
290
 
9.6%
63
 
2.1%
63
 
2.1%
61
 
2.0%
61
 
2.0%
59
 
2.0%
54
 
1.8%
48
 
1.6%
Other values (256) 1329
44.1%
Decimal Number
ValueCountFrequency (%)
1 595
23.4%
0 541
21.2%
2 517
20.3%
3 279
11.0%
4 166
 
6.5%
5 126
 
4.9%
6 100
 
3.9%
7 98
 
3.8%
8 72
 
2.8%
9 52
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
B 14
38.9%
A 9
25.0%
D 5
 
13.9%
C 3
 
8.3%
S 2
 
5.6%
F 1
 
2.8%
H 1
 
2.8%
E 1
 
2.8%
Lowercase Letter
ValueCountFrequency (%)
a 3
33.3%
c 2
22.2%
p 1
 
11.1%
r 1
 
11.1%
d 1
 
11.1%
b 1
 
11.1%
Space Separator
ValueCountFrequency (%)
408
96.0%
  17
 
4.0%
Open Punctuation
ValueCountFrequency (%)
( 206
100.0%
Close Punctuation
ValueCountFrequency (%)
) 204
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3450
53.0%
Hangul 3013
46.3%
Latin 45
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
610
20.2%
375
 
12.4%
290
 
9.6%
63
 
2.1%
63
 
2.1%
61
 
2.0%
61
 
2.0%
59
 
2.0%
54
 
1.8%
48
 
1.6%
Other values (256) 1329
44.1%
Common
ValueCountFrequency (%)
1 595
17.2%
0 541
15.7%
2 517
15.0%
408
11.8%
3 279
8.1%
( 206
 
6.0%
) 204
 
5.9%
4 166
 
4.8%
5 126
 
3.7%
6 100
 
2.9%
Other values (6) 308
8.9%
Latin
ValueCountFrequency (%)
B 14
31.1%
A 9
20.0%
D 5
 
11.1%
C 3
 
6.7%
a 3
 
6.7%
S 2
 
4.4%
c 2
 
4.4%
F 1
 
2.2%
p 1
 
2.2%
r 1
 
2.2%
Other values (4) 4
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3478
53.4%
Hangul 3013
46.3%
None 17
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
610
20.2%
375
 
12.4%
290
 
9.6%
63
 
2.1%
63
 
2.1%
61
 
2.0%
61
 
2.0%
59
 
2.0%
54
 
1.8%
48
 
1.6%
Other values (256) 1329
44.1%
ASCII
ValueCountFrequency (%)
1 595
17.1%
0 541
15.6%
2 517
14.9%
408
11.7%
3 279
8.0%
( 206
 
5.9%
) 204
 
5.9%
4 166
 
4.8%
5 126
 
3.6%
6 100
 
2.9%
Other values (19) 336
9.7%
None
ValueCountFrequency (%)
  17
100.0%

영 폐업 여부
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB
영업: 1772
폐업: 121
<NA>: 82

Length

Max length: 4
Median length: 2
Mean length: 2.083038
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업
2nd row: 영업
3rd row: 영업
4th row: 영업
5th row: 영업

Common Values

Value | Count | Frequency (%)
영업 | 1772 | 89.7%
폐업 | 121 | 6.1%
<NA> | 82 | 4.2%
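
The report counts 0 missing values for this variable yet lists `<NA>` 82 times with a maximum length of 4, which suggests that `<NA>` is stored as a literal four-character string rather than as a true missing value. The sketch below first checks that reading against the reported mean length, then shows one hedged way to normalize such placeholders; `normalize_status` and the placeholder set are hypothetical.

```python
# If "<NA>" is a 4-character string and the other two values have 2 characters,
# the mean length should match the reported 2.083038.
mean_length = (2 * (1772 + 121) + 4 * 82) / 1975
print(f"Mean length: {mean_length:.6f}")

# Assumed placeholder spellings that should be treated as missing.
PLACEHOLDERS = {"", "<NA>"}


def normalize_status(value):
    """Map placeholder strings to None and strip surrounding whitespace."""
    if value is None:
        return None
    cleaned = value.strip()
    return None if cleaned in PLACEHOLDERS else cleaned


# Illustrative values only, not the full column.
statuses = ["영업", "폐업", "<NA>", " 영업 "]
normalized = [normalize_status(s) for s in statuses]
missing = sum(v is None for v in normalized)
print(normalized, missing)
```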

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
영업 | 1772 | 89.7%
폐업 | 121 | 6.1%
na | 82 | 4.2%

시도
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB
경기도: 621
서울특별시: 195
인천광역시: 154
강원도: 104
충남: 97
Other values (20): 804

Length

Max length: 7
Median length: 5
Mean length: 3.6612658
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 강원도
2nd row: 강원도
3rd row: 강원도
4th row: 강원도
5th row: 강원도

Common Values

Value | Count | Frequency (%)
경기도 | 621 | 31.4%
서울특별시 | 195 | 9.9%
인천광역시 | 154 | 7.8%
강원도 | 104 | 5.3%
충남 | 97 | 4.9%
대전광역시 | 93 | 4.7%
울산광역시 | 86 | 4.4%
경북 | 69 | 3.5%
전남 | 68 | 3.4%
경남 | 66 | 3.3%
Other values (15) | 422 | 21.4%
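
The common values mix full province names (경기도, 강원도) with abbreviated ones (충남, 경북, 전남, 경남), which may partly explain why 25 distinct values are reported. A hedged sketch of normalizing the abbreviations before grouping follows; the mapping is an illustrative assumption and has not been checked against every value in the column.

```python
# Hypothetical mapping from abbreviated province names to their full forms.
SIDO_FULL_NAMES = {
    "충남": "충청남도",
    "충북": "충청북도",
    "경남": "경상남도",
    "경북": "경상북도",
    "전남": "전라남도",
    "전북": "전라북도",
}


def normalize_sido(name: str) -> str:
    """Expand a known abbreviation; return other names unchanged (trimmed)."""
    cleaned = name.strip()
    return SIDO_FULL_NAMES.get(cleaned, cleaned)


for raw in ("충남", "경기도", " 경북 "):
    print(repr(raw), "->", normalize_sido(raw))
```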

Length

Histogram of lengths of the category

시군구
Text

Distinct: 204
Distinct (%): 10.3%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 KiB

Length

Max length: 7
Median length: 4
Mean length: 3.9377215
Min length: 1

Characters and Unicode

Total characters: 7777
Distinct characters: 139
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 31
Unique (%): 1.6%

Sample

1st row: 춘천시
2nd row: 춘천시
3rd row: 춘천시
4th row: 춘천시
5th row: 원주시
ValueCountFrequency (%)
성남시 80
 
4.1%
서구 77
 
3.9%
고양시 64
 
3.2%
용인시 63
 
3.2%
수원시 54
 
2.7%
중구 53
 
2.7%
화성시 50
 
2.5%
남동구 42
 
2.1%
청주시 40
 
2.0%
김포시 40
 
2.0%
Other values (193) 1411
71.5%

Most occurring characters

ValueCountFrequency (%)
1975
25.4%
1064
13.7%
630
 
8.1%
298
 
3.8%
239
 
3.1%
230
 
3.0%
230
 
3.0%
175
 
2.3%
170
 
2.2%
140
 
1.8%
Other values (129) 2626
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5801
74.6%
Space Separator 1975
 
25.4%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1064
18.3%
630
 
10.9%
298
 
5.1%
239
 
4.1%
230
 
4.0%
230
 
4.0%
175
 
3.0%
170
 
2.9%
140
 
2.4%
124
 
2.1%
Other values (127) 2501
43.1%
Space Separator
ValueCountFrequency (%)
1975
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5801
74.6%
Common 1976
 
25.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1064
18.3%
630
 
10.9%
298
 
5.1%
239
 
4.1%
230
 
4.0%
230
 
4.0%
175
 
3.0%
170
 
2.9%
140
 
2.4%
124
 
2.1%
Other values (127) 2501
43.1%
Common
ValueCountFrequency (%)
1975
99.9%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5801
74.6%
ASCII 1976
 
25.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1975
99.9%
1 1
 
0.1%
Hangul
ValueCountFrequency (%)
1064
18.3%
630
 
10.9%
298
 
5.1%
239
 
4.1%
230
 
4.0%
230
 
4.0%
175
 
3.0%
170
 
2.9%
140
 
2.4%
124
 
2.1%
Other values (127) 2501
43.1%

Interactions


Correlations

 | 사업종류 | 우편번호 | 영 폐업 여부 | 시도
사업종류 | 1.000 | 0.242 | 0.150 | 0.357
우편번호 | 0.242 | 1.000 | 0.435 | 0.977
영 폐업 여부 | 0.150 | 0.435 | 1.000 | 0.562
시도 | 0.357 | 0.977 | 0.562 | 1.000

 | 시도 | 영 폐업 여부 | 사업종류
시도 | 1.000 | 0.487 | 0.307
영 폐업 여부 | 0.487 | 1.000 | 0.096
사업종류 | 0.307 | 0.096 | 1.000

 | 우편번호 | 사업종류 | 영 폐업 여부 | 시도
우편번호 | 1.000 | 0.186 | 0.333 | 0.832
사업종류 | 0.186 | 1.000 | 0.096 | 0.307
영 폐업 여부 | 0.333 | 0.096 | 1.000 | 0.487
시도 | 0.832 | 0.307 | 0.487 | 1.000
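
The report does not say here which association measure each matrix uses. As an illustration of how an association between two categorical variables such as 시도 and 사업종류 can be scored on a 0 to 1 scale, the sketch below implements the plain (uncorrected) Cramér's V from a contingency table. The function and the example tables are illustrative assumptions; they are not taken from the dataset and are not expected to reproduce the figures above.

```python
import math


def cramers_v(table):
    """Plain Cramér's V for a contingency table given as a list of rows of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Made-up 2x2 tables: a perfect association and no association.
print(cramers_v([[10, 0], [0, 10]]))
print(cramers_v([[5, 5], [5, 5]]))
```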

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 법인명 | 대표자 | 사업종류 | 우편번호 | 주소 | 상세주소 | 영 폐업 여부 | 시도 | 시군구
0 | 주식회사 대동나무병원 | 이경민 | 1종 나무병원 | 24383 | 강원도 춘천시 벌말길47번길 30 (석사동) | | 영업 | 강원도 | 춘천시
1 | 주식회사 나무병원건강한숲 | 김규헌 | 1종 나무병원 | 24233 | 강원도 춘천시 후석로420번길 49-11 (후평동) | <NA> | 영업 | 강원도 | 춘천시
2 | 주식회사 강원나무병원 | 이광세 | 1종 나무병원 | 24226 | 강원도 춘천시 충열로 45 옥산빌딩 4층 (우두동) | | 영업 | 강원도 | 춘천시
3 | 호반방역청소용역 주식회사 | 박인용 | 1종 나무병원 | 24307 | 강원도 춘천시 후만로126번길 24 (후평동) | | 영업 | 강원도 | 춘천시
4 | 주식회사 목원 | 한재국 | 1종 나무병원 | 26348 | 강원도 원주시 호저면 운동들2길 29 | | 영업 | 강원도 | 원주시
5 | 주식회사 강산조경 | 김창수 | 1종 나무병원 | 26310 | 강원도 원주시 소초면 황골로 113 | | 영업 | 강원도 | 원주시
6 | 유한회사 녹색산림 | 성상수 | 1종 나무병원 | 26316 | 강원도 원주시 북원상가길 13 북원상가아파트 2층 49-1호 (태장동) | 북원상가아파트 2층 49-1호 | 영업 | 강원도 | 원주시
7 | 주식회사 한오름 | 배형진 | 1종 나무병원 | 26425 | 강원도 원주시 일산초교길 59 2층 (일산동) | 2층 | 영업 | 강원도 | 원주시
8 | 지원나무영농조합법인 | 조옥선 | 1종 나무병원 | 25525 | 강원도 강릉시 강릉대로 82 유림빌딩 302호 (홍제동) | 유림빌딩 302호 | 폐업 | 강원도 | 강릉시
9 | 주식회사 솔뫼나무병원 | 이태선 | 1종 나무병원 | 25515 | 강원도 강릉시 솔올로5번길 47 603호 (교동) | 603호 | 영업 | 강원도 | 강릉시

Last rows

 | 법인명 | 대표자 | 사업종류 | 우편번호 | 주소 | 상세주소 | 영 폐업 여부 | 시도 | 시군구
1965 | 영농조합법인 중부조경 | 곽혜인 | 2종 나무병원 | 28703 | 충북 청주시 서원구 구룡산로405번길 54 제1층 42호 (모충동 남부상가아파트) | <NA> | 영업 | 충북 | 청주시
1966 | (주)아주조경 | 지용현 | 2종 나무병원 | 28554 | 충북 청주시 서원구 사직대로 295 2층 (사직동) | <NA> | 영업 | 충북 | 청주시
1967 | 명품조경(주) | 허대호 | 1종 나무병원 | 28677 | 충북 청주시 서원구 청남로 2107-2 1층 (수곡동) | <NA> | 영업 | 충북 | 청주시
1968 | (주)세솔나무병원 | 이상엽 | 2종 나무병원 | 28629 | 충북 청주시 서원구 산미로 95 3동 203호 (산남동) | <NA> | 영업 | 충북 | 청주시
1969 | 태일조경(주) | 성세해 | 2종 나무병원 | 28293 | 충북 청주시 흥덕구 송화로108번길 45 (송절동 태일빌딩) | 태일빌딩 | 영업 | 충북 | 청주시
1970 | (주)초록원 | 김형득 | 2종 나무병원 | 29140 | 충북 영동군 영동읍 동정로 39-22 1층 | 1층 | 영업 | 충북 | 영동군
1971 | (주)유수 | 김동민 | 2종 나무병원 | 28476 | 충북 청주시 흥덕구 직지대로 685 (신봉동) | <NA> | 영업 | 충북 | 청주시
1972 | 주식회사 한숲 | 박승희 | 2종 나무병원 | 27112 | 충북 제천시 봉양읍 북부로7길 38-11 | <NA> | 영업 | 충북 | 제천시
1973 | (주)뿌리나무병원 | 유광환 | 1종 나무병원 | 28365 | 충북 청주시 흥덕구 비하로12번길 14 1010호(현대오피스텔) (비하동) | <NA> | 영업 | 충북 | 청주시
1974 | 주식회사 장백 | 이원섭 | 2종 나무병원 | 28798 | 충북 청주시 서원구 1순환로 1059 118호(분평동) | 118호 | 영업 | 충북 | 청주시

Duplicate rows

Most frequently occurring

 | 법인명 | 대표자 | 사업종류 | 우편번호 | 주소 | 상세주소 | 영 폐업 여부 | 시도 | 시군구 | # duplicates
0 | (주)신화비엠씨 | 정영희 | 2종 나무병원 | 1655 | 서울특별시 노원구 한글비석로52길 6 3층 (상계동) | 3층 | 영업 | 서울특별시 | 노원구 | 2
1 | 삼미조경공사(주) | 최재중 | 2종 나무병원 | 6584 | 서울특별시 서초구 방배로42길 35 302호 (방배동) | 302호 | 영업 | 서울특별시 | 서초구 | 2
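
A minimal sketch of how fully duplicated rows like these could be found with the standard library, assuming each record is represented as a tuple of its field values. The helper name is hypothetical, and the example rows are shortened to three fields (법인명, 대표자, 우편번호) for brevity.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return each row that occurs more than once, with its occurrence count."""
    counts = Counter(rows)
    return {row: count for row, count in counts.items() if count > 1}


# Shortened illustrative records; the real rows have nine fields.
rows = [
    ("(주)신화비엠씨", "정영희", 1655),
    ("(주)신화비엠씨", "정영희", 1655),
    ("주식회사 목원", "한재국", 26348),
]

for row, count in duplicate_rows(rows).items():
    print(row, count)
```

Rows must match on every field to count as duplicates here, so differences in whitespace or spelling would need to be normalized first.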