Overview

Dataset statistics

Number of variables7
Number of observations6806
Missing cells141
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory379.0 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Text5
Categorical1

Dataset

Description충청남도에 등록되어있는 전문건설업체 데이터로 업체명,대표자,업종,지역,주소,우편번호 등의 항목을 제공합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15044854/fileData.do

Alerts

우편번호 has 97 (1.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 15:39:28.684213
Analysis finished2024-03-14 15:39:31.388446
Duration2.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct6806
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3403.5
Minimum1
Maximum6806
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size59.9 KiB
2024-03-15T00:39:31.624830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile341.25
Q11702.25
median3403.5
Q35104.75
95-th percentile6465.75
Maximum6806
Range6805
Interquartile range (IQR)3402.5

Descriptive statistics

Standard deviation1964.8673
Coefficient of variation (CV)0.57730786
Kurtosis-1.2
Mean3403.5
Median Absolute Deviation (MAD)1701.5
Skewness0
Sum23164221
Variance3860703.5
MonotonicityStrictly increasing
2024-03-15T00:39:31.975295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
4548 1
 
< 0.1%
4546 1
 
< 0.1%
4545 1
 
< 0.1%
4544 1
 
< 0.1%
4543 1
 
< 0.1%
4542 1
 
< 0.1%
4541 1
 
< 0.1%
4540 1
 
< 0.1%
4539 1
 
< 0.1%
Other values (6796) 6796
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
6806 1
< 0.1%
6805 1
< 0.1%
6804 1
< 0.1%
6803 1
< 0.1%
6802 1
< 0.1%
6801 1
< 0.1%
6800 1
< 0.1%
6799 1
< 0.1%
6798 1
< 0.1%
6797 1
< 0.1%

상호
Text

Distinct4398
Distinct (%)64.6%
Missing0
Missing (%)0.0%
Memory size53.3 KiB
2024-03-15T00:39:33.128093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length7.4180135
Min length2

Characters and Unicode

Total characters50487
Distinct characters533
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2897 ?
Unique (%)42.6%

Sample

1st row(유)거성개발
2nd row(유)관우환경개발
3rd row(유)금산주거복지센터
4th row(유)네오건설
5th row(유)네오건설
ValueCountFrequency (%)
서원건설(주 10
 
0.1%
세종건설(주 10
 
0.1%
주)대명건설 10
 
0.1%
대한건설(주 10
 
0.1%
삼호개발(주 9
 
0.1%
일진건설(주 9
 
0.1%
현대스틸산업(주 8
 
0.1%
현우건설(주 8
 
0.1%
금강건설(주 8
 
0.1%
주원건설(주 8
 
0.1%
Other values (4389) 6720
98.7%
2024-03-15T00:39:35.137240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5659
 
11.2%
) 4651
 
9.2%
( 4650
 
9.2%
3203
 
6.3%
3131
 
6.2%
1289
 
2.6%
1120
 
2.2%
1097
 
2.2%
1055
 
2.1%
910
 
1.8%
Other values (523) 23722
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40924
81.1%
Close Punctuation 4651
 
9.2%
Open Punctuation 4650
 
9.2%
Uppercase Letter 180
 
0.4%
Other Punctuation 33
 
0.1%
Decimal Number 27
 
0.1%
Other Symbol 11
 
< 0.1%
Lowercase Letter 7
 
< 0.1%
Space Separator 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5659
 
13.8%
3203
 
7.8%
3131
 
7.7%
1289
 
3.1%
1120
 
2.7%
1097
 
2.7%
1055
 
2.6%
910
 
2.2%
767
 
1.9%
683
 
1.7%
Other values (481) 22010
53.8%
Uppercase Letter
ValueCountFrequency (%)
G 45
25.0%
E 43
23.9%
N 40
22.2%
S 11
 
6.1%
L 9
 
5.0%
K 6
 
3.3%
R 4
 
2.2%
C 4
 
2.2%
P 4
 
2.2%
A 3
 
1.7%
Other values (9) 11
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 11
40.7%
3 6
22.2%
2 2
 
7.4%
4 2
 
7.4%
0 2
 
7.4%
5 2
 
7.4%
9 2
 
7.4%
Other Punctuation
ValueCountFrequency (%)
. 21
63.6%
& 6
 
18.2%
2
 
6.1%
, 2
 
6.1%
1
 
3.0%
/ 1
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
o 1
14.3%
t 1
14.3%
d 1
14.3%
g 1
14.3%
n 1
14.3%
Close Punctuation
ValueCountFrequency (%)
) 4651
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4650
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40935
81.1%
Common 9365
 
18.5%
Latin 187
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5659
 
13.8%
3203
 
7.8%
3131
 
7.6%
1289
 
3.1%
1120
 
2.7%
1097
 
2.7%
1055
 
2.6%
910
 
2.2%
767
 
1.9%
683
 
1.7%
Other values (482) 22021
53.8%
Latin
ValueCountFrequency (%)
G 45
24.1%
E 43
23.0%
N 40
21.4%
S 11
 
5.9%
L 9
 
4.8%
K 6
 
3.2%
R 4
 
2.1%
C 4
 
2.1%
P 4
 
2.1%
A 3
 
1.6%
Other values (15) 18
 
9.6%
Common
ValueCountFrequency (%)
) 4651
49.7%
( 4650
49.7%
. 21
 
0.2%
1 11
 
0.1%
& 6
 
0.1%
3 6
 
0.1%
4
 
< 0.1%
2 2
 
< 0.1%
2
 
< 0.1%
4 2
 
< 0.1%
Other values (6) 10
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40924
81.1%
ASCII 9549
 
18.9%
None 14
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5659
 
13.8%
3203
 
7.8%
3131
 
7.7%
1289
 
3.1%
1120
 
2.7%
1097
 
2.7%
1055
 
2.6%
910
 
2.2%
767
 
1.9%
683
 
1.7%
Other values (481) 22010
53.8%
ASCII
ValueCountFrequency (%)
) 4651
48.7%
( 4650
48.7%
G 45
 
0.5%
E 43
 
0.5%
N 40
 
0.4%
. 21
 
0.2%
S 11
 
0.1%
1 11
 
0.1%
L 9
 
0.1%
K 6
 
0.1%
Other values (29) 62
 
0.6%
None
ValueCountFrequency (%)
11
78.6%
2
 
14.3%
1
 
7.1%

업종
Categorical

Distinct14
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size53.3 KiB
철근ㆍ콘크리트공사업
1296 
가스난방공사업
999 
지반조성ㆍ포장공사업
872 
도장ㆍ습식ㆍ방수ㆍ석공사업
720 
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업
610 
Other values (9)
2309 

Length

Max length17
Median length13
Mean length10.363797
Min length7

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row철근ㆍ콘크리트공사업
2nd row구조물해체ㆍ비계공사업
3rd row가스난방공사업
4th row지반조성ㆍ포장공사업
5th row철근ㆍ콘크리트공사업

Common Values

ValueCountFrequency (%)
철근ㆍ콘크리트공사업 1296
19.0%
가스난방공사업 999
14.7%
지반조성ㆍ포장공사업 872
12.8%
도장ㆍ습식ㆍ방수ㆍ석공사업 720
10.6%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 610
9.0%
기계가스설비공사업 575
8.4%
상ㆍ하수도설비공사업 513
 
7.5%
조경식재ㆍ시설물공사업 502
 
7.4%
실내건축공사업 335
 
4.9%
구조물해체ㆍ비계공사업 290
 
4.3%
Other values (4) 94
 
1.4%

Length

2024-03-15T00:39:35.624378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
철근ㆍ콘크리트공사업 1296
19.0%
가스난방공사업 999
14.7%
지반조성ㆍ포장공사업 872
12.8%
도장ㆍ습식ㆍ방수ㆍ석공사업 720
10.6%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 610
9.0%
기계가스설비공사업 575
8.4%
상ㆍ하수도설비공사업 513
 
7.5%
조경식재ㆍ시설물공사업 502
 
7.4%
실내건축공사업 335
 
4.9%
구조물해체ㆍ비계공사업 290
 
4.3%
Other values (4) 94
 
1.4%
Distinct6783
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size53.3 KiB
2024-03-15T00:39:36.779292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length13.281222
Min length2

Characters and Unicode

Total characters90392
Distinct characters146
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6760 ?
Unique (%)99.3%

Sample

1st row무안02­10­14
2nd row충남서천2017­06­02
3rd row충남 금산 2015­26­01
4th row충남보령2021­02­03
5th row충남보령2006­10­66
ValueCountFrequency (%)
충남 267
 
3.4%
청양 162
 
2.0%
충남홍성 91
 
1.2%
충남금산 64
 
0.8%
충남태안 55
 
0.7%
충남부여 37
 
0.5%
충남논산 31
 
0.4%
태안 25
 
0.3%
논산 21
 
0.3%
충남계룡 21
 
0.3%
Other values (6741) 7130
90.2%
2024-03-15T00:39:38.311775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 16820
18.6%
­ 13942
15.4%
2 11429
12.6%
1 8917
9.9%
5626
 
6.2%
5592
 
6.2%
9 2363
 
2.6%
3 2279
 
2.5%
2035
 
2.3%
4 1987
 
2.2%
Other values (136) 19402
21.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 49673
55.0%
Other Letter 25644
28.4%
Format 13942
 
15.4%
Space Separator 1098
 
1.2%
Close Punctuation 17
 
< 0.1%
Open Punctuation 16
 
< 0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5626
21.9%
5592
21.8%
2035
 
7.9%
1295
 
5.0%
1248
 
4.9%
698
 
2.7%
586
 
2.3%
572
 
2.2%
435
 
1.7%
431
 
1.7%
Other values (121) 7126
27.8%
Decimal Number
ValueCountFrequency (%)
0 16820
33.9%
2 11429
23.0%
1 8917
18.0%
9 2363
 
4.8%
3 2279
 
4.6%
4 1987
 
4.0%
7 1636
 
3.3%
6 1498
 
3.0%
8 1477
 
3.0%
5 1267
 
2.6%
Format
ValueCountFrequency (%)
­ 13942
100.0%
Space Separator
ValueCountFrequency (%)
1098
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 64748
71.6%
Hangul 25644
 
28.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5626
21.9%
5592
21.8%
2035
 
7.9%
1295
 
5.0%
1248
 
4.9%
698
 
2.7%
586
 
2.3%
572
 
2.2%
435
 
1.7%
431
 
1.7%
Other values (121) 7126
27.8%
Common
ValueCountFrequency (%)
0 16820
26.0%
­ 13942
21.5%
2 11429
17.7%
1 8917
13.8%
9 2363
 
3.6%
3 2279
 
3.5%
4 1987
 
3.1%
7 1636
 
2.5%
6 1498
 
2.3%
8 1477
 
2.3%
Other values (5) 2400
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50806
56.2%
Hangul 25644
28.4%
None 13942
 
15.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 16820
33.1%
2 11429
22.5%
1 8917
17.6%
9 2363
 
4.7%
3 2279
 
4.5%
4 1987
 
3.9%
7 1636
 
3.2%
6 1498
 
2.9%
8 1477
 
2.9%
5 1267
 
2.5%
Other values (4) 1133
 
2.2%
None
ValueCountFrequency (%)
­ 13942
100.0%
Hangul
ValueCountFrequency (%)
5626
21.9%
5592
21.8%
2035
 
7.9%
1295
 
5.0%
1248
 
4.9%
698
 
2.7%
586
 
2.3%
572
 
2.2%
435
 
1.7%
431
 
1.7%
Other values (121) 7126
27.8%
Distinct4243
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Memory size53.3 KiB
2024-03-15T00:39:39.755419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.0900676
Min length2

Characters and Unicode

Total characters21031
Distinct characters273
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2633 ?
Unique (%)38.7%

Sample

1st row유인규
2nd row노광현
3rd row김순태
4th row주선희
5th row주선희
ValueCountFrequency (%)
김민수 11
 
0.2%
심재범 9
 
0.1%
김태영 9
 
0.1%
홍애라 9
 
0.1%
이영희 8
 
0.1%
김성일 8
 
0.1%
김현주 8
 
0.1%
이청휴 8
 
0.1%
이홍영 7
 
0.1%
김영호 7
 
0.1%
Other values (4233) 6722
98.8%
2024-03-15T00:39:41.431285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1345
 
6.4%
1237
 
5.9%
666
 
3.2%
599
 
2.8%
543
 
2.6%
392
 
1.9%
391
 
1.9%
385
 
1.8%
349
 
1.7%
344
 
1.6%
Other values (263) 14780
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20872
99.2%
Other Punctuation 159
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1345
 
6.4%
1237
 
5.9%
666
 
3.2%
599
 
2.9%
543
 
2.6%
392
 
1.9%
391
 
1.9%
385
 
1.8%
349
 
1.7%
344
 
1.6%
Other values (261) 14621
70.1%
Other Punctuation
ValueCountFrequency (%)
, 155
97.5%
4
 
2.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20872
99.2%
Common 159
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1345
 
6.4%
1237
 
5.9%
666
 
3.2%
599
 
2.9%
543
 
2.6%
392
 
1.9%
391
 
1.9%
385
 
1.8%
349
 
1.7%
344
 
1.6%
Other values (261) 14621
70.1%
Common
ValueCountFrequency (%)
, 155
97.5%
4
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20872
99.2%
ASCII 155
 
0.7%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1345
 
6.4%
1237
 
5.9%
666
 
3.2%
599
 
2.9%
543
 
2.6%
392
 
1.9%
391
 
1.9%
385
 
1.8%
349
 
1.7%
344
 
1.6%
Other values (261) 14621
70.1%
ASCII
ValueCountFrequency (%)
, 155
100.0%
None
ValueCountFrequency (%)
4
100.0%

우편번호
Text

MISSING 

Distinct1127
Distinct (%)16.8%
Missing97
Missing (%)1.4%
Memory size53.3 KiB
2024-03-15T00:39:42.957347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0005962
Min length5

Characters and Unicode

Total characters33549
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)3.4%

Sample

1st row31991
2nd row33673
3rd row32729
4th row33451
5th row33451
ValueCountFrequency (%)
32144 77
 
1.1%
31154 39
 
0.6%
32249 38
 
0.6%
33430 33
 
0.5%
32423 30
 
0.4%
33328 30
 
0.4%
32143 30
 
0.4%
32226 29
 
0.4%
32145 29
 
0.4%
32010 29
 
0.4%
Other values (1117) 6345
94.6%
2024-03-15T00:39:45.001004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 9817
29.3%
1 5467
16.3%
2 4801
14.3%
4 2806
 
8.4%
0 2160
 
6.4%
5 1989
 
5.9%
7 1927
 
5.7%
9 1829
 
5.5%
6 1588
 
4.7%
8 1163
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 33547
> 99.9%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 9817
29.3%
1 5467
16.3%
2 4801
14.3%
4 2806
 
8.4%
0 2160
 
6.4%
5 1989
 
5.9%
7 1927
 
5.7%
9 1829
 
5.5%
6 1588
 
4.7%
8 1163
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 33549
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 9817
29.3%
1 5467
16.3%
2 4801
14.3%
4 2806
 
8.4%
0 2160
 
6.4%
5 1989
 
5.9%
7 1927
 
5.7%
9 1829
 
5.5%
6 1588
 
4.7%
8 1163
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33549
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 9817
29.3%
1 5467
16.3%
2 4801
14.3%
4 2806
 
8.4%
0 2160
 
6.4%
5 1989
 
5.9%
7 1927
 
5.7%
9 1829
 
5.5%
6 1588
 
4.7%
8 1163
 
3.5%
Distinct4382
Distinct (%)64.8%
Missing44
Missing (%)0.6%
Memory size53.3 KiB
2024-03-15T00:39:46.449654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length50
Mean length24.75037
Min length11

Characters and Unicode

Total characters167362
Distinct characters461
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2888 ?
Unique (%)42.7%

Sample

1st row충청남도 서산시 덕지천로 23-5 (석림동)
2nd row충청남도 서천군 장항읍 장산로 349
3rd row충청남도 금산군 금산읍 북사직6길 45 헤븐 101호
4th row충청남도 보령시 중앙로 294 (대천동)
5th row충청남도 보령시 중앙로 294 (대천동)
ValueCountFrequency (%)
충청남도 6593
 
17.4%
천안시 1144
 
3.0%
아산시 674
 
1.8%
동남구 616
 
1.6%
당진시 580
 
1.5%
논산시 549
 
1.4%
서북구 536
 
1.4%
2층 525
 
1.4%
서산시 521
 
1.4%
보령시 517
 
1.4%
Other values (4609) 25639
67.7%
2024-03-15T00:39:48.531154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31132
 
18.6%
7721
 
4.6%
7309
 
4.4%
7025
 
4.2%
1 6758
 
4.0%
6692
 
4.0%
4782
 
2.9%
2 4714
 
2.8%
4619
 
2.8%
3977
 
2.4%
Other values (451) 82633
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 100134
59.8%
Space Separator 31132
 
18.6%
Decimal Number 28111
 
16.8%
Close Punctuation 2576
 
1.5%
Open Punctuation 2574
 
1.5%
Dash Punctuation 2108
 
1.3%
Other Punctuation 693
 
0.4%
Uppercase Letter 30
 
< 0.1%
Math Symbol 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7721
 
7.7%
7309
 
7.3%
7025
 
7.0%
6692
 
6.7%
4782
 
4.8%
4619
 
4.6%
3977
 
4.0%
3919
 
3.9%
2932
 
2.9%
2330
 
2.3%
Other values (419) 48828
48.8%
Decimal Number
ValueCountFrequency (%)
1 6758
24.0%
2 4714
16.8%
3 3116
11.1%
0 2575
 
9.2%
4 2368
 
8.4%
5 2017
 
7.2%
6 1955
 
7.0%
7 1658
 
5.9%
8 1513
 
5.4%
9 1437
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 13
43.3%
A 8
26.7%
C 2
 
6.7%
M 1
 
3.3%
G 1
 
3.3%
D 1
 
3.3%
S 1
 
3.3%
F 1
 
3.3%
N 1
 
3.3%
E 1
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 466
67.2%
203
29.3%
. 20
 
2.9%
/ 3
 
0.4%
@ 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
g 1
50.0%
h 1
50.0%
Space Separator
ValueCountFrequency (%)
31132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2576
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2574
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2108
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100134
59.8%
Common 67196
40.2%
Latin 32
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7721
 
7.7%
7309
 
7.3%
7025
 
7.0%
6692
 
6.7%
4782
 
4.8%
4619
 
4.6%
3977
 
4.0%
3919
 
3.9%
2932
 
2.9%
2330
 
2.3%
Other values (419) 48828
48.8%
Common
ValueCountFrequency (%)
31132
46.3%
1 6758
 
10.1%
2 4714
 
7.0%
3 3116
 
4.6%
) 2576
 
3.8%
0 2575
 
3.8%
( 2574
 
3.8%
4 2368
 
3.5%
- 2108
 
3.1%
5 2017
 
3.0%
Other values (10) 7258
 
10.8%
Latin
ValueCountFrequency (%)
B 13
40.6%
A 8
25.0%
C 2
 
6.2%
M 1
 
3.1%
g 1
 
3.1%
h 1
 
3.1%
G 1
 
3.1%
D 1
 
3.1%
S 1
 
3.1%
F 1
 
3.1%
Other values (2) 2
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100134
59.8%
ASCII 67025
40.0%
None 203
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31132
46.4%
1 6758
 
10.1%
2 4714
 
7.0%
3 3116
 
4.6%
) 2576
 
3.8%
0 2575
 
3.8%
( 2574
 
3.8%
4 2368
 
3.5%
- 2108
 
3.1%
5 2017
 
3.0%
Other values (21) 7087
 
10.6%
Hangul
ValueCountFrequency (%)
7721
 
7.7%
7309
 
7.3%
7025
 
7.0%
6692
 
6.7%
4782
 
4.8%
4619
 
4.6%
3977
 
4.0%
3919
 
3.9%
2932
 
2.9%
2330
 
2.3%
Other values (419) 48828
48.8%
None
ValueCountFrequency (%)
203
100.0%

Interactions

2024-03-15T00:39:30.112192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T00:39:48.834001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.236
업종0.2361.000
2024-03-15T00:39:49.202364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.097
업종0.0971.000

Missing values

2024-03-15T00:39:30.534274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:39:30.938976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T00:39:31.242172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호업종등록번호대표자우편번호영업소재지
01(유)거성개발철근ㆍ콘크리트공사업무안02­10­14유인규31991충청남도 서산시 덕지천로 23-5 (석림동)
12(유)관우환경개발구조물해체ㆍ비계공사업충남서천2017­06­02노광현33673충청남도 서천군 장항읍 장산로 349
23(유)금산주거복지센터가스난방공사업충남 금산 2015­26­01김순태32729충청남도 금산군 금산읍 북사직6길 45 헤븐 101호
34(유)네오건설지반조성ㆍ포장공사업충남보령2021­02­03주선희33451충청남도 보령시 중앙로 294 (대천동)
45(유)네오건설철근ㆍ콘크리트공사업충남보령2006­10­66주선희33451충청남도 보령시 중앙로 294 (대천동)
56(유)다살림구조물해체ㆍ비계공사업충남서천2010­06­01양수철33659충청남도 서천군 장항읍 장서로 262-10
67(유)다살림실내건축공사업충남서천2024­나­01양수철33659충청남도 서천군 장항읍 장서로 262-10
78(유)대도구조물해체ㆍ비계공사업동해2009­06­02김기범31449충청남도 아산시 탕정면 꾀꼴성길 110
89(유)대동포장건설철근ㆍ콘크리트공사업충남태안 2018­09­04오현자32141충청남도 태안군 태안읍 경이정1길 16-4 , 1층
910(유)대동포장건설지반조성ㆍ포장공사업충남당진2005­16­01오현자32141충청남도 태안군 태안읍 경이정1길 16-4 , 1층
연번상호업종등록번호대표자우편번호영업소재지
67966797희망가스설비가스난방공사업충남예산2009­24­01최연수32299충청남도 홍성군 장곡면 무한로 936
67976798희망건설(주)철근ㆍ콘크리트공사업울산남구2004­10­03양충모31233충청남도 천안시 동남구 목천읍 충절로 874 상가동 201호(협성엠파이어아파트)
67986799희망건설(주)철근ㆍ콘크리트공사업충남서천2003­10­1신경미33615충청남도 서천군 판교면 종판로887번길 47
67996800희망설비가스난방공사업충남서산2020­27­71유영열31984충청남도 서산시 동헌로 139 (읍내동)
68006801희망설비가스난방공사업충남당진2017­27­02박호현31714충청남도 당진시 송산면 상거길 34
68016802희성건설(주)구조물해체ㆍ비계공사업충남 계룡 2020­06­01윤병서32829충청남도 계룡시 엄사면 소라실길 47
68026803희성건설(주)지반조성ㆍ포장공사업충남계룡 2009­02­01윤병서32829충청남도 계룡시 엄사면 소라실길 47
68036804희성건설(주)철근ㆍ콘크리트공사업충남공주2004­10­06윤병서32829충청남도 계룡시 엄사면 소라실길 47
68046805희성건설(주)금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업충남공주2004­08­04윤병서32829충청남도 계룡시 엄사면 소라실길 47
68056806흰돌조경(주)조경식재ㆍ시설물공사업파주08­16­03노윤영31253충청남도 천안시 동남구 병천면 가전1길 161, 1층