Overview

Dataset statistics

Number of variables7
Number of observations6999
Missing cells151
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory389.7 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Text5
Categorical1

Dataset

Description충청남도에 등록되어있는 전문건설업체 데이터로 업체명,대표자,업종,지역,주소,우편번호 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=393&beforeMenuCd=DOM_000000201001001000&publicdatapk=15044854

Alerts

우편번호 has 103 (1.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:39:17.372218
Analysis finished2024-01-09 22:39:18.535630
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct6999
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3500
Minimum1
Maximum6999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size61.6 KiB
2024-01-10T07:39:18.593883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile350.9
Q11750.5
median3500
Q35249.5
95-th percentile6649.1
Maximum6999
Range6998
Interquartile range (IQR)3499

Descriptive statistics

Standard deviation2020.5816
Coefficient of variation (CV)0.57730903
Kurtosis-1.2
Mean3500
Median Absolute Deviation (MAD)1750
Skewness0
Sum24496500
Variance4082750
MonotonicityStrictly increasing
2024-01-10T07:39:18.712082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
4664 1
 
< 0.1%
4675 1
 
< 0.1%
4674 1
 
< 0.1%
4673 1
 
< 0.1%
4672 1
 
< 0.1%
4671 1
 
< 0.1%
4670 1
 
< 0.1%
4669 1
 
< 0.1%
4668 1
 
< 0.1%
Other values (6989) 6989
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
6999 1
< 0.1%
6998 1
< 0.1%
6997 1
< 0.1%
6996 1
< 0.1%
6995 1
< 0.1%
6994 1
< 0.1%
6993 1
< 0.1%
6992 1
< 0.1%
6991 1
< 0.1%
6990 1
< 0.1%

상호
Text

Distinct4454
Distinct (%)63.6%
Missing0
Missing (%)0.0%
Memory size54.8 KiB
2024-01-10T07:39:18.976269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length7.4239177
Min length2

Characters and Unicode

Total characters51960
Distinct characters531
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2900 ?
Unique (%)41.4%

Sample

1st row평강설비
2nd row주식회사청춘
3rd row주식회사에이치앤케이
4th row(주)우원
5th row주식회사윤아트
ValueCountFrequency (%)
주)대명건설 11
 
0.2%
삼호개발(주 10
 
0.1%
서원건설(주 10
 
0.1%
세종건설(주 10
 
0.1%
대한건설(주 10
 
0.1%
현대스틸산업(주 9
 
0.1%
일진건설(주 9
 
0.1%
현우건설(주 9
 
0.1%
에스에이치건설(주 9
 
0.1%
조은건설(주 8
 
0.1%
Other values (4445) 6909
98.6%
2024-01-10T07:39:19.336637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5854
 
11.3%
) 4847
 
9.3%
( 4846
 
9.3%
3348
 
6.4%
3278
 
6.3%
1303
 
2.5%
1127
 
2.2%
1102
 
2.1%
1094
 
2.1%
942
 
1.8%
Other values (521) 24219
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42020
80.9%
Close Punctuation 4847
 
9.3%
Open Punctuation 4846
 
9.3%
Uppercase Letter 171
 
0.3%
Other Punctuation 31
 
0.1%
Decimal Number 24
 
< 0.1%
Other Symbol 9
 
< 0.1%
Lowercase Letter 7
 
< 0.1%
Space Separator 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5854
 
13.9%
3348
 
8.0%
3278
 
7.8%
1303
 
3.1%
1127
 
2.7%
1102
 
2.6%
1094
 
2.6%
942
 
2.2%
801
 
1.9%
682
 
1.6%
Other values (479) 22489
53.5%
Uppercase Letter
ValueCountFrequency (%)
G 44
25.7%
E 42
24.6%
N 38
22.2%
S 10
 
5.8%
L 9
 
5.3%
K 6
 
3.5%
P 4
 
2.3%
C 3
 
1.8%
R 3
 
1.8%
A 3
 
1.8%
Other values (8) 9
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 9
37.5%
3 5
20.8%
5 3
 
12.5%
0 2
 
8.3%
4 2
 
8.3%
2 1
 
4.2%
6 1
 
4.2%
9 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 21
67.7%
& 4
 
12.9%
2
 
6.5%
, 2
 
6.5%
/ 1
 
3.2%
1
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
d 1
14.3%
t 1
14.3%
o 1
14.3%
g 1
14.3%
n 1
14.3%
Close Punctuation
ValueCountFrequency (%)
) 4847
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4846
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42029
80.9%
Common 9753
 
18.8%
Latin 178
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5854
 
13.9%
3348
 
8.0%
3278
 
7.8%
1303
 
3.1%
1127
 
2.7%
1102
 
2.6%
1094
 
2.6%
942
 
2.2%
801
 
1.9%
682
 
1.6%
Other values (480) 22498
53.5%
Latin
ValueCountFrequency (%)
G 44
24.7%
E 42
23.6%
N 38
21.3%
S 10
 
5.6%
L 9
 
5.1%
K 6
 
3.4%
P 4
 
2.2%
C 3
 
1.7%
R 3
 
1.7%
A 3
 
1.7%
Other values (14) 16
 
9.0%
Common
ValueCountFrequency (%)
) 4847
49.7%
( 4846
49.7%
. 21
 
0.2%
1 9
 
0.1%
5
 
0.1%
3 5
 
0.1%
& 4
 
< 0.1%
5 3
 
< 0.1%
2
 
< 0.1%
, 2
 
< 0.1%
Other values (7) 9
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42020
80.9%
ASCII 9928
 
19.1%
None 12
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5854
 
13.9%
3348
 
8.0%
3278
 
7.8%
1303
 
3.1%
1127
 
2.7%
1102
 
2.6%
1094
 
2.6%
942
 
2.2%
801
 
1.9%
682
 
1.6%
Other values (479) 22489
53.5%
ASCII
ValueCountFrequency (%)
) 4847
48.8%
( 4846
48.8%
G 44
 
0.4%
E 42
 
0.4%
N 38
 
0.4%
. 21
 
0.2%
S 10
 
0.1%
1 9
 
0.1%
L 9
 
0.1%
K 6
 
0.1%
Other values (29) 56
 
0.6%
None
ValueCountFrequency (%)
9
75.0%
2
 
16.7%
1
 
8.3%

업종
Categorical

Distinct15
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size54.8 KiB
철근ㆍ콘크리트공사업
1278 
가스난방공사업
990 
지반조성ㆍ포장공사업
855 
도장ㆍ습식ㆍ방수ㆍ석공사업
695 
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업
596 
Other values (10)
2585 

Length

Max length17
Median length13
Mean length10.248321
Min length7

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row가스난방공사업
2nd row지반조성ㆍ포장공사업
3rd row구조물해체ㆍ비계공사업
4th row실내건축공사업
5th row실내건축공사업

Common Values

ValueCountFrequency (%)
철근ㆍ콘크리트공사업 1278
18.3%
가스난방공사업 990
14.1%
지반조성ㆍ포장공사업 855
12.2%
도장ㆍ습식ㆍ방수ㆍ석공사업 695
9.9%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 596
8.5%
기계가스설비공사업 559
8.0%
상ㆍ하수도설비공사업 519
7.4%
조경식재ㆍ시설물공사업 498
 
7.1%
시설물유지관리업 347
 
5.0%
실내건축공사업 302
 
4.3%
Other values (5) 360
 
5.1%

Length

2024-01-10T07:39:19.451858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
철근ㆍ콘크리트공사업 1278
18.3%
가스난방공사업 990
14.1%
지반조성ㆍ포장공사업 855
12.2%
도장ㆍ습식ㆍ방수ㆍ석공사업 695
9.9%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 596
8.5%
기계가스설비공사업 559
8.0%
상ㆍ하수도설비공사업 519
7.4%
조경식재ㆍ시설물공사업 498
 
7.1%
시설물유지관리업 347
 
5.0%
실내건축공사업 302
 
4.3%
Other values (5) 360
 
5.1%
Distinct6978
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size54.8 KiB
2024-01-10T07:39:19.742313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length13.285184
Min length2

Characters and Unicode

Total characters92983
Distinct characters147
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6957 ?
Unique (%)99.4%

Sample

1st row충남서산2023­14­02
2nd row충남천안2023­가­09
3rd row충남서산2023­7­01
4th row충남금산 2023­04­1
5th row충남천안2023­나­15
ValueCountFrequency (%)
충남 267
 
3.3%
청양 155
 
1.9%
충남홍성 86
 
1.1%
충남금산 60
 
0.7%
충남태안 60
 
0.7%
충남부여 40
 
0.5%
충남논산 31
 
0.4%
태안 28
 
0.3%
충남공주 25
 
0.3%
논산 23
 
0.3%
Other values (6932) 7334
90.4%
2024-01-10T07:39:20.154076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 17218
18.5%
­ 14333
15.4%
2 11813
12.7%
1 9281
10.0%
5737
 
6.2%
5701
 
6.1%
9 2780
 
3.0%
3 2148
 
2.3%
2077
 
2.2%
4 1968
 
2.1%
Other values (137) 19927
21.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 51269
55.1%
Other Letter 26234
28.2%
Format 14333
 
15.4%
Space Separator 1110
 
1.2%
Close Punctuation 19
 
< 0.1%
Open Punctuation 17
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5737
21.9%
5701
21.7%
2077
 
7.9%
1313
 
5.0%
1264
 
4.8%
734
 
2.8%
623
 
2.4%
570
 
2.2%
470
 
1.8%
465
 
1.8%
Other values (122) 7280
27.8%
Decimal Number
ValueCountFrequency (%)
0 17218
33.6%
2 11813
23.0%
1 9281
18.1%
9 2780
 
5.4%
3 2148
 
4.2%
4 1968
 
3.8%
7 1695
 
3.3%
6 1538
 
3.0%
8 1514
 
3.0%
5 1314
 
2.6%
Format
ValueCountFrequency (%)
­ 14333
100.0%
Space Separator
ValueCountFrequency (%)
1110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 66749
71.8%
Hangul 26234
 
28.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5737
21.9%
5701
21.7%
2077
 
7.9%
1313
 
5.0%
1264
 
4.8%
734
 
2.8%
623
 
2.4%
570
 
2.2%
470
 
1.8%
465
 
1.8%
Other values (122) 7280
27.8%
Common
ValueCountFrequency (%)
0 17218
25.8%
­ 14333
21.5%
2 11813
17.7%
1 9281
13.9%
9 2780
 
4.2%
3 2148
 
3.2%
4 1968
 
2.9%
7 1695
 
2.5%
6 1538
 
2.3%
8 1514
 
2.3%
Other values (5) 2461
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52416
56.4%
Hangul 26234
28.2%
None 14333
 
15.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17218
32.8%
2 11813
22.5%
1 9281
17.7%
9 2780
 
5.3%
3 2148
 
4.1%
4 1968
 
3.8%
7 1695
 
3.2%
6 1538
 
2.9%
8 1514
 
2.9%
5 1314
 
2.5%
Other values (4) 1147
 
2.2%
None
ValueCountFrequency (%)
­ 14333
100.0%
Hangul
ValueCountFrequency (%)
5737
21.9%
5701
21.7%
2077
 
7.9%
1313
 
5.0%
1264
 
4.8%
734
 
2.8%
623
 
2.4%
570
 
2.2%
470
 
1.8%
465
 
1.8%
Other values (122) 7280
27.8%
Distinct4277
Distinct (%)61.1%
Missing0
Missing (%)0.0%
Memory size54.8 KiB
2024-01-10T07:39:20.428750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.0811545
Min length2

Characters and Unicode

Total characters21565
Distinct characters274
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2615 ?
Unique (%)37.4%

Sample

1st row이을우
2nd row윤기상
3rd row박시찬
4th row전옥순
5th row윤현주
ValueCountFrequency (%)
김민수 11
 
0.2%
김성일 9
 
0.1%
홍애라 9
 
0.1%
이청휴 9
 
0.1%
김태영 9
 
0.1%
이영희 9
 
0.1%
심재범 9
 
0.1%
최영숙 8
 
0.1%
김영호 8
 
0.1%
이홍영 8
 
0.1%
Other values (4267) 6910
98.7%
2024-01-10T07:39:20.854837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1372
 
6.4%
1262
 
5.9%
664
 
3.1%
631
 
2.9%
569
 
2.6%
403
 
1.9%
403
 
1.9%
389
 
1.8%
358
 
1.7%
354
 
1.6%
Other values (264) 15160
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21417
99.3%
Other Punctuation 148
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1372
 
6.4%
1262
 
5.9%
664
 
3.1%
631
 
2.9%
569
 
2.7%
403
 
1.9%
403
 
1.9%
389
 
1.8%
358
 
1.7%
354
 
1.7%
Other values (262) 15012
70.1%
Other Punctuation
ValueCountFrequency (%)
, 145
98.0%
3
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21417
99.3%
Common 148
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1372
 
6.4%
1262
 
5.9%
664
 
3.1%
631
 
2.9%
569
 
2.7%
403
 
1.9%
403
 
1.9%
389
 
1.8%
358
 
1.7%
354
 
1.7%
Other values (262) 15012
70.1%
Common
ValueCountFrequency (%)
, 145
98.0%
3
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21417
99.3%
ASCII 145
 
0.7%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1372
 
6.4%
1262
 
5.9%
664
 
3.1%
631
 
2.9%
569
 
2.7%
403
 
1.9%
403
 
1.9%
389
 
1.8%
358
 
1.7%
354
 
1.7%
Other values (262) 15012
70.1%
ASCII
ValueCountFrequency (%)
, 145
100.0%
None
ValueCountFrequency (%)
3
100.0%

우편번호
Text

MISSING 

Distinct1131
Distinct (%)16.4%
Missing103
Missing (%)1.5%
Memory size54.8 KiB
2024-01-10T07:39:21.136925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.00058
Min length5

Characters and Unicode

Total characters34484
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)3.3%

Sample

1st row32004
2nd row31068
3rd row31906
4th row32711
5th row31068
ValueCountFrequency (%)
32144 78
 
1.1%
31154 40
 
0.6%
32249 39
 
0.6%
32226 34
 
0.5%
32145 34
 
0.5%
32143 33
 
0.5%
32423 33
 
0.5%
33303 32
 
0.5%
33166 31
 
0.4%
32010 31
 
0.4%
Other values (1121) 6511
94.4%
2024-01-10T07:39:21.532364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 10145
29.4%
1 5548
16.1%
2 4992
14.5%
4 2903
 
8.4%
0 2149
 
6.2%
5 2050
 
5.9%
7 1949
 
5.7%
9 1877
 
5.4%
6 1645
 
4.8%
8 1224
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34482
> 99.9%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 10145
29.4%
1 5548
16.1%
2 4992
14.5%
4 2903
 
8.4%
0 2149
 
6.2%
5 2050
 
5.9%
7 1949
 
5.7%
9 1877
 
5.4%
6 1645
 
4.8%
8 1224
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34484
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 10145
29.4%
1 5548
16.1%
2 4992
14.5%
4 2903
 
8.4%
0 2149
 
6.2%
5 2050
 
5.9%
7 1949
 
5.7%
9 1877
 
5.4%
6 1645
 
4.8%
8 1224
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34484
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 10145
29.4%
1 5548
16.1%
2 4992
14.5%
4 2903
 
8.4%
0 2149
 
6.2%
5 2050
 
5.9%
7 1949
 
5.7%
9 1877
 
5.4%
6 1645
 
4.8%
8 1224
 
3.5%
Distinct4411
Distinct (%)63.5%
Missing48
Missing (%)0.7%
Memory size54.8 KiB
2024-01-10T07:39:21.834426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length51
Mean length24.748238
Min length11

Characters and Unicode

Total characters172025
Distinct characters466
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2870 ?
Unique (%)41.3%

Sample

1st row충청남도 서산시 동서1로 127 103호 (석남동)
2nd row충청남도 천안시 동남구 화령길 62 202호 (원성동)
3rd row충청남도 서산시 대산읍 명지1로 140 101호
4th row충청남도 금산군 추부면 서대산로 215
5th row충청남도 천안시 동남구 태조산길 37 (유량동)
ValueCountFrequency (%)
충청남도 6799
 
17.5%
천안시 1149
 
2.9%
아산시 695
 
1.8%
동남구 632
 
1.6%
2층 566
 
1.5%
보령시 559
 
1.4%
당진시 556
 
1.4%
논산시 547
 
1.4%
서산시 541
 
1.4%
공주시 526
 
1.4%
Other values (4604) 26386
67.7%
2024-01-10T07:39:22.269889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32005
 
18.6%
7944
 
4.6%
7497
 
4.4%
7218
 
4.2%
6891
 
4.0%
1 6886
 
4.0%
4879
 
2.8%
2 4854
 
2.8%
4716
 
2.7%
4140
 
2.4%
Other values (456) 84995
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 102951
59.8%
Space Separator 32005
 
18.6%
Decimal Number 28758
 
16.7%
Close Punctuation 2684
 
1.6%
Open Punctuation 2682
 
1.6%
Dash Punctuation 2156
 
1.3%
Other Punctuation 756
 
0.4%
Uppercase Letter 30
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7944
 
7.7%
7497
 
7.3%
7218
 
7.0%
6891
 
6.7%
4879
 
4.7%
4716
 
4.6%
4140
 
4.0%
4012
 
3.9%
3056
 
3.0%
2403
 
2.3%
Other values (424) 50195
48.8%
Decimal Number
ValueCountFrequency (%)
1 6886
23.9%
2 4854
16.9%
3 3205
11.1%
0 2624
 
9.1%
4 2432
 
8.5%
5 2056
 
7.1%
6 1992
 
6.9%
7 1682
 
5.8%
8 1546
 
5.4%
9 1481
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 11
36.7%
A 8
26.7%
D 3
 
10.0%
C 2
 
6.7%
G 1
 
3.3%
N 1
 
3.3%
E 1
 
3.3%
M 1
 
3.3%
S 1
 
3.3%
F 1
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 513
67.9%
220
29.1%
. 21
 
2.8%
/ 1
 
0.1%
@ 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
g 1
50.0%
h 1
50.0%
Space Separator
ValueCountFrequency (%)
32005
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2684
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2682
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2156
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 102951
59.8%
Common 69042
40.1%
Latin 32
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7944
 
7.7%
7497
 
7.3%
7218
 
7.0%
6891
 
6.7%
4879
 
4.7%
4716
 
4.6%
4140
 
4.0%
4012
 
3.9%
3056
 
3.0%
2403
 
2.3%
Other values (424) 50195
48.8%
Common
ValueCountFrequency (%)
32005
46.4%
1 6886
 
10.0%
2 4854
 
7.0%
3 3205
 
4.6%
) 2684
 
3.9%
( 2682
 
3.9%
0 2624
 
3.8%
4 2432
 
3.5%
- 2156
 
3.1%
5 2056
 
3.0%
Other values (10) 7458
 
10.8%
Latin
ValueCountFrequency (%)
B 11
34.4%
A 8
25.0%
D 3
 
9.4%
C 2
 
6.2%
G 1
 
3.1%
N 1
 
3.1%
E 1
 
3.1%
M 1
 
3.1%
g 1
 
3.1%
h 1
 
3.1%
Other values (2) 2
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 102951
59.8%
ASCII 68854
40.0%
None 220
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32005
46.5%
1 6886
 
10.0%
2 4854
 
7.0%
3 3205
 
4.7%
) 2684
 
3.9%
( 2682
 
3.9%
0 2624
 
3.8%
4 2432
 
3.5%
- 2156
 
3.1%
5 2056
 
3.0%
Other values (21) 7270
 
10.6%
Hangul
ValueCountFrequency (%)
7944
 
7.7%
7497
 
7.3%
7218
 
7.0%
6891
 
6.7%
4879
 
4.7%
4716
 
4.6%
4140
 
4.0%
4012
 
3.9%
3056
 
3.0%
2403
 
2.3%
Other values (424) 50195
48.8%
None
ValueCountFrequency (%)
220
100.0%

Interactions

2024-01-10T07:39:18.194520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:39:22.350177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.316
업종0.3161.000
2024-01-10T07:39:22.413298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.123
업종0.1231.000

Missing values

2024-01-10T07:39:18.306071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:39:18.409998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:39:18.492969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호업종등록번호대표자우편번호영업소재지
01평강설비가스난방공사업충남서산2023­14­02이을우32004충청남도 서산시 동서1로 127 103호 (석남동)
12주식회사청춘지반조성ㆍ포장공사업충남천안2023­가­09윤기상31068충청남도 천안시 동남구 화령길 62 202호 (원성동)
23주식회사에이치앤케이구조물해체ㆍ비계공사업충남서산2023­7­01박시찬31906충청남도 서산시 대산읍 명지1로 140 101호
34(주)우원실내건축공사업충남금산 2023­04­1전옥순32711충청남도 금산군 추부면 서대산로 215
45주식회사윤아트실내건축공사업충남천안2023­나­15윤현주31068충청남도 천안시 동남구 태조산길 37 (유량동)
56주식회사에이치앤케이기계가스설비공사업충남서산2023­13­01박시찬31906충청남도 서산시 대산읍 명지1로 140 101호
67(주)용두건설철근ㆍ콘크리트공사업충남당진2023­바­05강소라31785충청남도 당진시 당진시장길 27-34 202호 (채운동)
78성수건설(주)실내건축공사업충남보령2023­02­04조성수33475충청남도 보령시 흥곡천변로 21 202호 (명천동)
89(주)선일피앤에스금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업충남금산 2023­05­2송석천32718충청남도 금산군 군북면 군북로 1087
910(주)이젠코리아가스난방공사업충남홍성2023­하­02유은숙32256충청남도 홍성군 홍북읍 청사로 146 센텀시티 409호
연번상호업종등록번호대표자우편번호영업소재지
69896990(주)성일공업금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업충남공주78­8­06김선영32621충청남도 공주시 반포면 정광터1길 26
69906991도현산업(주)금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업아산78­8­013강춘성31562충청남도 아산시 온주길26번길 7-18 도현산업 (읍내동)
69916992한국종합기계기술(주)상ㆍ하수도설비공사업서울78­13­50박종춘31903충청남도 서산시 대산읍 대죽1로 173
69926993한국종합기계기술(주)기계가스설비공사업서울78­12­114박종춘31903충청남도 서산시 대산읍 대죽1로 173
69936994씨케이종합건설(주)금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업충북청주제78­08­01호김찬기33001충청남도 논산시 은진면 매죽헌로 37 2
69946995(주)파라텍기계가스설비공사업경기12­8박선기32010서산시 수석산업로 51
69956996삼호개발(주)지반조성ㆍ포장공사업서울02­18심재범31803충청남도 당진시 면천면 면천로 183
69966997(자)연합공사금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업충남서산1976­08­11김재선32010충청남도 서산시 음암면 동암마을길 253
69976998삼호개발(주)철근ㆍ콘크리트공사업서울10­19심재범31803충청남도 당진시 면천면 면천로 183
69986999현대스틸산업(주)철강구조물공사업서울68­23­1이청휴31044충청남도 천안시 서북구 성거읍 천흥8길 18-9