Overview

Dataset statistics

Number of variables11
Number of observations103
Missing cells86
Missing cells (%)7.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.1 KiB
Average record size in memory90.3 B

Variable types

Text6
Categorical3
DateTime1
Numeric1

Dataset

Description충청남도 내 등록된 부동산개발업 업체현황에 대한 데이터로 부동산개발업 등록번호, 대표자 성명, 법인구분, 영업소소재지의 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=404&beforeMenuCd=DOM_000000201001001000&publicdatapk=15018881

Alerts

등록상태 has constant value ""Constant
자본금(천원) is highly overall correlated with 법인구분High correlation
법인구분 is highly overall correlated with 자본금(천원)High correlation
법인구분 is highly imbalanced (86.3%)Imbalance
처리상태 is highly imbalanced (76.3%)Imbalance
전화번호 has 2 (1.9%) missing valuesMissing
팩스번호 has 82 (79.6%) missing valuesMissing
영업소재지 has 2 (1.9%) missing valuesMissing
부동산개발업등록번호 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:15:32.929315
Analysis finished2024-01-09 22:15:33.677125
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct103
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size956.0 B
2024-01-10T07:15:33.833248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters824
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)100.0%

Sample

1st row충남080003
2nd row충남080007
3rd row충남080037
4th row충남080040
5th row충남080043
ValueCountFrequency (%)
충남080003 1
 
1.0%
충남190007 1
 
1.0%
충남190004 1
 
1.0%
충남190003 1
 
1.0%
충남180015 1
 
1.0%
충남180014 1
 
1.0%
충남180013 1
 
1.0%
충남180012 1
 
1.0%
충남180011 1
 
1.0%
충남180010 1
 
1.0%
Other values (93) 93
90.3%
2024-01-10T07:15:34.137314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 293
35.6%
1 134
16.3%
103
 
12.5%
103
 
12.5%
2 35
 
4.2%
8 28
 
3.4%
9 28
 
3.4%
7 22
 
2.7%
6 21
 
2.5%
5 21
 
2.5%
Other values (2) 36
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 618
75.0%
Other Letter 206
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 293
47.4%
1 134
21.7%
2 35
 
5.7%
8 28
 
4.5%
9 28
 
4.5%
7 22
 
3.6%
6 21
 
3.4%
5 21
 
3.4%
3 18
 
2.9%
4 18
 
2.9%
Other Letter
ValueCountFrequency (%)
103
50.0%
103
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 618
75.0%
Hangul 206
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 293
47.4%
1 134
21.7%
2 35
 
5.7%
8 28
 
4.5%
9 28
 
4.5%
7 22
 
3.6%
6 21
 
3.4%
5 21
 
3.4%
3 18
 
2.9%
4 18
 
2.9%
Hangul
ValueCountFrequency (%)
103
50.0%
103
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 618
75.0%
Hangul 206
 
25.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 293
47.4%
1 134
21.7%
2 35
 
5.7%
8 28
 
4.5%
9 28
 
4.5%
7 22
 
3.6%
6 21
 
3.4%
5 21
 
3.4%
3 18
 
2.9%
4 18
 
2.9%
Hangul
ValueCountFrequency (%)
103
50.0%
103
50.0%

상호
Text

UNIQUE 

Distinct103
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size956.0 B
2024-01-10T07:15:34.319876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length8.5631068
Min length3

Characters and Unicode

Total characters882
Distinct characters146
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)100.0%

Sample

1st row태산종합건설㈜
2nd row경남기업㈜
3rd row㈜동일토건
4th row일산종합건설㈜
5th row하이스종합건설㈜
ValueCountFrequency (%)
주식회사 22
 
17.5%
태산종합건설㈜ 1
 
0.8%
수호건설(주 1
 
0.8%
에스에스(주 1
 
0.8%
신성인프라 1
 
0.8%
현보건설산업(주 1
 
0.8%
주)에스씨종합건설 1
 
0.8%
비오케이건설 1
 
0.8%
태흥종합건설(주 1
 
0.8%
신지건설(주 1
 
0.8%
Other values (95) 95
75.4%
2024-01-10T07:15:34.611771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
10.3%
76
 
8.6%
( 66
 
7.5%
) 66
 
7.5%
64
 
7.3%
30
 
3.4%
28
 
3.2%
28
 
3.2%
24
 
2.7%
23
 
2.6%
Other values (136) 386
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 713
80.8%
Open Punctuation 66
 
7.5%
Close Punctuation 66
 
7.5%
Space Separator 23
 
2.6%
Other Symbol 12
 
1.4%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
12.8%
76
 
10.7%
64
 
9.0%
30
 
4.2%
28
 
3.9%
28
 
3.9%
24
 
3.4%
23
 
3.2%
18
 
2.5%
17
 
2.4%
Other values (131) 314
44.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 725
82.2%
Common 157
 
17.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
 
12.6%
76
 
10.5%
64
 
8.8%
30
 
4.1%
28
 
3.9%
28
 
3.9%
24
 
3.3%
23
 
3.2%
18
 
2.5%
17
 
2.3%
Other values (132) 326
45.0%
Common
ValueCountFrequency (%)
( 66
42.0%
) 66
42.0%
23
 
14.6%
. 2
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 713
80.8%
ASCII 157
 
17.8%
None 12
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
91
 
12.8%
76
 
10.7%
64
 
9.0%
30
 
4.2%
28
 
3.9%
28
 
3.9%
24
 
3.4%
23
 
3.2%
18
 
2.5%
17
 
2.4%
Other values (131) 314
44.0%
ASCII
ValueCountFrequency (%)
( 66
42.0%
) 66
42.0%
23
 
14.6%
. 2
 
1.3%
None
ValueCountFrequency (%)
12
100.0%
Distinct102
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size956.0 B
2024-01-10T07:15:34.810969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.4271845
Min length2

Characters and Unicode

Total characters353
Distinct characters118
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)98.1%

Sample

1st row김지찬
2nd row박석준
3rd row고동현
4th row박성배,박재현
5th row신동룡,정원욱
ValueCountFrequency (%)
김명자 2
 
1.9%
강기훈 1
 
1.0%
이정길 1
 
1.0%
박순희 1
 
1.0%
박경수 1
 
1.0%
김형조 1
 
1.0%
이규순 1
 
1.0%
엄기웅 1
 
1.0%
강신학 1
 
1.0%
박용성,홍윤표 1
 
1.0%
Other values (92) 92
89.3%
2024-01-10T07:15:35.104499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
5.4%
17
 
4.8%
11
 
3.1%
, 10
 
2.8%
9
 
2.5%
9
 
2.5%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (108) 249
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 343
97.2%
Other Punctuation 10
 
2.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
5.5%
17
 
5.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (107) 242
70.6%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 343
97.2%
Common 10
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
5.5%
17
 
5.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (107) 242
70.6%
Common
ValueCountFrequency (%)
, 10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 343
97.2%
ASCII 10
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
5.5%
17
 
5.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (107) 242
70.6%
ASCII
ValueCountFrequency (%)
, 10
100.0%

법인구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size956.0 B
일반법인
100 
특수법인
 
2
개인
 
1

Length

Max length4
Median length4
Mean length3.9805825
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row일반법인
2nd row일반법인
3rd row일반법인
4th row일반법인
5th row일반법인

Common Values

ValueCountFrequency (%)
일반법인 100
97.1%
특수법인 2
 
1.9%
개인 1
 
1.0%

Length

2024-01-10T07:15:35.447470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:15:35.528428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반법인 100
97.1%
특수법인 2
 
1.9%
개인 1
 
1.0%

전화번호
Text

MISSING 

Distinct101
Distinct (%)100.0%
Missing2
Missing (%)1.9%
Memory size956.0 B
2024-01-10T07:15:35.716339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.039604
Min length11

Characters and Unicode

Total characters1216
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)100.0%

Sample

1st row041-857-9696
2nd row02-2210-0273
3rd row02-2007-2000
4th row042-471-4567
5th row041-564-3740
ValueCountFrequency (%)
041-544-6337 1
 
1.0%
041-553-8181 1
 
1.0%
042-471-2701 1
 
1.0%
041-578-0987 1
 
1.0%
041-354-4800 1
 
1.0%
041-753-7330 1
 
1.0%
0507-1471-6200 1
 
1.0%
041-732-5965 1
 
1.0%
041-554-9377 1
 
1.0%
042-825-0807 1
 
1.0%
Other values (91) 91
90.1%
2024-01-10T07:15:36.025044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 202
16.6%
0 183
15.0%
4 162
13.3%
1 156
12.8%
5 119
9.8%
2 76
 
6.2%
3 73
 
6.0%
8 70
 
5.8%
7 68
 
5.6%
6 65
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1014
83.4%
Dash Punctuation 202
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 183
18.0%
4 162
16.0%
1 156
15.4%
5 119
11.7%
2 76
7.5%
3 73
 
7.2%
8 70
 
6.9%
7 68
 
6.7%
6 65
 
6.4%
9 42
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 202
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1216
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 202
16.6%
0 183
15.0%
4 162
13.3%
1 156
12.8%
5 119
9.8%
2 76
 
6.2%
3 73
 
6.0%
8 70
 
5.8%
7 68
 
5.6%
6 65
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1216
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 202
16.6%
0 183
15.0%
4 162
13.3%
1 156
12.8%
5 119
9.8%
2 76
 
6.2%
3 73
 
6.0%
8 70
 
5.8%
7 68
 
5.6%
6 65
 
5.3%

팩스번호
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing82
Missing (%)79.6%
Memory size956.0 B
2024-01-10T07:15:36.185585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.095238
Min length11

Characters and Unicode

Total characters254
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row02-2210-0274
2nd row041-564-3752
3rd row041-571-0987
4th row042-471-2706
5th row041-533-0486
ValueCountFrequency (%)
02-2210-0274 1
 
4.8%
02-786-0221 1
 
4.8%
041-546-6117 1
 
4.8%
041-545-8280 1
 
4.8%
070-4773-1509 1
 
4.8%
041-578-4108 1
 
4.8%
042-471-1411 1
 
4.8%
041-634-7491 1
 
4.8%
0303-3130-0314 1
 
4.8%
041-334-0059 1
 
4.8%
Other values (11) 11
52.4%
2024-01-10T07:15:36.450902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 42
16.5%
4 39
15.4%
0 37
14.6%
1 32
12.6%
5 19
7.5%
3 19
7.5%
7 18
7.1%
2 17
6.7%
8 14
 
5.5%
6 11
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 212
83.5%
Dash Punctuation 42
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 39
18.4%
0 37
17.5%
1 32
15.1%
5 19
9.0%
3 19
9.0%
7 18
8.5%
2 17
8.0%
8 14
 
6.6%
6 11
 
5.2%
9 6
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 254
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 42
16.5%
4 39
15.4%
0 37
14.6%
1 32
12.6%
5 19
7.5%
3 19
7.5%
7 18
7.1%
2 17
6.7%
8 14
 
5.5%
6 11
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 254
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 42
16.5%
4 39
15.4%
0 37
14.6%
1 32
12.6%
5 19
7.5%
3 19
7.5%
7 18
7.1%
2 17
6.7%
8 14
 
5.5%
6 11
 
4.3%
Distinct96
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size956.0 B
Minimum2008-01-17 00:00:00
Maximum2021-07-20 00:00:00
2024-01-10T07:15:36.562822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:15:36.664200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록상태
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size956.0 B
등록완료
103 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록완료
2nd row등록완료
3rd row등록완료
4th row등록완료
5th row등록완료

Common Values

ValueCountFrequency (%)
등록완료 103
100.0%

Length

2024-01-10T07:15:36.758057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:15:36.828866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
등록완료 103
100.0%

처리상태
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size956.0 B
정상
99 
전입
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 99
96.1%
전입 4
 
3.9%

Length

2024-01-10T07:15:36.900123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:15:36.969236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 99
96.1%
전입 4
 
3.9%

자본금(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct59
Distinct (%)57.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1983484.5
Minimum300000
Maximum37647260
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-01-10T07:15:37.055938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum300000
5-th percentile300000
Q1500000
median1000000
Q31751000
95-th percentile3685146.5
Maximum37647260
Range37347260
Interquartile range (IQR)1251000

Descriptive statistics

Standard deviation5007409.1
Coefficient of variation (CV)2.5245517
Kurtosis43.45208
Mean1983484.5
Median Absolute Deviation (MAD)550000
Skewness6.493901
Sum2.042989 × 108
Variance2.5074146 × 1013
MonotonicityNot monotonic
2024-01-10T07:15:37.170151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
500000 18
 
17.5%
300000 11
 
10.7%
1210000 6
 
5.8%
350000 3
 
2.9%
510000 3
 
2.9%
1560000 2
 
1.9%
1000000 2
 
1.9%
1220000 2
 
1.9%
950000 2
 
1.9%
400000 2
 
1.9%
Other values (49) 52
50.5%
ValueCountFrequency (%)
300000 11
10.7%
302000 1
 
1.0%
310000 1
 
1.0%
320000 1
 
1.0%
350000 3
 
2.9%
400000 2
 
1.9%
500000 18
17.5%
510000 3
 
2.9%
520000 2
 
1.9%
553000 1
 
1.0%
ValueCountFrequency (%)
37647260 1
1.0%
35047770 1
1.0%
6696247 1
1.0%
6500000 1
1.0%
4178400 1
1.0%
3700000 1
1.0%
3551465 1
1.0%
3550000 1
1.0%
3400000 1
1.0%
3300000 1
1.0%

영업소재지
Text

MISSING 

Distinct101
Distinct (%)100.0%
Missing2
Missing (%)1.9%
Memory size956.0 B
2024-01-10T07:15:37.405689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length27.613861
Min length18

Characters and Unicode

Total characters2789
Distinct characters191
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)100.0%

Sample

1st row충청남도 공주시 반포면 반포초교길 154
2nd row충청남도 아산시 청운로176번길 25, 2층
3rd row충청남도 천안시 서북구 두정로 106(두정동)
4th row충청남도 논산시 연산면 계백로 2514
5th row충청남도 천안시 서북구 성정두정로 71, 201호 (두정동, 하이스타운2차)
ValueCountFrequency (%)
충청남도 100
 
17.4%
천안시 42
 
7.3%
서북구 25
 
4.3%
동남구 17
 
3.0%
공주시 14
 
2.4%
아산시 9
 
1.6%
홍성군 7
 
1.2%
2층 6
 
1.0%
금산군 5
 
0.9%
당진시 5
 
0.9%
Other values (282) 345
60.0%
2024-01-10T07:15:37.748979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
474
 
17.0%
120
 
4.3%
1 119
 
4.3%
118
 
4.2%
104
 
3.7%
103
 
3.7%
83
 
3.0%
2 80
 
2.9%
, 75
 
2.7%
69
 
2.5%
Other values (181) 1444
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1621
58.1%
Decimal Number 494
 
17.7%
Space Separator 474
 
17.0%
Other Punctuation 75
 
2.7%
Close Punctuation 47
 
1.7%
Open Punctuation 47
 
1.7%
Dash Punctuation 28
 
1.0%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
7.4%
118
 
7.3%
104
 
6.4%
103
 
6.4%
83
 
5.1%
69
 
4.3%
66
 
4.1%
50
 
3.1%
47
 
2.9%
45
 
2.8%
Other values (163) 816
50.3%
Decimal Number
ValueCountFrequency (%)
1 119
24.1%
2 80
16.2%
3 65
13.2%
0 55
11.1%
4 40
 
8.1%
5 37
 
7.5%
6 33
 
6.7%
7 27
 
5.5%
8 21
 
4.3%
9 17
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
T 1
33.3%
B 1
33.3%
I 1
33.3%
Space Separator
ValueCountFrequency (%)
474
100.0%
Other Punctuation
ValueCountFrequency (%)
, 75
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1621
58.1%
Common 1165
41.8%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
7.4%
118
 
7.3%
104
 
6.4%
103
 
6.4%
83
 
5.1%
69
 
4.3%
66
 
4.1%
50
 
3.1%
47
 
2.9%
45
 
2.8%
Other values (163) 816
50.3%
Common
ValueCountFrequency (%)
474
40.7%
1 119
 
10.2%
2 80
 
6.9%
, 75
 
6.4%
3 65
 
5.6%
0 55
 
4.7%
) 47
 
4.0%
( 47
 
4.0%
4 40
 
3.4%
5 37
 
3.2%
Other values (5) 126
 
10.8%
Latin
ValueCountFrequency (%)
T 1
33.3%
B 1
33.3%
I 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1621
58.1%
ASCII 1168
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
474
40.6%
1 119
 
10.2%
2 80
 
6.8%
, 75
 
6.4%
3 65
 
5.6%
0 55
 
4.7%
) 47
 
4.0%
( 47
 
4.0%
4 40
 
3.4%
5 37
 
3.2%
Other values (8) 129
 
11.0%
Hangul
ValueCountFrequency (%)
120
 
7.4%
118
 
7.3%
104
 
6.4%
103
 
6.4%
83
 
5.1%
69
 
4.3%
66
 
4.1%
50
 
3.1%
47
 
2.9%
45
 
2.8%
Other values (163) 816
50.3%

Interactions

2024-01-10T07:15:33.337430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:15:37.829813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인구분팩스번호등록일자처리상태자본금(천원)
법인구분1.0001.0001.0000.2960.889
팩스번호1.0001.0001.000NaN1.000
등록일자1.0001.0001.0001.0001.000
처리상태0.296NaN1.0001.0000.168
자본금(천원)0.8891.0001.0000.1681.000
2024-01-10T07:15:37.907917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리상태법인구분
처리상태1.0000.475
법인구분0.4751.000
2024-01-10T07:15:37.971877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자본금(천원)법인구분처리상태
자본금(천원)1.0000.5970.226
법인구분0.5971.0000.475
처리상태0.2260.4751.000

Missing values

2024-01-10T07:15:33.437825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:15:33.551982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:15:33.635951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

부동산개발업등록번호상호대표자법인구분전화번호팩스번호등록일자등록상태처리상태자본금(천원)영업소재지
0충남080003태산종합건설㈜김지찬일반법인041-857-9696<NA>2008-01-17등록완료정상3200000충청남도 공주시 반포면 반포초교길 154
1충남080007경남기업㈜박석준일반법인02-2210-027302-2210-02742008-03-05등록완료정상35047770충청남도 아산시 청운로176번길 25, 2층
2충남080037㈜동일토건고동현일반법인02-2007-2000<NA>2008-05-29등록완료정상4178400충청남도 천안시 서북구 두정로 106(두정동)
3충남080040일산종합건설㈜박성배,박재현일반법인042-471-4567<NA>2008-06-09등록완료정상3700000충청남도 논산시 연산면 계백로 2514
4충남080043하이스종합건설㈜신동룡,정원욱일반법인041-564-3740041-564-37522008-06-13등록완료정상1762000충청남도 천안시 서북구 성정두정로 71, 201호 (두정동, 하이스타운2차)
5충남080046해유건설(주)한세우일반법인041-543-1892<NA>2008-06-24등록완료정상2000000충청남도 아산시 음봉면 연암산로 71-16, B동
6충남080048한성건설㈜송용상일반법인041-556-4611<NA>2008-07-03등록완료정상3000000충청남도 천안시 서북구 두정역서3길 43(두정동, 한성빌딩 3층)
7충남090001서영건설(주)오세명일반법인041-664-1783<NA>2009-05-08등록완료정상1310000충청남도 예산군 삽교읍 애향13길 10, 1층
8충남090002일조산업개발㈜이태희일반법인041-572-0114<NA>2009-06-09등록완료정상2150000충청남도 천안시 동남구 봉정로 14(봉명동, 봉명빌딩 3층)
9충남090004㈜정보종합건설박경식일반법인041-853-0494<NA>2009-06-23등록완료정상1210000충청남도 공주시 관골1길 38-4
부동산개발업등록번호상호대표자법인구분전화번호팩스번호등록일자등록상태처리상태자본금(천원)영업소재지
93충남200006유티종합건설 주식회사황순성일반법인041-545-8284041-545-82802020-07-03등록완료정상500000충청남도 아산시 탕정면 선문로221번길 11-4, 106호
94충남200007주식회사 다스종합건축사사무소이문규일반법인041-546-6119041-546-61172020-07-27등록완료정상300000충청남도 아산시 온천대로 1502-3, 3층(온천동)
95충남200009주식회사 홍익이노빌드김병준일반법인041-559-5395<NA>2020-09-10등록완료정상300000충청남도 천안시 동남구 천안대로 802
96충남200010서창산업개발 주식회사정경숙일반법인041-741-9161<NA>2020-12-01등록완료정상350000충청남도 논산시 연무읍 연무로 152, 2층
97충남210001주식회사 현성종합건설장만종일반법인041-668-6330<NA>2021-02-10등록완료정상1210000충청남도 서산시 동서1로 263, 101호(예천동)
98충남210002주식회사 에스엘디이창무일반법인041-554-1104<NA>2021-02-10등록완료정상1000000충청남도 천안시 동남구 삼룡4길 58-6(삼룡동)
99충남210003주식회사 서인산업개발맹건호일반법인041-578-2225<NA>2021-03-26등록완료정상300000충청남도 천안시 동남구 통정1로 20-9, 2층(신방동)
100충남210004주식회사 미래박준수일반법인042-476-1660<NA>2021-05-06등록완료정상302000충청남도 공주시 월송동현로 143, 101호(월송동)
101충남210005주식회사 해오름종합건설윤지원일반법인041-584-9992<NA>2021-05-24등록완료정상700000충청남도 천안시 서북구 성환읍 성환1로 190, 3층
102충남210006주식회사 다경종합건설경태현일반법인041-855-7626042-825-25632021-07-20등록완료정상1450000충청남도 공주시 신금1길 11, 2층(신관동)