Overview

Dataset statistics

Number of variables11
Number of observations96
Missing cells95
Missing cells (%)9.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.5 KiB
Average record size in memory90.3 B

Variable types

Text6
Categorical3
DateTime1
Numeric1

Dataset

Description충청남도 내 등록된 부동산개발업 업체현황에 대한 데이터로 부동산개발업 등록번호, 대표자 성명, 법인구분, 영업소소재지의 정보를 제공합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15018881/fileData.do

Alerts

등록상태 has constant value ""Constant
자본금(천원) is highly overall correlated with 법인구분High correlation
법인구분 is highly overall correlated with 자본금(천원) and 1 other fieldsHigh correlation
처리상태 is highly overall correlated with 법인구분High correlation
법인구분 is highly imbalanced (85.5%)Imbalance
처리상태 is highly imbalanced (85.4%)Imbalance
전화번호 has 18 (18.8%) missing valuesMissing
팩스번호 has 77 (80.2%) missing valuesMissing
부동산개발업등록번호 has unique valuesUnique
상호 has unique valuesUnique
영업소재지 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:11:33.199921
Analysis finished2024-03-14 11:11:35.596816
Duration2.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct96
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size896.0 B
2024-03-14T20:11:36.544361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9791667
Min length6

Characters and Unicode

Total characters766
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)100.0%

Sample

1st row220012
2nd row충남080003
3rd row충남080007
4th row충남080037
5th row충남080040
ValueCountFrequency (%)
220012 1
 
1.0%
충남080003 1
 
1.0%
충남210001 1
 
1.0%
충남200006 1
 
1.0%
충남200005 1
 
1.0%
충남200003 1
 
1.0%
충남200002 1
 
1.0%
충남190016 1
 
1.0%
충남190015 1
 
1.0%
충남190014 1
 
1.0%
Other values (86) 86
89.6%
2024-03-14T20:11:37.880169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 276
36.0%
1 101
 
13.2%
95
 
12.4%
95
 
12.4%
2 60
 
7.8%
3 26
 
3.4%
8 25
 
3.3%
9 21
 
2.7%
7 19
 
2.5%
5 18
 
2.3%
Other values (2) 30
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 576
75.2%
Other Letter 190
 
24.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 276
47.9%
1 101
 
17.5%
2 60
 
10.4%
3 26
 
4.5%
8 25
 
4.3%
9 21
 
3.6%
7 19
 
3.3%
5 18
 
3.1%
4 16
 
2.8%
6 14
 
2.4%
Other Letter
ValueCountFrequency (%)
95
50.0%
95
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 576
75.2%
Hangul 190
 
24.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 276
47.9%
1 101
 
17.5%
2 60
 
10.4%
3 26
 
4.5%
8 25
 
4.3%
9 21
 
3.6%
7 19
 
3.3%
5 18
 
3.1%
4 16
 
2.8%
6 14
 
2.4%
Hangul
ValueCountFrequency (%)
95
50.0%
95
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 576
75.2%
Hangul 190
 
24.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 276
47.9%
1 101
 
17.5%
2 60
 
10.4%
3 26
 
4.5%
8 25
 
4.3%
9 21
 
3.6%
7 19
 
3.3%
5 18
 
3.1%
4 16
 
2.8%
6 14
 
2.4%
Hangul
ValueCountFrequency (%)
95
50.0%
95
50.0%

상호
Text

UNIQUE 

Distinct96
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size896.0 B
2024-03-14T20:11:38.705010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length8.75
Min length3

Characters and Unicode

Total characters840
Distinct characters152
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)100.0%

Sample

1st row아이엠비개발 주식회사
2nd row태산종합건설㈜
3rd row경남기업㈜
4th row㈜동일토건
5th row일산종합건설㈜
ValueCountFrequency (%)
주식회사 21
 
17.8%
서올건설(주 1
 
0.8%
현성종합건설 1
 
0.8%
유티종합건설 1
 
0.8%
센텀월드 1
 
0.8%
주)비제이글로벌 1
 
0.8%
광천종합건설(주 1
 
0.8%
지혜산업개발(주 1
 
0.8%
영화산업개발(주 1
 
0.8%
주)신동양건설 1
 
0.8%
Other values (88) 88
74.6%
2024-03-14T20:11:39.961879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
 
10.0%
) 62
 
7.4%
62
 
7.4%
( 62
 
7.4%
53
 
6.3%
29
 
3.5%
28
 
3.3%
27
 
3.2%
22
 
2.6%
21
 
2.5%
Other values (142) 390
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 681
81.1%
Close Punctuation 62
 
7.4%
Open Punctuation 62
 
7.4%
Space Separator 22
 
2.6%
Other Symbol 11
 
1.3%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
12.3%
62
 
9.1%
53
 
7.8%
29
 
4.3%
28
 
4.1%
27
 
4.0%
21
 
3.1%
21
 
3.1%
17
 
2.5%
16
 
2.3%
Other values (137) 323
47.4%
Close Punctuation
ValueCountFrequency (%)
) 62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 62
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 692
82.4%
Common 148
 
17.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
12.1%
62
 
9.0%
53
 
7.7%
29
 
4.2%
28
 
4.0%
27
 
3.9%
21
 
3.0%
21
 
3.0%
17
 
2.5%
16
 
2.3%
Other values (138) 334
48.3%
Common
ValueCountFrequency (%)
) 62
41.9%
( 62
41.9%
22
 
14.9%
. 2
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 681
81.1%
ASCII 148
 
17.6%
None 11
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
84
 
12.3%
62
 
9.1%
53
 
7.8%
29
 
4.3%
28
 
4.1%
27
 
4.0%
21
 
3.1%
21
 
3.1%
17
 
2.5%
16
 
2.3%
Other values (137) 323
47.4%
ASCII
ValueCountFrequency (%)
) 62
41.9%
( 62
41.9%
22
 
14.9%
. 2
 
1.4%
None
ValueCountFrequency (%)
11
100.0%
Distinct95
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size896.0 B
2024-03-14T20:11:40.965413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.5625
Min length3

Characters and Unicode

Total characters342
Distinct characters111
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)97.9%

Sample

1st row김율,김만식
2nd row김지찬
3rd row조유선
4th row고동현
5th row박성배,박재현
ValueCountFrequency (%)
김명자 2
 
2.1%
이영찬,김주환 1
 
1.0%
이도원,한윤수 1
 
1.0%
장만종 1
 
1.0%
황순성 1
 
1.0%
방종혁 1
 
1.0%
조동배 1
 
1.0%
조왕연 1
 
1.0%
강은호 1
 
1.0%
최종은 1
 
1.0%
Other values (85) 85
88.5%
2024-03-14T20:11:42.195285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
4.7%
16
 
4.7%
, 14
 
4.1%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
Other values (101) 242
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 328
95.9%
Other Punctuation 14
 
4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
4.9%
16
 
4.9%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (100) 235
71.6%
Other Punctuation
ValueCountFrequency (%)
, 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 328
95.9%
Common 14
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
4.9%
16
 
4.9%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (100) 235
71.6%
Common
ValueCountFrequency (%)
, 14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 328
95.9%
ASCII 14
 
4.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
4.9%
16
 
4.9%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (100) 235
71.6%
ASCII
ValueCountFrequency (%)
, 14
100.0%

법인구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size896.0 B
일반법인
93 
특수법인
 
2
개인
 
1

Length

Max length4
Median length4
Mean length3.9791667
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row일반법인
2nd row일반법인
3rd row일반법인
4th row일반법인
5th row일반법인

Common Values

ValueCountFrequency (%)
일반법인 93
96.9%
특수법인 2
 
2.1%
개인 1
 
1.0%

Length

2024-03-14T20:11:42.435058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:11:42.628597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반법인 93
96.9%
특수법인 2
 
2.1%
개인 1
 
1.0%

전화번호
Text

MISSING 

Distinct78
Distinct (%)100.0%
Missing18
Missing (%)18.8%
Memory size896.0 B
2024-03-14T20:11:43.630466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.025641
Min length11

Characters and Unicode

Total characters938
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)100.0%

Sample

1st row041-662-2730
2nd row041-857-9696
3rd row02-2210-0273
4th row02-2007-2000
5th row042-471-4567
ValueCountFrequency (%)
041-853-0494 1
 
1.3%
041-553-8181 1
 
1.3%
041-881-6513 1
 
1.3%
041-334-4355 1
 
1.3%
041-533-0481 1
 
1.3%
042-471-2701 1
 
1.3%
041-578-0987 1
 
1.3%
041-732-5965 1
 
1.3%
041-554-9377 1
 
1.3%
042-828-0700 1
 
1.3%
Other values (68) 68
87.2%
2024-03-14T20:11:44.921693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 156
16.6%
0 138
14.7%
1 125
13.3%
4 119
12.7%
5 86
9.2%
2 66
7.0%
3 60
 
6.4%
7 56
 
6.0%
8 55
 
5.9%
6 47
 
5.0%
Other values (2) 30
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 781
83.3%
Dash Punctuation 156
 
16.6%
Math Symbol 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 138
17.7%
1 125
16.0%
4 119
15.2%
5 86
11.0%
2 66
8.5%
3 60
7.7%
7 56
7.2%
8 55
 
7.0%
6 47
 
6.0%
9 29
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 156
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 938
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 156
16.6%
0 138
14.7%
1 125
13.3%
4 119
12.7%
5 86
9.2%
2 66
7.0%
3 60
 
6.4%
7 56
 
6.0%
8 55
 
5.9%
6 47
 
5.0%
Other values (2) 30
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 938
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 156
16.6%
0 138
14.7%
1 125
13.3%
4 119
12.7%
5 86
9.2%
2 66
7.0%
3 60
 
6.4%
7 56
 
6.0%
8 55
 
5.9%
6 47
 
5.0%
Other values (2) 30
 
3.2%

팩스번호
Text

MISSING 

Distinct19
Distinct (%)100.0%
Missing77
Missing (%)80.2%
Memory size896.0 B
2024-03-14T20:11:45.639356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12
Min length11

Characters and Unicode

Total characters228
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)100.0%

Sample

1st row02-2210-0274
2nd row041-588-7777
3rd row041-571-0987
4th row042-471-2706
5th row041-533-0486
ValueCountFrequency (%)
02-2210-0274 1
 
5.3%
070-4773-1509 1
 
5.3%
02-2070-7277 1
 
5.3%
041-558-0054 1
 
5.3%
041-588-9284 1
 
5.3%
041-553-7384 1
 
5.3%
02-571-1876 1
 
5.3%
042-822-8456 1
 
5.3%
041-578-2226 1
 
5.3%
041-578-4108 1
 
5.3%
Other values (9) 9
47.4%
2024-03-14T20:11:46.852474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 38
16.7%
0 36
15.8%
4 27
11.8%
7 24
10.5%
1 23
10.1%
2 20
8.8%
8 19
8.3%
5 18
7.9%
3 12
 
5.3%
6 6
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 190
83.3%
Dash Punctuation 38
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 36
18.9%
4 27
14.2%
7 24
12.6%
1 23
12.1%
2 20
10.5%
8 19
10.0%
5 18
9.5%
3 12
 
6.3%
6 6
 
3.2%
9 5
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 228
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 38
16.7%
0 36
15.8%
4 27
11.8%
7 24
10.5%
1 23
10.1%
2 20
8.8%
8 19
8.3%
5 18
7.9%
3 12
 
5.3%
6 6
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 228
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 38
16.7%
0 36
15.8%
4 27
11.8%
7 24
10.5%
1 23
10.1%
2 20
8.8%
8 19
8.3%
5 18
7.9%
3 12
 
5.3%
6 6
 
2.6%
Distinct88
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size896.0 B
Minimum2008-01-17 00:00:00
Maximum2023-11-21 00:00:00
2024-03-14T20:11:47.092539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:11:47.365506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록상태
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size896.0 B
등록완료
96 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록완료
2nd row등록완료
3rd row등록완료
4th row등록완료
5th row등록완료

Common Values

ValueCountFrequency (%)
등록완료 96
100.0%

Length

2024-03-14T20:11:47.736830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:11:47.891304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
등록완료 96
100.0%

처리상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size896.0 B
정상
94 
전입
 
2

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 94
97.9%
전입 2
 
2.1%

Length

2024-03-14T20:11:48.049325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:11:48.207899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 94
97.9%
전입 2
 
2.1%

자본금(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct54
Distinct (%)56.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1686800.8
Minimum300000
Maximum35051470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size992.0 B
2024-03-14T20:11:48.394354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum300000
5-th percentile300000
Q1500000
median950000
Q31750500
95-th percentile3819600
Maximum35051470
Range34751470
Interquartile range (IQR)1250500

Descriptive statistics

Standard deviation3718198.4
Coefficient of variation (CV)2.2042902
Kurtosis69.808273
Mean1686800.8
Median Absolute Deviation (MAD)594500
Skewness7.8723872
Sum1.6193288 × 108
Variance1.3824999 × 1013
MonotonicityNot monotonic
2024-03-14T20:11:48.767944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300000 14
 
14.6%
500000 12
 
12.5%
1210000 5
 
5.2%
2500000 3
 
3.1%
350000 3
 
3.1%
950000 2
 
2.1%
650000 2
 
2.1%
520000 2
 
2.1%
1220000 2
 
2.1%
510000 2
 
2.1%
Other values (44) 49
51.0%
ValueCountFrequency (%)
300000 14
14.6%
302000 1
 
1.0%
330000 1
 
1.0%
350000 3
 
3.1%
351000 1
 
1.0%
360000 1
 
1.0%
400000 2
 
2.1%
500000 12
12.5%
510000 2
 
2.1%
520000 2
 
2.1%
ValueCountFrequency (%)
35051470 1
1.0%
8500000 1
1.0%
6696247 1
1.0%
6500000 1
1.0%
4178400 1
1.0%
3700000 1
1.0%
3300000 2
2.1%
3200000 1
1.0%
3000000 1
1.0%
2960000 1
1.0%

영업소재지
Text

UNIQUE 

Distinct96
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size896.0 B
2024-03-14T20:11:49.802941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length38
Mean length29.177083
Min length19

Characters and Unicode

Total characters2801
Distinct characters195
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)100.0%

Sample

1st row충청남도 서산시 안견로 213, 4층(동문동)
2nd row충청남도 공주시 반포면 반포초교길 154
3rd row충청남도 아산시 청운로176번길 25, 2층
4th row충청남도 천안시 서북구 두정로 106, (두정동)
5th row충청남도 논산시 연산면 계백로 2514
ValueCountFrequency (%)
충청남도 94
 
16.2%
천안시 46
 
7.9%
서북구 27
 
4.7%
동남구 19
 
3.3%
아산시 12
 
2.1%
공주시 9
 
1.6%
2층 9
 
1.6%
1층 6
 
1.0%
반포면 4
 
0.7%
홍성군 4
 
0.7%
Other values (283) 349
60.3%
2024-03-14T20:11:51.268690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
483
 
17.2%
1 121
 
4.3%
116
 
4.1%
112
 
4.0%
99
 
3.5%
98
 
3.5%
, 94
 
3.4%
83
 
3.0%
2 82
 
2.9%
74
 
2.6%
Other values (185) 1439
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1608
57.4%
Decimal Number 497
 
17.7%
Space Separator 483
 
17.2%
Other Punctuation 94
 
3.4%
Close Punctuation 49
 
1.7%
Open Punctuation 49
 
1.7%
Dash Punctuation 20
 
0.7%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
116
 
7.2%
112
 
7.0%
99
 
6.2%
98
 
6.1%
83
 
5.2%
74
 
4.6%
73
 
4.5%
51
 
3.2%
51
 
3.2%
50
 
3.1%
Other values (169) 801
49.8%
Decimal Number
ValueCountFrequency (%)
1 121
24.3%
2 82
16.5%
3 62
12.5%
4 54
10.9%
0 53
10.7%
6 40
 
8.0%
5 38
 
7.6%
7 22
 
4.4%
8 14
 
2.8%
9 11
 
2.2%
Space Separator
ValueCountFrequency (%)
483
100.0%
Other Punctuation
ValueCountFrequency (%)
, 94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1608
57.4%
Common 1192
42.6%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
116
 
7.2%
112
 
7.0%
99
 
6.2%
98
 
6.1%
83
 
5.2%
74
 
4.6%
73
 
4.5%
51
 
3.2%
51
 
3.2%
50
 
3.1%
Other values (169) 801
49.8%
Common
ValueCountFrequency (%)
483
40.5%
1 121
 
10.2%
, 94
 
7.9%
2 82
 
6.9%
3 62
 
5.2%
4 54
 
4.5%
0 53
 
4.4%
) 49
 
4.1%
( 49
 
4.1%
6 40
 
3.4%
Other values (5) 105
 
8.8%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1608
57.4%
ASCII 1193
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
483
40.5%
1 121
 
10.1%
, 94
 
7.9%
2 82
 
6.9%
3 62
 
5.2%
4 54
 
4.5%
0 53
 
4.4%
) 49
 
4.1%
( 49
 
4.1%
6 40
 
3.4%
Other values (6) 106
 
8.9%
Hangul
ValueCountFrequency (%)
116
 
7.2%
112
 
7.0%
99
 
6.2%
98
 
6.1%
83
 
5.2%
74
 
4.6%
73
 
4.5%
51
 
3.2%
51
 
3.2%
50
 
3.1%
Other values (169) 801
49.8%

Interactions

2024-03-14T20:11:34.498364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:11:51.438205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부동산개발업등록번호상호대표자법인구분전화번호팩스번호등록일자처리상태자본금(천원)영업소재지
부동산개발업등록번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
상호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
대표자1.0001.0001.0001.0001.0001.0000.9961.0001.0001.000
법인구분1.0001.0001.0001.0001.0001.0001.0000.4490.6631.000
전화번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
팩스번호1.0001.0001.0001.0001.0001.0001.000NaN1.0001.000
등록일자1.0001.0000.9961.0001.0001.0001.0001.0001.0001.000
처리상태1.0001.0001.0000.4491.000NaN1.0001.0000.5111.000
자본금(천원)1.0001.0001.0000.6631.0001.0001.0000.5111.0001.000
영업소재지1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
2024-03-14T20:11:51.656892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리상태법인구분
처리상태1.0000.692
법인구분0.6921.000
2024-03-14T20:11:51.794668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자본금(천원)법인구분처리상태
자본금(천원)1.0000.6890.353
법인구분0.6891.0000.692
처리상태0.3530.6921.000

Missing values

2024-03-14T20:11:35.057160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:11:35.328913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T20:11:35.515400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

부동산개발업등록번호상호대표자법인구분전화번호팩스번호등록일자등록상태처리상태자본금(천원)영업소재지
0220012아이엠비개발 주식회사김율,김만식일반법인041-662-2730<NA>2022-11-25등록완료정상300000충청남도 서산시 안견로 213, 4층(동문동)
1충남080003태산종합건설㈜김지찬일반법인041-857-9696<NA>2008-01-17등록완료정상3200000충청남도 공주시 반포면 반포초교길 154
2충남080007경남기업㈜조유선일반법인02-2210-027302-2210-02742008-03-05등록완료정상35051470충청남도 아산시 청운로176번길 25, 2층
3충남080037㈜동일토건고동현일반법인02-2007-2000<NA>2008-05-29등록완료정상4178400충청남도 천안시 서북구 두정로 106, (두정동)
4충남080040일산종합건설㈜박성배,박재현일반법인042-471-4567<NA>2008-06-09등록완료정상3700000충청남도 논산시 연산면 계백로 2514
5충남080046해유건설(주)한세우,최홍윤일반법인041-543-1892<NA>2008-06-24등록완료정상2000000충청남도 아산시 음봉면 연암산로 71-16, B동
6충남080048한성건설㈜송용상일반법인041-556-4611<NA>2008-07-03등록완료정상3000000충청남도 천안시 서북구 두정역서3길 43, (두정동, 한성빌딩 3층)
7충남090001서영건설(주)오세명일반법인041-664-1783<NA>2009-05-08등록완료정상1310000충청남도 예산군 삽교읍 애향13길 10, 1층
8충남090002일조산업개발㈜이태희일반법인041-572-0114<NA>2009-06-09등록완료정상2150000충청남도 천안시 동남구 봉정로 14, (봉명동, 봉명빌딩 3층)
9충남090004㈜정보종합건설박경식일반법인041-853-0494<NA>2009-06-23등록완료정상1210000충청남도 공주시 반포면 내송길 19, 1층 101호
부동산개발업등록번호상호대표자법인구분전화번호팩스번호등록일자등록상태처리상태자본금(천원)영업소재지
86충남230001대광개발 주식회사김성태일반법인041-907-9009<NA>2023-02-16등록완료정상330000충청남도 천안시 서북구 한들1로 218, 112호(백석동, 천안백석하우스토리엔시티)
87충남230002목천복합단지개발 주식회사정홍근일반법인041-564-1122<NA>2023-02-16등록완료정상300000충청남도 천안시 동남구 목천읍 충절로 935-5, 201호
88충남230003큐씨디북천안물류 주식회사배철수일반법인02-794-4061<NA>2023-03-23등록완료정상300000충청남도 천안시 서북구 입장면 망향로 1111-4
89충남230004(주)다경종합건설경태현일반법인041-855-7626<NA>2023-05-26등록완료정상1450000충청남도 공주시 신금1길 11, 2층(신관동)
90충남230005아우어컴퍼니 주식회사장근수일반법인<NA><NA>2023-05-26등록완료정상300000충청남도 천안시 동남구 차돌고개3길 4-1, 2층(다가동)
91충남230006에스디종합건설(주)김경산일반법인041-547-6337<NA>2016-12-08등록완료정상680000충청남도 아산시 배방읍 희망로46번길 45-31, 107호
92충남230007주식회사 종합건축사사무소미당김동후,윤해,신동기일반법인<NA><NA>2023-09-04등록완료정상300000충청남도 천안시 서북구 불당25로 236, 309호(불당동, 디엠센텀시티)
93충남230008주식회사 정민종합건설곽은정일반법인<NA><NA>2023-09-04등록완료정상351000충청남도 아산시 음봉면 연암율금로106번길 1, 206호
94충남230009신미래종합건설 주식회사김진성일반법인<NA><NA>2023-09-11등록완료정상910000충청남도 당진시 송악읍 부동길 10, 2층
95충남230010코리아테니스파크 주식회사손홍근일반법인<NA><NA>2023-11-21등록완료정상990000충청남도 천안시 서북구 성환읍 연암율금로 464, 1층 2호