Overview

Dataset statistics

Number of variables15
Number of observations779
Missing cells3525
Missing cells (%)30.2%
Duplicate rows10
Duplicate rows (%)1.3%
Total size in memory93.7 KiB
Average record size in memory123.2 B

Variable types

Categorical4
Text5
Numeric3
DateTime3

Dataset

Description충청북도 음성군 건축물 인허가 현황(건축구분, 대지위치, 구조, 착공예정일, 실제 착공일, 사용 승인일, 주 용도, 설계 사무소, 감리사무소, 시공사무소 등)의 데이터를 제공합니다.
Author충청북도 음성군
URLhttps://www.data.go.kr/data/15035702/fileData.do

Alerts

기준일자 has constant value ""Constant
Dataset has 10 (1.3%) duplicate rowsDuplicates
건축면적(제곱미터) is highly overall correlated with 연면적(제곱미터)High correlation
연면적(제곱미터) is highly overall correlated with 건축면적(제곱미터)High correlation
증축연면적(제곱미터) is highly overall correlated with 건축구분High correlation
건축구분 is highly overall correlated with 증축연면적(제곱미터)High correlation
증축연면적(제곱미터) has 562 (72.1%) missing valuesMissing
착공예정일 has 287 (36.8%) missing valuesMissing
실제착공일 has 598 (76.8%) missing valuesMissing
사용승인일 has 571 (73.3%) missing valuesMissing
부속용도 has 288 (37.0%) missing valuesMissing
감리사무소명 has 611 (78.4%) missing valuesMissing
시공자사무소명 has 607 (77.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 11:59:35.978217
Analysis finished2023-12-12 11:59:39.400127
Duration3.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

건축구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
신축
519 
증축
218 
용도변경
 
31
대수선
 
9
재축
 
2

Length

Max length4
Median length2
Mean length2.0911425
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row증축
2nd row증축
3rd row신축
4th row신축
5th row증축

Common Values

ValueCountFrequency (%)
신축 519
66.6%
증축 218
28.0%
용도변경 31
 
4.0%
대수선 9
 
1.2%
재축 2
 
0.3%

Length

2023-12-12T20:59:39.507242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:59:39.673693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신축 519
66.6%
증축 218
28.0%
용도변경 31
 
4.0%
대수선 9
 
1.2%
재축 2
 
0.3%
Distinct725
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-12T20:59:40.047465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length23.250321
Min length16

Characters and Unicode

Total characters18112
Distinct characters135
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique680 ?
Unique (%)87.3%

Sample

1st row충청북도 음성군 금왕읍 내송리 530
2nd row충청북도 음성군 맹동면 두성리 1048 외1필지
3rd row충청북도 음성군 맹동면 용촌리 195-41
4th row충청북도 음성군 맹동면 동성리 380
5th row충청북도 음성군 대소면 오류리 642-2
ValueCountFrequency (%)
충청북도 779
18.4%
음성군 779
18.4%
금왕읍 157
 
3.7%
대소면 138
 
3.3%
외1필지 134
 
3.2%
음성읍 100
 
2.4%
삼성면 99
 
2.3%
맹동면 77
 
1.8%
생극면 63
 
1.5%
감곡면 60
 
1.4%
Other values (801) 1853
43.7%
2023-12-12T20:59:40.669266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3460
19.1%
1069
 
5.9%
881
 
4.9%
797
 
4.4%
796
 
4.4%
784
 
4.3%
781
 
4.3%
779
 
4.3%
779
 
4.3%
1 631
 
3.5%
Other values (125) 7355
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11066
61.1%
Space Separator 3460
 
19.1%
Decimal Number 3085
 
17.0%
Dash Punctuation 493
 
2.7%
Uppercase Letter 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1069
 
9.7%
881
 
8.0%
797
 
7.2%
796
 
7.2%
784
 
7.1%
781
 
7.1%
779
 
7.0%
779
 
7.0%
522
 
4.7%
282
 
2.5%
Other values (108) 3596
32.5%
Decimal Number
ValueCountFrequency (%)
1 631
20.5%
2 484
15.7%
3 339
11.0%
4 331
10.7%
5 298
9.7%
6 254
8.2%
7 199
 
6.5%
9 198
 
6.4%
0 178
 
5.8%
8 173
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
H 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
3460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 493
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11066
61.1%
Common 7042
38.9%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1069
 
9.7%
881
 
8.0%
797
 
7.2%
796
 
7.2%
784
 
7.1%
781
 
7.1%
779
 
7.0%
779
 
7.0%
522
 
4.7%
282
 
2.5%
Other values (108) 3596
32.5%
Common
ValueCountFrequency (%)
3460
49.1%
1 631
 
9.0%
- 493
 
7.0%
2 484
 
6.9%
3 339
 
4.8%
4 331
 
4.7%
5 298
 
4.2%
6 254
 
3.6%
7 199
 
2.8%
9 198
 
2.8%
Other values (4) 355
 
5.0%
Latin
ValueCountFrequency (%)
A 2
50.0%
H 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11066
61.1%
ASCII 7046
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3460
49.1%
1 631
 
9.0%
- 493
 
7.0%
2 484
 
6.9%
3 339
 
4.8%
4 331
 
4.7%
5 298
 
4.2%
6 254
 
3.6%
7 199
 
2.8%
9 198
 
2.8%
Other values (7) 359
 
5.1%
Hangul
ValueCountFrequency (%)
1069
 
9.7%
881
 
8.0%
797
 
7.2%
796
 
7.2%
784
 
7.1%
781
 
7.1%
779
 
7.0%
779
 
7.0%
522
 
4.7%
282
 
2.5%
Other values (108) 3596
32.5%

건축면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct658
Distinct (%)84.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1419.7415
Minimum18
Maximum55938.78
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2023-12-12T20:59:40.855995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18
5-th percentile56.477
Q198.05
median196.42
Q31033.51
95-th percentile6413.5
Maximum55938.78
Range55920.78
Interquartile range (IQR)935.46

Descriptive statistics

Standard deviation3641.3323
Coefficient of variation (CV)2.5647853
Kurtosis74.731531
Mean1419.7415
Median Absolute Deviation (MAD)124.42
Skewness6.8756211
Sum1105978.7
Variance13259301
MonotonicityNot monotonic
2023-12-12T20:59:41.055727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
84.65 23
 
3.0%
80.28 12
 
1.5%
77.8 5
 
0.6%
99.0 5
 
0.6%
82.86 4
 
0.5%
96.0 4
 
0.5%
73.64 4
 
0.5%
84.66 4
 
0.5%
495.0 4
 
0.5%
196.0 3
 
0.4%
Other values (648) 711
91.3%
ValueCountFrequency (%)
18.0 3
0.4%
23.01 1
 
0.1%
24.0 1
 
0.1%
27.6 1
 
0.1%
28.0 2
0.3%
30.0 2
0.3%
30.36 1
 
0.1%
30.72 1
 
0.1%
31.05 3
0.4%
32.0 1
 
0.1%
ValueCountFrequency (%)
55938.78 1
0.1%
26709.66 1
0.1%
24714.77 1
0.1%
21190.16 1
0.1%
20470.26 1
0.1%
19146.2 1
0.1%
17744.36 1
0.1%
17403.78 1
0.1%
17200.81 1
0.1%
17183.52 2
0.3%

연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct662
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2174.7213
Minimum18
Maximum75196.65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2023-12-12T20:59:41.247884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18
5-th percentile59.831
Q197.62
median212.08
Q31340.82
95-th percentile9430.957
Maximum75196.65
Range75178.65
Interquartile range (IQR)1243.2

Descriptive statistics

Standard deviation6371.2959
Coefficient of variation (CV)2.9297069
Kurtosis45.488308
Mean2174.7213
Median Absolute Deviation (MAD)153.23
Skewness6.0446914
Sum1694107.9
Variance40593411
MonotonicityNot monotonic
2023-12-12T20:59:41.447091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
84.65 24
 
3.1%
79.38 12
 
1.5%
77.8 5
 
0.6%
99.0 5
 
0.6%
82.86 4
 
0.5%
84.66 4
 
0.5%
96.0 4
 
0.5%
495.0 4
 
0.5%
78.0 4
 
0.5%
73.64 4
 
0.5%
Other values (652) 709
91.0%
ValueCountFrequency (%)
18.0 3
0.4%
23.01 1
 
0.1%
24.0 1
 
0.1%
27.6 1
 
0.1%
28.0 2
0.3%
30.0 2
0.3%
30.36 1
 
0.1%
30.72 1
 
0.1%
31.05 3
0.4%
32.0 1
 
0.1%
ValueCountFrequency (%)
75196.65 1
0.1%
54976.81 1
0.1%
50993.82 1
0.1%
46778.15 1
0.1%
45581.46 1
0.1%
43589.87 1
0.1%
42801.02 1
0.1%
38556.51 1
0.1%
36116.8 1
0.1%
28411.34 1
0.1%

증축연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct211
Distinct (%)97.2%
Missing562
Missing (%)72.1%
Infinite0
Infinite (%)0.0%
Mean594.46553
Minimum-15.87
Maximum10655.51
Zeros1
Zeros (%)0.1%
Negative2
Negative (%)0.3%
Memory size7.0 KiB
2023-12-12T20:59:41.636045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-15.87
5-th percentile12.666
Q147.63
median167.12
Q3623.1
95-th percentile2840.784
Maximum10655.51
Range10671.38
Interquartile range (IQR)575.47

Descriptive statistics

Standard deviation1215.9314
Coefficient of variation (CV)2.0454195
Kurtosis27.104475
Mean594.46553
Median Absolute Deviation (MAD)142.57
Skewness4.5265502
Sum128999.02
Variance1478489.1
MonotonicityNot monotonic
2023-12-12T20:59:41.833197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15.0 2
 
0.3%
216.0 2
 
0.3%
18.0 2
 
0.3%
630.0 2
 
0.3%
191.66 2
 
0.3%
98.24 2
 
0.3%
48.72 1
 
0.1%
51.97 1
 
0.1%
19.98 1
 
0.1%
184.68 1
 
0.1%
Other values (201) 201
 
25.8%
(Missing) 562
72.1%
ValueCountFrequency (%)
-15.87 1
0.1%
-9.41 1
0.1%
0.0 1
0.1%
4.4 1
0.1%
5.12 1
0.1%
6.75 1
0.1%
7.18 1
0.1%
8.84 1
0.1%
9.66 1
0.1%
10.8 1
0.1%
ValueCountFrequency (%)
10655.51 1
0.1%
6801.56 1
0.1%
6292.02 1
0.1%
5058.13 1
0.1%
4578.56 1
0.1%
3644.9 1
0.1%
3462.5 1
0.1%
3295.72 1
0.1%
3013.21 1
0.1%
3009.18 1
0.1%

구조
Categorical

Distinct13
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
경량철골구조
373 
일반철골구조
242 
철근콘크리트구조
87 
일반목구조
 
28
강파이프구조
 
15
Other values (8)
 
34

Length

Max length13
Median length6
Mean length6.2272144
Min length4

Unique

Unique3 ?
Unique (%)0.4%

Sample

1st row일반철골구조
2nd row일반철골구조
3rd row철근콘크리트구조
4th row일반목구조
5th row일반철골구조

Common Values

ValueCountFrequency (%)
경량철골구조 373
47.9%
일반철골구조 242
31.1%
철근콘크리트구조 87
 
11.2%
일반목구조 28
 
3.6%
강파이프구조 15
 
1.9%
<NA> 10
 
1.3%
컨테이너조 9
 
1.2%
공업화박판강구조(PEB) 4
 
0.5%
철골철근콘크리트구조 4
 
0.5%
프리케스트콘크리트구조 4
 
0.5%
Other values (3) 3
 
0.4%

Length

2023-12-12T20:59:42.010937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경량철골구조 373
47.9%
일반철골구조 242
31.1%
철근콘크리트구조 87
 
11.2%
일반목구조 28
 
3.6%
강파이프구조 15
 
1.9%
na 10
 
1.3%
컨테이너조 9
 
1.2%
공업화박판강구조(peb 4
 
0.5%
철골철근콘크리트구조 4
 
0.5%
프리케스트콘크리트구조 4
 
0.5%
Other values (3) 3
 
0.4%

착공예정일
Date

MISSING 

Distinct221
Distinct (%)44.9%
Missing287
Missing (%)36.8%
Memory size6.2 KiB
Minimum2022-11-23 00:00:00
Maximum2023-12-08 00:00:00
2023-12-12T20:59:42.506170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:42.689403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

실제착공일
Date

MISSING 

Distinct122
Distinct (%)67.4%
Missing598
Missing (%)76.8%
Memory size6.2 KiB
Minimum2022-11-23 00:00:00
Maximum2023-10-27 00:00:00
2023-12-12T20:59:42.846821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:43.006222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사용승인일
Date

MISSING 

Distinct112
Distinct (%)53.8%
Missing571
Missing (%)73.3%
Memory size6.2 KiB
Minimum2022-12-09 00:00:00
Maximum2023-12-06 00:00:00
2023-12-12T20:59:43.188021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:43.365641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주용도
Categorical

Distinct20
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
단독주택
278 
공장
186 
제2종근린생활시설
106 
제1종근린생활시설
88 
동물및식물관련시설
 
26
Other values (15)
95 

Length

Max length10
Median length9
Mean length5.139923
Min length2

Unique

Unique4 ?
Unique (%)0.5%

Sample

1st row공장
2nd row공장
3rd row운동시설
4th row단독주택
5th row공장

Common Values

ValueCountFrequency (%)
단독주택 278
35.7%
공장 186
23.9%
제2종근린생활시설 106
 
13.6%
제1종근린생활시설 88
 
11.3%
동물및식물관련시설 26
 
3.3%
창고시설 24
 
3.1%
자원순환관련시설 19
 
2.4%
자동차관련시설 11
 
1.4%
업무시설 8
 
1.0%
노유자시설 6
 
0.8%
Other values (10) 27
 
3.5%

Length

2023-12-12T20:59:43.536959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 278
35.7%
공장 186
23.9%
제2종근린생활시설 106
 
13.6%
제1종근린생활시설 88
 
11.3%
동물및식물관련시설 26
 
3.3%
창고시설 24
 
3.1%
자원순환관련시설 19
 
2.4%
자동차관련시설 11
 
1.4%
업무시설 8
 
1.0%
교육연구시설 6
 
0.8%
Other values (10) 27
 
3.5%

부속용도
Text

MISSING 

Distinct207
Distinct (%)42.2%
Missing288
Missing (%)37.0%
Memory size6.2 KiB
2023-12-12T20:59:43.789543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length27
Mean length6.2545825
Min length2

Characters and Unicode

Total characters3071
Distinct characters170
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique162 ?
Unique (%)33.0%

Sample

1st row다가구주택
2nd row(단독주택)
3rd row소매점
4th row공장+사무실
5th row오수중계펌프장
ValueCountFrequency (%)
단독주택 97
 
16.7%
사무소 53
 
9.1%
소매점 52
 
9.0%
공장 21
 
3.6%
창고 21
 
3.6%
18
 
3.1%
일반음식점 17
 
2.9%
제조업소 11
 
1.9%
다가구주택 11
 
1.9%
고물상 8
 
1.4%
Other values (177) 272
46.8%
2023-12-12T20:59:44.272252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
176
 
5.7%
169
 
5.5%
160
 
5.2%
124
 
4.0%
120
 
3.9%
119
 
3.9%
+ 116
 
3.8%
106
 
3.5%
95
 
3.1%
90
 
2.9%
Other values (160) 1796
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2655
86.5%
Math Symbol 116
 
3.8%
Space Separator 90
 
2.9%
Open Punctuation 69
 
2.2%
Close Punctuation 69
 
2.2%
Decimal Number 45
 
1.5%
Other Punctuation 18
 
0.6%
Uppercase Letter 6
 
0.2%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
176
 
6.6%
169
 
6.4%
160
 
6.0%
124
 
4.7%
120
 
4.5%
119
 
4.5%
106
 
4.0%
95
 
3.6%
79
 
3.0%
78
 
2.9%
Other values (143) 1429
53.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
N 1
16.7%
M 1
16.7%
E 1
16.7%
S 1
16.7%
Other Punctuation
ValueCountFrequency (%)
/ 15
83.3%
# 1
 
5.6%
? 1
 
5.6%
. 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 22
48.9%
2 22
48.9%
4 1
 
2.2%
Math Symbol
ValueCountFrequency (%)
+ 116
100.0%
Space Separator
ValueCountFrequency (%)
90
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2655
86.5%
Common 410
 
13.4%
Latin 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
176
 
6.6%
169
 
6.4%
160
 
6.0%
124
 
4.7%
120
 
4.5%
119
 
4.5%
106
 
4.0%
95
 
3.6%
79
 
3.0%
78
 
2.9%
Other values (143) 1429
53.8%
Common
ValueCountFrequency (%)
+ 116
28.3%
90
22.0%
( 69
16.8%
) 69
16.8%
1 22
 
5.4%
2 22
 
5.4%
/ 15
 
3.7%
- 3
 
0.7%
# 1
 
0.2%
? 1
 
0.2%
Other values (2) 2
 
0.5%
Latin
ValueCountFrequency (%)
A 2
33.3%
N 1
16.7%
M 1
16.7%
E 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2655
86.5%
ASCII 416
 
13.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
176
 
6.6%
169
 
6.4%
160
 
6.0%
124
 
4.7%
120
 
4.5%
119
 
4.5%
106
 
4.0%
95
 
3.6%
79
 
3.0%
78
 
2.9%
Other values (143) 1429
53.8%
ASCII
ValueCountFrequency (%)
+ 116
27.9%
90
21.6%
( 69
16.6%
) 69
16.6%
1 22
 
5.3%
2 22
 
5.3%
/ 15
 
3.6%
- 3
 
0.7%
A 2
 
0.5%
# 1
 
0.2%
Other values (7) 7
 
1.7%
Distinct156
Distinct (%)20.1%
Missing1
Missing (%)0.1%
Memory size6.2 KiB
2023-12-12T20:59:44.526842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length9.9524422
Min length5

Characters and Unicode

Total characters7743
Distinct characters170
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)12.6%

Sample

1st row대현건축사사무소
2nd row주식회사 아이에프디건축사사무소
3rd row(주)음성건축사사무소
4th row장원건축사사무소
5th row(주)뿌리건축사사무소
ValueCountFrequency (%)
건축사사무소 139
 
14.3%
나래건축사사무소 67
 
6.9%
주)뿌리건축사사무소 56
 
5.8%
영남건축사사무소 52
 
5.3%
주식회사 52
 
5.3%
주)일건축사사무소 47
 
4.8%
주식회사가이건축사사무소 38
 
3.9%
건축사사무소소담 37
 
3.8%
함영선 37
 
3.8%
주)음성건축사사무소 36
 
3.7%
Other values (148) 412
42.3%
2023-12-12T20:59:44.942098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1665
21.5%
815
10.5%
796
10.3%
784
10.1%
777
10.0%
325
 
4.2%
( 214
 
2.8%
) 214
 
2.8%
205
 
2.6%
108
 
1.4%
Other values (160) 1840
23.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7103
91.7%
Open Punctuation 214
 
2.8%
Close Punctuation 214
 
2.8%
Space Separator 205
 
2.6%
Uppercase Letter 5
 
0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1665
23.4%
815
11.5%
796
11.2%
784
11.0%
777
10.9%
325
 
4.6%
108
 
1.5%
108
 
1.5%
106
 
1.5%
92
 
1.3%
Other values (151) 1527
21.5%
Uppercase Letter
ValueCountFrequency (%)
T 1
20.0%
G 1
20.0%
L 1
20.0%
I 1
20.0%
M 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 214
100.0%
Close Punctuation
ValueCountFrequency (%)
) 214
100.0%
Space Separator
ValueCountFrequency (%)
205
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7103
91.7%
Common 635
 
8.2%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1665
23.4%
815
11.5%
796
11.2%
784
11.0%
777
10.9%
325
 
4.6%
108
 
1.5%
108
 
1.5%
106
 
1.5%
92
 
1.3%
Other values (151) 1527
21.5%
Latin
ValueCountFrequency (%)
T 1
20.0%
G 1
20.0%
L 1
20.0%
I 1
20.0%
M 1
20.0%
Common
ValueCountFrequency (%)
( 214
33.7%
) 214
33.7%
205
32.3%
. 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7103
91.7%
ASCII 640
 
8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1665
23.4%
815
11.5%
796
11.2%
784
11.0%
777
10.9%
325
 
4.6%
108
 
1.5%
108
 
1.5%
106
 
1.5%
92
 
1.3%
Other values (151) 1527
21.5%
ASCII
ValueCountFrequency (%)
( 214
33.4%
) 214
33.4%
205
32.0%
. 2
 
0.3%
T 1
 
0.2%
G 1
 
0.2%
L 1
 
0.2%
I 1
 
0.2%
M 1
 
0.2%

감리사무소명
Text

MISSING 

Distinct62
Distinct (%)36.9%
Missing611
Missing (%)78.4%
Memory size6.2 KiB
2023-12-12T20:59:45.223512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length10.488095
Min length8

Characters and Unicode

Total characters1762
Distinct characters113
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)20.2%

Sample

1st row장원건축사사무소
2nd row(주)뿌리건축사사무소
3rd row함영선 건축사사무소
4th row(주)일건축사사무소
5th row(주)음성건축사사무소
ValueCountFrequency (%)
건축사사무소 28
 
13.1%
주)일건축사사무소 20
 
9.3%
주식회사 14
 
6.5%
주)음성건축사사무소 13
 
6.1%
주)뿌리건축사사무소 12
 
5.6%
함영선 10
 
4.7%
주식회사가이건축사사무소 10
 
4.7%
건축사사무소소담 7
 
3.3%
장원건축사사무소 7
 
3.3%
마루건축사사무소 7
 
3.3%
Other values (57) 86
40.2%
2023-12-12T20:59:45.686723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
363
20.6%
175
9.9%
171
9.7%
170
9.6%
169
9.6%
94
 
5.3%
( 66
 
3.7%
) 66
 
3.7%
48
 
2.7%
27
 
1.5%
Other values (103) 413
23.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1581
89.7%
Open Punctuation 66
 
3.7%
Close Punctuation 66
 
3.7%
Space Separator 48
 
2.7%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
363
23.0%
175
11.1%
171
10.8%
170
10.8%
169
10.7%
94
 
5.9%
27
 
1.7%
27
 
1.7%
25
 
1.6%
22
 
1.4%
Other values (99) 338
21.4%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Space Separator
ValueCountFrequency (%)
48
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1581
89.7%
Common 181
 
10.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
363
23.0%
175
11.1%
171
10.8%
170
10.8%
169
10.7%
94
 
5.9%
27
 
1.7%
27
 
1.7%
25
 
1.6%
22
 
1.4%
Other values (99) 338
21.4%
Common
ValueCountFrequency (%)
( 66
36.5%
) 66
36.5%
48
26.5%
. 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1581
89.7%
ASCII 181
 
10.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
363
23.0%
175
11.1%
171
10.8%
170
10.8%
169
10.7%
94
 
5.9%
27
 
1.7%
27
 
1.7%
25
 
1.6%
22
 
1.4%
Other values (99) 338
21.4%
ASCII
ValueCountFrequency (%)
( 66
36.5%
) 66
36.5%
48
26.5%
. 1
 
0.6%

시공자사무소명
Text

MISSING 

Distinct127
Distinct (%)73.8%
Missing607
Missing (%)77.9%
Memory size6.2 KiB
2023-12-12T20:59:45.986383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length8.7034884
Min length4

Characters and Unicode

Total characters1497
Distinct characters147
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)59.9%

Sample

1st row(주)하우징팩토리
2nd row진성종합건설주식회사
3rd row일진건설(주)
4th row미진종합건설(주)
5th row대건산업건설(주)
ValueCountFrequency (%)
주식회사 22
 
11.3%
대건산업건설(주 8
 
4.1%
일진건설(주 5
 
2.6%
미진종합건설(주 4
 
2.1%
주)태양건설 4
 
2.1%
대덕건설(주 4
 
2.1%
정근건설주식회사 3
 
1.5%
진성종합건설주식회사 3
 
1.5%
운화종합건설(주 3
 
1.5%
주)안도종합건설 3
 
1.5%
Other values (118) 135
69.6%
2023-12-12T20:59:46.457075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
171
 
11.4%
152
 
10.2%
142
 
9.5%
( 127
 
8.5%
) 127
 
8.5%
82
 
5.5%
80
 
5.3%
44
 
2.9%
43
 
2.9%
43
 
2.9%
Other values (137) 486
32.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1220
81.5%
Open Punctuation 127
 
8.5%
Close Punctuation 127
 
8.5%
Space Separator 22
 
1.5%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
14.0%
152
 
12.5%
142
 
11.6%
82
 
6.7%
80
 
6.6%
44
 
3.6%
43
 
3.5%
43
 
3.5%
25
 
2.0%
23
 
1.9%
Other values (133) 415
34.0%
Open Punctuation
ValueCountFrequency (%)
( 127
100.0%
Close Punctuation
ValueCountFrequency (%)
) 127
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1220
81.5%
Common 277
 
18.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
14.0%
152
 
12.5%
142
 
11.6%
82
 
6.7%
80
 
6.6%
44
 
3.6%
43
 
3.5%
43
 
3.5%
25
 
2.0%
23
 
1.9%
Other values (133) 415
34.0%
Common
ValueCountFrequency (%)
( 127
45.8%
) 127
45.8%
22
 
7.9%
. 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1220
81.5%
ASCII 277
 
18.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
171
14.0%
152
 
12.5%
142
 
11.6%
82
 
6.7%
80
 
6.6%
44
 
3.6%
43
 
3.5%
43
 
3.5%
25
 
2.0%
23
 
1.9%
Other values (133) 415
34.0%
ASCII
ValueCountFrequency (%)
( 127
45.8%
) 127
45.8%
22
 
7.9%
. 1
 
0.4%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-06
779 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-06
2nd row2023-12-06
3rd row2023-12-06
4th row2023-12-06
5th row2023-12-06

Common Values

ValueCountFrequency (%)
2023-12-06 779
100.0%

Length

2023-12-12T20:59:46.629934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:59:46.758371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-06 779
100.0%

Interactions

2023-12-12T20:59:37.855433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:36.889610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.297936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.972900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.008718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.414721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:38.103903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.149122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:37.679389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:59:46.856979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축구분건축면적(제곱미터)연면적(제곱미터)증축연면적(제곱미터)구조주용도감리사무소명
건축구분1.0000.2480.297NaN0.2030.6460.741
건축면적(제곱미터)0.2481.0000.9000.2650.4540.1980.762
연면적(제곱미터)0.2970.9001.0000.4260.5910.3130.859
증축연면적(제곱미터)NaN0.2650.4261.0000.2650.0000.890
구조0.2030.4540.5910.2651.0000.5860.000
주용도0.6460.1980.3130.0000.5861.0000.952
감리사무소명0.7410.7620.8590.8900.0000.9521.000
2023-12-12T20:59:47.011862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구조건축구분주용도
구조1.0000.1130.241
건축구분0.1131.0000.333
주용도0.2410.3331.000
2023-12-12T20:59:47.115784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축면적(제곱미터)연면적(제곱미터)증축연면적(제곱미터)건축구분구조주용도
건축면적(제곱미터)1.0000.9790.4560.1700.1940.092
연면적(제곱미터)0.9791.0000.4580.1760.2980.128
증축연면적(제곱미터)0.4560.4581.0001.0000.1260.000
건축구분0.1700.1761.0001.0000.1130.333
구조0.1940.2980.1260.1131.0000.241
주용도0.0920.1280.0000.3330.2411.000

Missing values

2023-12-12T20:59:38.438142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:59:38.893074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T20:59:39.223834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

건축구분대지위치건축면적(제곱미터)연면적(제곱미터)증축연면적(제곱미터)구조착공예정일실제착공일사용승인일주용도부속용도설계사무소명감리사무소명시공자사무소명기준일자
0증축충청북도 음성군 금왕읍 내송리 53055938.7875196.65160.0일반철골구조<NA><NA><NA>공장<NA>대현건축사사무소<NA><NA>2023-12-06
1증축충청북도 음성군 맹동면 두성리 1048 외1필지10970.8715490.942597.91일반철골구조<NA><NA><NA>공장<NA>주식회사 아이에프디건축사사무소<NA><NA>2023-12-06
2신축충청북도 음성군 맹동면 용촌리 195-41318.881092.56<NA>철근콘크리트구조<NA><NA><NA>운동시설다가구주택(주)음성건축사사무소<NA><NA>2023-12-06
3신축충청북도 음성군 맹동면 동성리 380136.61136.61<NA>일반목구조2023-12-06<NA><NA>단독주택(단독주택)장원건축사사무소장원건축사사무소(주)하우징팩토리2023-12-06
4증축충청북도 음성군 대소면 오류리 642-21099.591257.59245.19일반철골구조<NA><NA><NA>공장<NA>(주)뿌리건축사사무소<NA><NA>2023-12-06
5신축충청북도 음성군 대소면 태생리 524397.9397.9<NA>경량철골구조<NA><NA><NA>제1종근린생활시설소매점제우건축사사무소<NA><NA>2023-12-06
6증축충청북도 음성군 맹동면 쌍정리 7433030.53030.5800.0일반철골구조<NA><NA><NA>공장공장+사무실건축사사무소소담<NA><NA>2023-12-06
7신축충청북도 음성군 맹동면 인곡리 산 25-8 외4필지145.04282.79<NA>철근콘크리트구조<NA><NA><NA>제1종근린생활시설오수중계펌프장제파건축사사무소<NA><NA>2023-12-06
8증축충청북도 음성군 원남면 상노리 7627258.097476.361883.67일반철골구조<NA><NA><NA>공장공장및부대시설단건축사사무소<NA><NA>2023-12-06
9신축충청북도 음성군 생극면 신양리 369-4198.0198.0<NA>일반철골구조<NA><NA><NA>자동차관련시설주기장주식회사현대제이건축사사무소<NA><NA>2023-12-06
건축구분대지위치건축면적(제곱미터)연면적(제곱미터)증축연면적(제곱미터)구조착공예정일실제착공일사용승인일주용도부속용도설계사무소명감리사무소명시공자사무소명기준일자
769신축충청북도 음성군 대소면 태생리 527-1199.099.0<NA>경량철골구조2022-11-232022-11-242023-04-04단독주택<NA>지오건축사사무소<NA><NA>2023-12-06
770신축충청북도 음성군 대소면 태생리 527-1236.036.0<NA>경량철골구조2022-11-232022-11-242023-04-04제1종근린생활시설<NA>지오건축사사무소<NA><NA>2023-12-06
771신축충청북도 음성군 대소면 태생리 527-999.099.0<NA>경량철골구조2022-11-232022-11-242023-04-04단독주택<NA>지오건축사사무소<NA><NA>2023-12-06
772신축충청북도 음성군 원남면 덕정리 365132.0132.0<NA>일반철골구조<NA><NA><NA>제1종근린생활시설소매점(주)범건축사사무소<NA><NA>2023-12-06
773신축충청북도 음성군 삼성면 선정리 343-3 외1필지396.0396.0<NA>강파이프구조<NA><NA><NA>창고시설농업용창고(주)뿌리건축사사무소<NA><NA>2023-12-06
774신축충청북도 음성군 대소면 대풍리 364 외1필지396.0396.0<NA>일반철골구조<NA><NA><NA>제2종근린생활시설제조업소에이원 건축사사무소<NA><NA>2023-12-06
775신축충청북도 음성군 대소면 오류리 39146.52145.62<NA>경량철골구조<NA><NA><NA>단독주택단독주택(농가주택)+농업용창고(주)일건축사사무소<NA><NA>2023-12-06
776용도변경충청북도 음성군 대소면 대풍리 415-132298.02298.0<NA><NA><NA><NA><NA>제1종근린생활시설제1+2종근린생활시설+창고시설+자동차관련시설(주)뿌리건축사사무소<NA><NA>2023-12-06
777증축충청북도 음성군 음성읍 용산리 1740 외7필지2121.142366.5959.14일반철골구조2022-12-12<NA><NA>공장<NA>주식회사 마루건축사사무소<NA><NA>2023-12-06
778증축충청북도 음성군 감곡면 오궁리 454-1 외1필지224.28219.6353.55경량철골구조2022-12-01<NA><NA>제2종근린생활시설<NA>주식회사미공건축사사무소<NA><NA>2023-12-06

Duplicate rows

Most frequently occurring

건축구분대지위치건축면적(제곱미터)연면적(제곱미터)증축연면적(제곱미터)구조착공예정일실제착공일사용승인일주용도부속용도설계사무소명감리사무소명시공자사무소명기준일자# duplicates
6신축충청북도 음성군 음성읍 용산리 2-58 외1필지84.6584.65<NA>경량철골구조<NA><NA><NA>단독주택단독주택나래건축사사무소<NA><NA>2023-12-065
5신축충청북도 음성군 음성읍 용산리 2-3884.6584.65<NA>경량철골구조<NA><NA><NA>단독주택단독주택나래건축사사무소<NA><NA>2023-12-063
7신축충청북도 음성군 음성읍 용산리 2-6 외1필지84.6584.65<NA>경량철골구조<NA><NA><NA>단독주택단독주택나래건축사사무소<NA><NA>2023-12-063
0신축충청북도 음성군 대소면 성본리 566-1 외1필지198.3198.3<NA>경량철골구조<NA><NA><NA>제2종근린생활시설사무소(주)음성건축사사무소<NA><NA>2023-12-062
1신축충청북도 음성군 대소면 소석리 산 1011111.51111.5<NA>일반철골구조<NA><NA><NA>공장<NA>(주)음성건축사사무소<NA><NA>2023-12-062
2신축충청북도 음성군 삼성면 선정리 397-267.267.2<NA>경량철골구조<NA><NA><NA>단독주택<NA>(주)토담건축사사무소<NA><NA>2023-12-062
3신축충청북도 음성군 생극면 관성리 산 65-1 외1필지1713.01713.0<NA>일반철골구조<NA><NA><NA>동물및식물관련시설버섯재배사주식회사 이루안건축사사무소<NA><NA>2023-12-062
4신축충청북도 음성군 원남면 삼용리 226-284.6684.66<NA>경량철골구조2023-06-24<NA><NA>단독주택단독주택나래건축사사무소<NA><NA>2023-12-062
8신축충청북도 음성군 음성읍 한벌리 49299.3699.36<NA>경량철골구조2023-11-13<NA><NA>단독주택단독주택나래건축사사무소<NA><NA>2023-12-062
9용도변경충청북도 음성군 대소면 대풍리 415-132298.02298.0<NA><NA><NA><NA><NA>제1종근린생활시설제1+2종근린생활시설+창고시설+자동차관련시설(주)뿌리건축사사무소<NA><NA>2023-12-062