Overview

Dataset statistics

Number of variables14
Number of observations238
Missing cells792
Missing cells (%)23.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.4 KiB
Average record size in memory113.6 B

Variable types

Numeric1
Categorical3
Text7
DateTime3

Dataset

Description경상북도 안동시의 건설현장시공정보 중 시공중인 건에 대한 데이터입니다. (연번, 구분, 시공대지위치, 주용도, 착공예정일, 실제착공일, 준공예정일, 시공업체명, 시공업체대표번호, 감리사무소명, 감리사무소 대표번호, 설계사무소명, 설계사무소 대표번호, 데이터기준일자)
URLhttps://www.data.go.kr/data/15035735/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
시공업체명 has 185 (77.7%) missing valuesMissing
시공업체 대표번호 has 164 (68.9%) missing valuesMissing
감리사무소명 has 174 (73.1%) missing valuesMissing
감리사무소 대표번호 has 185 (77.7%) missing valuesMissing
설계사무소명 has 29 (12.2%) missing valuesMissing
설계사무소 대표번호 has 54 (22.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:46:06.034606
Analysis finished2023-12-12 23:46:07.066086
Duration1.03 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct238
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean120.5
Minimum2
Maximum239
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-13T08:46:07.122247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile13.85
Q161.25
median120.5
Q3179.75
95-th percentile227.15
Maximum239
Range237
Interquartile range (IQR)118.5

Descriptive statistics

Standard deviation68.848868
Coefficient of variation (CV)0.5713599
Kurtosis-1.2
Mean120.5
Median Absolute Deviation (MAD)59.5
Skewness0
Sum28679
Variance4740.1667
MonotonicityStrictly increasing
2023-12-13T08:46:07.239637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 1
 
0.4%
166 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
Other values (228) 228
95.8%
ValueCountFrequency (%)
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
11 1
0.4%
ValueCountFrequency (%)
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%
233 1
0.4%
232 1
0.4%
231 1
0.4%
230 1
0.4%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
신고
174 
허가
64 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row허가
2nd row허가
3rd row허가
4th row허가
5th row허가

Common Values

ValueCountFrequency (%)
신고 174
73.1%
허가 64
 
26.9%

Length

2023-12-13T08:46:07.362756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:46:07.431226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고 174
73.1%
허가 64
 
26.9%
Distinct233
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-13T08:46:07.605486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length27
Mean length22.39916
Min length16

Characters and Unicode

Total characters5331
Distinct characters142
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)96.2%

Sample

1st row경상북도 안동시 북후면 도촌리 622-1 외1필지
2nd row경상북도 안동시 풍산읍 노리 212-4 외1필지
3rd row경상북도 안동시 임동면 고천리 40-6
4th row경상북도 안동시 임동면 고천리 40-6
5th row경상북도 안동시 풍산읍 매곡리 1123 외2필지
ValueCountFrequency (%)
경상북도 238
19.3%
안동시 238
19.3%
외1필지 42
 
3.4%
풍산읍 25
 
2.0%
풍천면 21
 
1.7%
17
 
1.4%
서후면 17
 
1.4%
임하면 14
 
1.1%
임동면 14
 
1.1%
일직면 14
 
1.1%
Other values (362) 590
48.0%
2023-12-13T08:46:07.924323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
992
18.6%
319
 
6.0%
263
 
4.9%
260
 
4.9%
256
 
4.8%
252
 
4.7%
241
 
4.5%
241
 
4.5%
1 237
 
4.4%
177
 
3.3%
Other values (132) 2093
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3215
60.3%
Space Separator 992
 
18.6%
Decimal Number 952
 
17.9%
Dash Punctuation 150
 
2.8%
Uppercase Letter 16
 
0.3%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
319
 
9.9%
263
 
8.2%
260
 
8.1%
256
 
8.0%
252
 
7.8%
241
 
7.5%
241
 
7.5%
177
 
5.5%
149
 
4.6%
83
 
2.6%
Other values (110) 974
30.3%
Decimal Number
ValueCountFrequency (%)
1 237
24.9%
2 124
13.0%
5 96
10.1%
4 94
 
9.9%
3 91
 
9.6%
6 79
 
8.3%
9 64
 
6.7%
7 61
 
6.4%
0 59
 
6.2%
8 47
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
D 5
31.2%
B 2
 
12.5%
E 2
 
12.5%
F 2
 
12.5%
A 2
 
12.5%
C 1
 
6.2%
K 1
 
6.2%
I 1
 
6.2%
Space Separator
ValueCountFrequency (%)
992
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 150
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3215
60.3%
Common 2100
39.4%
Latin 16
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
319
 
9.9%
263
 
8.2%
260
 
8.1%
256
 
8.0%
252
 
7.8%
241
 
7.5%
241
 
7.5%
177
 
5.5%
149
 
4.6%
83
 
2.6%
Other values (110) 974
30.3%
Common
ValueCountFrequency (%)
992
47.2%
1 237
 
11.3%
- 150
 
7.1%
2 124
 
5.9%
5 96
 
4.6%
4 94
 
4.5%
3 91
 
4.3%
6 79
 
3.8%
9 64
 
3.0%
7 61
 
2.9%
Other values (4) 112
 
5.3%
Latin
ValueCountFrequency (%)
D 5
31.2%
B 2
 
12.5%
E 2
 
12.5%
F 2
 
12.5%
A 2
 
12.5%
C 1
 
6.2%
K 1
 
6.2%
I 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3215
60.3%
ASCII 2116
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
992
46.9%
1 237
 
11.2%
- 150
 
7.1%
2 124
 
5.9%
5 96
 
4.5%
4 94
 
4.4%
3 91
 
4.3%
6 79
 
3.7%
9 64
 
3.0%
7 61
 
2.9%
Other values (12) 128
 
6.0%
Hangul
ValueCountFrequency (%)
319
 
9.9%
263
 
8.2%
260
 
8.1%
256
 
8.0%
252
 
7.8%
241
 
7.5%
241
 
7.5%
177
 
5.5%
149
 
4.6%
83
 
2.6%
Other values (110) 974
30.3%

주용도
Categorical

Distinct14
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
단독주택
83 
제1종근린생활시설
40 
제2종근린생활시설
37 
동물및식물관련시설
29 
창고시설
26 
Other values (9)
23 

Length

Max length9
Median length8
Mean length6.2941176
Min length2

Unique

Unique5 ?
Unique (%)2.1%

Sample

1st row단독주택
2nd row제1종근린생활시설
3rd row창고시설
4th row창고시설
5th row공장

Common Values

ValueCountFrequency (%)
단독주택 83
34.9%
제1종근린생활시설 40
16.8%
제2종근린생활시설 37
15.5%
동물및식물관련시설 29
 
12.2%
창고시설 26
 
10.9%
공장 6
 
2.5%
문화및집회시설 4
 
1.7%
교육연구시설 4
 
1.7%
노유자시설 4
 
1.7%
업무시설 1
 
0.4%
Other values (4) 4
 
1.7%

Length

2023-12-13T08:46:08.039318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 83
34.9%
제1종근린생활시설 40
16.8%
제2종근린생활시설 37
15.5%
동물및식물관련시설 29
 
12.2%
창고시설 26
 
10.9%
공장 6
 
2.5%
문화및집회시설 4
 
1.7%
교육연구시설 4
 
1.7%
노유자시설 4
 
1.7%
업무시설 1
 
0.4%
Other values (4) 4
 
1.7%
Distinct137
Distinct (%)57.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2022-07-17 00:00:00
Maximum2023-08-17 00:00:00
2023-12-13T08:46:08.135665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:46:08.263645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct132
Distinct (%)55.7%
Missing1
Missing (%)0.4%
Memory size2.0 KiB
Minimum2022-07-15 00:00:00
Maximum2023-08-16 00:00:00
2023-12-13T08:46:08.398128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:46:08.507355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct102
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2022-03-28 00:00:00
Maximum2025-12-31 00:00:00
2023-12-13T08:46:08.617347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:46:08.725640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시공업체명
Text

MISSING 

Distinct46
Distinct (%)86.8%
Missing185
Missing (%)77.7%
Memory size2.0 KiB
2023-12-13T08:46:08.910921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length8.6415094
Min length5

Characters and Unicode

Total characters458
Distinct characters88
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)81.1%

Sample

1st row시우종합건설(주)
2nd row합자회사 세원이엔씨
3rd row주식회사태수건설
4th row주식회사 지에스건설
5th row지오종합건설(주)
ValueCountFrequency (%)
주식회사 7
 
11.5%
진일건설(주 6
 
9.8%
해오래종합건설(주 2
 
3.3%
태형종합건설(주 2
 
3.3%
경일건설(주 1
 
1.6%
영암건설(합 1
 
1.6%
주식회사세움건설 1
 
1.6%
농업회사법인주식회사청어람푸드 1
 
1.6%
주)금상종합건설 1
 
1.6%
금곡건설(주 1
 
1.6%
Other values (38) 38
62.3%
2023-12-13T08:46:09.472864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
11.1%
43
 
9.4%
41
 
9.0%
( 32
 
7.0%
) 32
 
7.0%
23
 
5.0%
22
 
4.8%
22
 
4.8%
21
 
4.6%
20
 
4.4%
Other values (78) 151
33.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 386
84.3%
Open Punctuation 32
 
7.0%
Close Punctuation 32
 
7.0%
Space Separator 8
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
13.2%
43
 
11.1%
41
 
10.6%
23
 
6.0%
22
 
5.7%
22
 
5.7%
21
 
5.4%
20
 
5.2%
7
 
1.8%
7
 
1.8%
Other values (75) 129
33.4%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 386
84.3%
Common 72
 
15.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
13.2%
43
 
11.1%
41
 
10.6%
23
 
6.0%
22
 
5.7%
22
 
5.7%
21
 
5.4%
20
 
5.2%
7
 
1.8%
7
 
1.8%
Other values (75) 129
33.4%
Common
ValueCountFrequency (%)
( 32
44.4%
) 32
44.4%
8
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 386
84.3%
ASCII 72
 
15.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
51
 
13.2%
43
 
11.1%
41
 
10.6%
23
 
6.0%
22
 
5.7%
22
 
5.7%
21
 
5.4%
20
 
5.2%
7
 
1.8%
7
 
1.8%
Other values (75) 129
33.4%
ASCII
ValueCountFrequency (%)
( 32
44.4%
) 32
44.4%
8
 
11.1%
Distinct52
Distinct (%)70.3%
Missing164
Missing (%)68.9%
Memory size2.0 KiB
2023-12-13T08:46:09.745403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.040541
Min length11

Characters and Unicode

Total characters891
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)54.1%

Sample

1st row054-901-4436
2nd row054-440-0352
3rd row054-553-7625
4th row053-257-9078
5th row054-741-7720
ValueCountFrequency (%)
054-843-8304 6
 
8.1%
054-857-3118 5
 
6.8%
054-854-5703 3
 
4.1%
031-809-69582 3
 
4.1%
054-854-9201 3
 
4.1%
054-856-2627 2
 
2.7%
054-901-4436 2
 
2.7%
054-855-3200 2
 
2.7%
054-854-8826 2
 
2.7%
054-773-0437 2
 
2.7%
Other values (42) 44
59.5%
2023-12-13T08:46:10.130791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 148
16.6%
0 142
15.9%
5 136
15.3%
4 107
12.0%
8 85
9.5%
3 72
8.1%
7 58
 
6.5%
1 48
 
5.4%
2 41
 
4.6%
9 29
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 743
83.4%
Dash Punctuation 148
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 142
19.1%
5 136
18.3%
4 107
14.4%
8 85
11.4%
3 72
9.7%
7 58
7.8%
1 48
 
6.5%
2 41
 
5.5%
9 29
 
3.9%
6 25
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 148
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 891
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 148
16.6%
0 142
15.9%
5 136
15.3%
4 107
12.0%
8 85
9.5%
3 72
8.1%
7 58
 
6.5%
1 48
 
5.4%
2 41
 
4.6%
9 29
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 891
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 148
16.6%
0 142
15.9%
5 136
15.3%
4 107
12.0%
8 85
9.5%
3 72
8.1%
7 58
 
6.5%
1 48
 
5.4%
2 41
 
4.6%
9 29
 
3.3%

감리사무소명
Text

MISSING 

Distinct39
Distinct (%)60.9%
Missing174
Missing (%)73.1%
Memory size2.0 KiB
2023-12-13T08:46:10.342199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length10.09375
Min length7

Characters and Unicode

Total characters646
Distinct characters67
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)42.2%

Sample

1st row건축사사무소 한림
2nd row길 건축사사무소
3rd row건축사사무소 우진건축
4th row건축사사무소 우진건축
5th row건축사사무소 이상건축
ValueCountFrequency (%)
건축사사무소 37
34.6%
가원건축사사무소 6
 
5.6%
동인 6
 
5.6%
우진건축 4
 
3.7%
에이원 3
 
2.8%
3
 
2.8%
건축사사무소반석 3
 
2.8%
주식회사 3
 
2.8%
종합건축사사무소 2
 
1.9%
광명건축 2
 
1.9%
Other values (33) 38
35.5%
2023-12-13T08:46:10.676230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
132
20.4%
84
13.0%
83
12.8%
64
9.9%
64
9.9%
46
 
7.1%
14
 
2.2%
10
 
1.5%
9
 
1.4%
8
 
1.2%
Other values (57) 132
20.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 590
91.3%
Space Separator 46
 
7.1%
Close Punctuation 5
 
0.8%
Open Punctuation 5
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
132
22.4%
84
14.2%
83
14.1%
64
10.8%
64
10.8%
14
 
2.4%
10
 
1.7%
9
 
1.5%
8
 
1.4%
7
 
1.2%
Other values (54) 115
19.5%
Space Separator
ValueCountFrequency (%)
46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 590
91.3%
Common 56
 
8.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
132
22.4%
84
14.2%
83
14.1%
64
10.8%
64
10.8%
14
 
2.4%
10
 
1.7%
9
 
1.5%
8
 
1.4%
7
 
1.2%
Other values (54) 115
19.5%
Common
ValueCountFrequency (%)
46
82.1%
) 5
 
8.9%
( 5
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 590
91.3%
ASCII 56
 
8.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
132
22.4%
84
14.2%
83
14.1%
64
10.8%
64
10.8%
14
 
2.4%
10
 
1.7%
9
 
1.5%
8
 
1.4%
7
 
1.2%
Other values (54) 115
19.5%
ASCII
ValueCountFrequency (%)
46
82.1%
) 5
 
8.9%
( 5
 
8.9%
Distinct32
Distinct (%)60.4%
Missing185
Missing (%)77.7%
Memory size2.0 KiB
2023-12-13T08:46:10.884678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters636
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)39.6%

Sample

1st row054-853-1933
2nd row054-841-5587
3rd row054-852-3995
4th row054-852-3995
5th row054-855-3200
ValueCountFrequency (%)
054-854-5703 6
 
11.3%
054-852-3995 4
 
7.5%
054-842-0056 4
 
7.5%
054-841-5587 3
 
5.7%
054-841-8877 3
 
5.7%
054-852-0112 2
 
3.8%
054-852-9016 2
 
3.8%
054-853-5400 2
 
3.8%
054-857-8007 2
 
3.8%
054-856-2627 2
 
3.8%
Other values (22) 23
43.4%
2023-12-13T08:46:11.229415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 111
17.5%
- 106
16.7%
0 99
15.6%
4 81
12.7%
8 69
10.8%
3 36
 
5.7%
2 36
 
5.7%
7 35
 
5.5%
1 27
 
4.2%
6 19
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 530
83.3%
Dash Punctuation 106
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 111
20.9%
0 99
18.7%
4 81
15.3%
8 69
13.0%
3 36
 
6.8%
2 36
 
6.8%
7 35
 
6.6%
1 27
 
5.1%
6 19
 
3.6%
9 17
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 106
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 636
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 111
17.5%
- 106
16.7%
0 99
15.6%
4 81
12.7%
8 69
10.8%
3 36
 
5.7%
2 36
 
5.7%
7 35
 
5.5%
1 27
 
4.2%
6 19
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 636
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 111
17.5%
- 106
16.7%
0 99
15.6%
4 81
12.7%
8 69
10.8%
3 36
 
5.7%
2 36
 
5.7%
7 35
 
5.5%
1 27
 
4.2%
6 19
 
3.0%

설계사무소명
Text

MISSING 

Distinct64
Distinct (%)30.6%
Missing29
Missing (%)12.2%
Memory size2.0 KiB
2023-12-13T08:46:11.510321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length10.200957
Min length7

Characters and Unicode

Total characters2132
Distinct characters98
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)19.1%

Sample

1st row건축사사무소 한림
2nd row하마 건축사사무소
3rd row건축사사무소 우진건축
4th row건축사사무소 우진건축
5th row(주)이지에이건축사사무소
ValueCountFrequency (%)
건축사사무소 137
37.8%
에이원 19
 
5.2%
안동건축 14
 
3.9%
용화 13
 
3.6%
우진건축 13
 
3.6%
동인 12
 
3.3%
가원건축사사무소 10
 
2.8%
경안건축 9
 
2.5%
건원건축 9
 
2.5%
합동엔건축사사무소 9
 
2.5%
Other values (57) 117
32.3%
2023-12-13T08:46:11.861304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
424
19.9%
293
13.7%
282
13.2%
209
9.8%
209
9.8%
168
 
7.9%
54
 
2.5%
35
 
1.6%
30
 
1.4%
23
 
1.1%
Other values (88) 405
19.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1936
90.8%
Space Separator 168
 
7.9%
Open Punctuation 14
 
0.7%
Close Punctuation 14
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
424
21.9%
293
15.1%
282
14.6%
209
10.8%
209
10.8%
54
 
2.8%
35
 
1.8%
30
 
1.5%
23
 
1.2%
23
 
1.2%
Other values (85) 354
18.3%
Space Separator
ValueCountFrequency (%)
168
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1936
90.8%
Common 196
 
9.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
424
21.9%
293
15.1%
282
14.6%
209
10.8%
209
10.8%
54
 
2.8%
35
 
1.8%
30
 
1.5%
23
 
1.2%
23
 
1.2%
Other values (85) 354
18.3%
Common
ValueCountFrequency (%)
168
85.7%
( 14
 
7.1%
) 14
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1936
90.8%
ASCII 196
 
9.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
424
21.9%
293
15.1%
282
14.6%
209
10.8%
209
10.8%
54
 
2.8%
35
 
1.8%
30
 
1.5%
23
 
1.2%
23
 
1.2%
Other values (85) 354
18.3%
ASCII
ValueCountFrequency (%)
168
85.7%
( 14
 
7.1%
) 14
 
7.1%
Distinct54
Distinct (%)29.3%
Missing54
Missing (%)22.7%
Memory size2.0 KiB
2023-12-13T08:46:12.085344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.005435
Min length12

Characters and Unicode

Total characters2209
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)16.8%

Sample

1st row054-853-1933
2nd row054-901-7262
3rd row054-852-3995
4th row054-852-3995
5th row02-3446-8846
ValueCountFrequency (%)
054-842-0056 19
 
10.3%
054-852-3995 13
 
7.1%
054-841-8877 13
 
7.1%
054-854-5703 12
 
6.5%
054-854-4444 9
 
4.9%
054-841-4533 9
 
4.9%
054-857-8826 9
 
4.9%
054-843-0320 7
 
3.8%
054-856-2627 7
 
3.8%
054-901-7262 6
 
3.3%
Other values (44) 80
43.5%
2023-12-13T08:46:12.433348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 369
16.7%
- 368
16.7%
4 333
15.1%
0 319
14.4%
8 236
10.7%
2 137
 
6.2%
3 116
 
5.3%
7 103
 
4.7%
6 91
 
4.1%
1 80
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1841
83.3%
Dash Punctuation 368
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 369
20.0%
4 333
18.1%
0 319
17.3%
8 236
12.8%
2 137
 
7.4%
3 116
 
6.3%
7 103
 
5.6%
6 91
 
4.9%
1 80
 
4.3%
9 57
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 368
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2209
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 369
16.7%
- 368
16.7%
4 333
15.1%
0 319
14.4%
8 236
10.7%
2 137
 
6.2%
3 116
 
5.3%
7 103
 
4.7%
6 91
 
4.1%
1 80
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2209
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 369
16.7%
- 368
16.7%
4 333
15.1%
0 319
14.4%
8 236
10.7%
2 137
 
6.2%
3 116
 
5.3%
7 103
 
4.7%
6 91
 
4.1%
1 80
 
3.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-08-17
238 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-17
2nd row2023-08-17
3rd row2023-08-17
4th row2023-08-17
5th row2023-08-17

Common Values

ValueCountFrequency (%)
2023-08-17 238
100.0%

Length

2023-12-13T08:46:12.561846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:46:12.644822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-17 238
100.0%

Interactions

2023-12-13T08:46:06.611710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:46:12.703932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분주용도시공업체명시공업체 대표번호감리사무소명감리사무소 대표번호설계사무소명설계사무소 대표번호
연번1.0000.9950.2900.4230.6520.8060.9120.3730.435
구분0.9951.0000.3620.9540.9590.6251.0000.4090.399
주용도0.2900.3621.0000.9980.9880.0000.6170.9160.895
시공업체명0.4230.9540.9981.0001.0001.0001.0000.9971.000
시공업체 대표번호0.6520.9590.9881.0001.0000.9840.9790.9980.999
감리사무소명0.8060.6250.0001.0000.9841.0001.0000.9860.986
감리사무소 대표번호0.9121.0000.6171.0000.9791.0001.0000.9820.985
설계사무소명0.3730.4090.9160.9970.9980.9860.9821.0001.000
설계사무소 대표번호0.4350.3990.8951.0000.9990.9860.9851.0001.000
2023-12-13T08:46:12.814889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분주용도
구분1.0000.276
주용도0.2761.000
2023-12-13T08:46:12.902608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분주용도
연번1.0000.9230.119
구분0.9231.0000.276
주용도0.1190.2761.000

Missing values

2023-12-13T08:46:06.706194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:46:06.887412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:46:06.997774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번구분시공 대지위치주용도착공예정일착공처리일준공예정일시공업체명시공업체 대표번호감리사무소명감리사무소 대표번호설계사무소명설계사무소 대표번호데이터기준일자
02허가경상북도 안동시 북후면 도촌리 622-1 외1필지단독주택2023-07-172023-07-172023-08-10<NA><NA>건축사사무소 한림054-853-1933건축사사무소 한림054-853-19332023-08-17
13허가경상북도 안동시 풍산읍 노리 212-4 외1필지제1종근린생활시설2023-08-112023-08-162023-10-31시우종합건설(주)054-901-4436길 건축사사무소054-841-5587하마 건축사사무소054-901-72622023-08-17
24허가경상북도 안동시 임동면 고천리 40-6창고시설2023-07-312023-07-202023-09-30<NA><NA>건축사사무소 우진건축054-852-3995건축사사무소 우진건축054-852-39952023-08-17
35허가경상북도 안동시 임동면 고천리 40-6창고시설2023-07-312023-07-202023-09-30<NA><NA>건축사사무소 우진건축054-852-3995건축사사무소 우진건축054-852-39952023-08-17
46허가경상북도 안동시 풍산읍 매곡리 1123 외2필지공장2023-08-022023-08-082024-01-29합자회사 세원이엔씨054-440-0352<NA><NA>(주)이지에이건축사사무소02-3446-88462023-08-17
57허가경상북도 안동시 옥야동 359-10제2종근린생활시설2023-06-302023-07-312023-10-31주식회사태수건설054-553-7625건축사사무소 이상건축054-855-3200준 건축사사무소054-859-77752023-08-17
68허가경상북도 안동시 수상동 820-124제2종근린생활시설2023-06-222023-06-222023-09-30주식회사 지에스건설053-257-9078지호건축사사무소<NA>지호건축사사무소053-282-90782023-08-17
79허가경상북도 안동시 정하동 308-6단독주택2023-06-192023-06-132023-09-20<NA><NA>건축사사무소 광명건축054-856-2627도원건축사사무소054-843-03202023-08-17
810허가경상북도 안동시 길안면 천지리 455 외4필지공장2023-07-262023-07-252023-11-30지오종합건설(주)054-741-7720건축사사무소반석<NA>(주)휘츠엔지니어링건축사사무소031-421-46672023-08-17
911허가경상북도 안동시 일직면 국곡리 산 156-22동물및식물관련시설2023-06-072023-06-092023-12-31<NA><NA>주식회사 종합건축사사무소 원건축054-853-5400주식회사 종합건축사사무소 원건축054-853-54002023-08-17
연번구분시공 대지위치주용도착공예정일착공처리일준공예정일시공업체명시공업체 대표번호감리사무소명감리사무소 대표번호설계사무소명설계사무소 대표번호데이터기준일자
228230신고경상북도 안동시 풍산읍 매곡리 720-4창고시설2022-08-102022-08-112022-12-20<NA><NA><NA><NA><NA><NA>2023-08-17
229231신고경상북도 안동시 와룡면 중가구리 산 129제1종근린생활시설2022-08-012022-08-012023-12-31<NA><NA><NA><NA>가원건축사사무소<NA>2023-08-17
230232신고경상북도 안동시 일직면 조탑리 645동물및식물관련시설2023-07-032023-06-292024-07-02<NA><NA><NA><NA><NA>054-633-06032023-08-17
231233신고경상북도 안동시 북후면 옹천리 산 128 외2필지제2종근린생활시설2022-08-162022-08-162022-12-30<NA><NA><NA><NA>세다건축사사무소054-852-90162023-08-17
232234신고경상북도 안동시 풍천면 도양리 1116-1 외1필지단독주택2022-10-012022-09-262023-06-01<NA><NA><NA><NA>예도건축사사무소<NA>2023-08-17
233235신고경상북도 안동시 와룡면 태리 121-9단독주택2022-08-012022-07-262025-06-30<NA><NA><NA><NA>건축사사무소 웅부054-841-36962023-08-17
234236신고경상북도 안동시 임하면 추목리 125-3제2종근린생활시설2023-03-312023-03-272023-06-30<NA><NA><NA><NA>하마 건축사사무소054-901-72622023-08-17
235237신고경상북도 안동시 녹전면 원천리 227-1동물및식물관련시설2022-08-012022-07-262022-12-30<NA>054-854-8826<NA><NA><NA>054-854-88262023-08-17
236238신고경상북도 안동시 임동면 고천리 산 21-5단독주택2022-07-172022-07-152022-12-31<NA><NA><NA><NA>용화 건축사사무소054-841-88772023-08-17
237239신고경상북도 안동시 서후면 대두서리 442-1제1종근린생활시설2022-07-252022-07-292023-12-31바론산업개발주식회사054-982-9901건축사사무소이도054-857-3118건축사사무소현대건축054-331-04262023-08-17