Overview

Dataset statistics

Number of variables: 15
Number of observations: 601
Missing cells: 2010
Missing cells (%): 22.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 71.1 KiB
Average record size in memory: 121.2 B

Variable types

Numeric: 1
Categorical: 3
Text: 7
DateTime: 4

Dataset

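Description: Data on construction projects that have received use approval (사용승인), taken from the construction-site information of Andong-si, Gyeongsangbuk-do. (연번, 구분, 시공대지위치, 주용도, 착공예정일, 실제착공일, 사용승인일, 시공업체명, 시공업체대표번호, 감리사무소명, 감리사무소 대표번호, 설계사무소명, 설계사무소 대표번호, 데이터기준일자)
URL: https://www.data.go.kr/data/15121236/fileData.do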

Alerts

데이터기준일자 has constant value "2023-08-17" (Constant)
연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
실제착공일 has 70 (11.6%) missing values (Missing)
시공업체명 has 501 (83.4%) missing values (Missing)
시공업체 대표번호 has 430 (71.5%) missing values (Missing)
감리사무소명 has 419 (69.7%) missing values (Missing)
감리사무소 대표번호 has 435 (72.4%) missing values (Missing)
설계사무소명 has 35 (5.8%) missing values (Missing)
설계사무소 대표번호 has 118 (19.6%) missing values (Missing)
연번 has unique values (Unique)
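
The missing-value figures in this report can be cross-checked against each other. The sketch below is only an arithmetic check on numbers copied from the report, not a computation on the dataset itself. It assumes the percentages are rounded to one decimal, and it adds the two date variables that each report a single missing value in their own sections but do not appear in the Alerts list.

```python
# Counts copied from this report; nothing is read from the dataset itself.
N_ROWS = 601
N_COLUMNS = 15

# Missing-value counts from the Alerts list.
alert_missing = {
    "실제착공일": 70,
    "시공업체명": 501,
    "시공업체 대표번호": 430,
    "감리사무소명": 419,
    "감리사무소 대표번호": 435,
    "설계사무소명": 35,
    "설계사무소 대표번호": 118,
}


def missing_pct(count: int, total: int = N_ROWS) -> float:
    """Share of missing values in percent, rounded to one decimal."""
    return round(100 * count / total, 1)


per_column_pct = {name: missing_pct(n) for name, n in alert_missing.items()}

# Two date variables report one missing value each in their own sections
# and are not listed under Alerts, so they are added separately here.
missing_outside_alerts = 2
total_missing = sum(alert_missing.values()) + missing_outside_alerts

# Overall share: missing cells divided by all cells (rows x columns).
overall_pct = missing_pct(total_missing, N_ROWS * N_COLUMNS)

for name, pct in per_column_pct.items():
    print(f"{name}: {pct}%")
print(f"total missing cells: {total_missing} ({overall_pct}%)")
```

The per-column percentages and the overall total agree with the Overview figures (2010 missing cells, 22.3%).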

Reproduction

Analysis started: 2023-12-12 04:07:45.261227
Analysis finished: 2023-12-12 04:07:47.075372
Duration: 1.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
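
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used here: the file paths, the encoding and the report title are hypothetical, and the settings stored in config.json are not reproduced.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write the result as an HTML report.

    csv_path, html_path and encoding are hypothetical inputs; this report
    does not record which file or encoding was used.
    """
    # Imported inside the function so the sketch can be loaded even where
    # the third-party packages (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)


# Example call (hypothetical file names):
# build_report("andong_construction.csv", "report.html", encoding="cp949")
```

Files published on Korean public data portals are often encoded as CP949 rather than UTF-8, so the encoding argument may need adjusting; that is an assumption to verify against the downloaded file.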

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 601
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 301
Minimum: 1
Maximum: 601
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.4 KiB

Quantile statistics

Minimum: 1
5th percentile: 31
Q1: 151
Median: 301
Q3: 451
95th percentile: 571
Maximum: 601
Range: 600
Interquartile range (IQR): 300
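
The quantile figures above are what linear interpolation gives for a plain running index. The sketch below assumes that 연번 is exactly the sequence 1..601 (which the distinct count, minimum, maximum and sum suggest but the report does not state) and uses the standard library's inclusive quantile method as a stand-in for whatever the profiler calls internally.

```python
from statistics import quantiles

# Assumption: 연번 is the running index 1, 2, ..., 601.
values = list(range(1, 602))

# method="inclusive" treats the data as containing its own minimum and
# maximum and interpolates linearly between neighbouring values.
quartiles = quantiles(values, n=4, method="inclusive")    # Q1, median, Q3
twentieths = quantiles(values, n=20, method="inclusive")  # 5%, 10%, ..., 95%

q1, med, q3 = quartiles
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, med, q3, p95, iqr, value_range)
```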

Descriptive statistics

Standard deviation: 173.63803
Coefficient of variation (CV): 0.57687054
Kurtosis: -1.2
Mean: 301
Median Absolute Deviation (MAD): 150
Skewness: 0
Sum: 180901
Variance: 30150.167
Monotonicity: Strictly increasing
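
Several of the descriptive statistics can be reproduced by hand under the same assumption that 연번 is the sequence 1..601. The sketch uses the sample variance (division by n - 1), which is the convention that matches the reported values; kurtosis and skewness are left out.

```python
from statistics import mean, median, stdev, variance

# Assumption: 연번 is the running index 1, 2, ..., 601.
values = list(range(1, 602))

avg = mean(values)
var = variance(values)   # sample variance, divides by n - 1
std = stdev(values)      # square root of the sample variance
cv = std / avg           # coefficient of variation

# Median absolute deviation: median distance from the median.
centre = median(values)
mad = median(abs(v - centre) for v in values)

total = sum(values)

print(avg, round(var, 3), round(std, 5), round(cv, 8), mad, total)
```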
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
396 | 1 | 0.2%
398 | 1 | 0.2%
399 | 1 | 0.2%
400 | 1 | 0.2%
401 | 1 | 0.2%
402 | 1 | 0.2%
403 | 1 | 0.2%
404 | 1 | 0.2%
405 | 1 | 0.2%
Other values (591) | 591 | 98.3%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
601 | 1 | 0.2%
600 | 1 | 0.2%
599 | 1 | 0.2%
598 | 1 | 0.2%
597 | 1 | 0.2%
596 | 1 | 0.2%
595 | 1 | 0.2%
594 | 1 | 0.2%
593 | 1 | 0.2%
592 | 1 | 0.2%

구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
신고: 414
허가: 187

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신고
2nd row: 신고
3rd row: 신고
4th row: 신고
5th row: 신고

Common Values

Value | Count | Frequency (%)
신고 | 414 | 68.9%
허가 | 187 | 31.1%

Length

Histogram of lengths of the category

시공 대지위치
Text

Distinct: 582
Distinct (%): 96.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 33
Median length: 30
Mean length: 21.68386
Min length: 15

Characters and Unicode

Total characters: 13032
Distinct characters: 148
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
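
As an illustration of these character properties, the sketch below classifies the characters of one address from this report by Unicode general category, using only Python's standard `unicodedata` module. It is not the profiler's own implementation, and the category-name mapping covers only the codes that occur in the example.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
}


def category_counts(text: str) -> Counter:
    """Count the characters of text by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "경상북도 안동시 화성동 14-32"  # an address value taken from this report
counts = category_counts(sample)

for code, count in counts.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)}: {count}")
```

Hangul syllables fall under Other Letter, which is why that category dominates the address and company-name variables in this report.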

Unique

Unique: 565
Unique (%): 94.0%

Sample

1st row: 경상북도 안동시 화성동 14-32
2nd row: 경상북도 안동시 풍천면 가곡리 262-4
3rd row: 경상북도 안동시 풍산읍 괴정리 626 외1필지
4th row: 경상북도 안동시 풍천면 구담리 287
5th row: 경상북도 안동시 풍산읍 매곡리 1138

Value | Count | Frequency (%)
경상북도 | 601 | 19.8%
안동시 | 601 | 19.8%
외1필지 | 88 | 2.9%
풍산읍 | 69 | 2.3%
와룡면 | 41 | 1.3%
서후면 | 41 | 1.3%
풍천면 | 39 | 1.3%
남후면 | 36 | 1.2%
외2필지 | 33 | 1.1%
북후면 | 32 | 1.1%
Other values (738) | 1458 | 48.0%

Most occurring characters

ValueCountFrequency (%)
2440
18.7%
792
 
6.1%
659
 
5.1%
657
 
5.0%
636
 
4.9%
626
 
4.8%
602
 
4.6%
601
 
4.6%
1 547
 
4.2%
444
 
3.4%
Other values (138) 5028
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7834
60.1%
Space Separator 2440
 
18.7%
Decimal Number 2399
 
18.4%
Dash Punctuation 353
 
2.7%
Uppercase Letter 4
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
792
 
10.1%
659
 
8.4%
657
 
8.4%
636
 
8.1%
626
 
8.0%
602
 
7.7%
601
 
7.7%
444
 
5.7%
365
 
4.7%
184
 
2.3%
Other values (120) 2268
29.0%
Decimal Number
ValueCountFrequency (%)
1 547
22.8%
2 323
13.5%
4 236
9.8%
5 236
9.8%
3 228
9.5%
8 178
 
7.4%
6 173
 
7.2%
7 164
 
6.8%
9 160
 
6.7%
0 154
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
B 1
25.0%
J 1
25.0%
K 1
25.0%
Space Separator
ValueCountFrequency (%)
2440
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 353
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7834
60.1%
Common 5194
39.9%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
792
 
10.1%
659
 
8.4%
657
 
8.4%
636
 
8.1%
626
 
8.0%
602
 
7.7%
601
 
7.7%
444
 
5.7%
365
 
4.7%
184
 
2.3%
Other values (120) 2268
29.0%
Common
ValueCountFrequency (%)
2440
47.0%
1 547
 
10.5%
- 353
 
6.8%
2 323
 
6.2%
4 236
 
4.5%
5 236
 
4.5%
3 228
 
4.4%
8 178
 
3.4%
6 173
 
3.3%
7 164
 
3.2%
Other values (4) 316
 
6.1%
Latin
ValueCountFrequency (%)
A 1
25.0%
B 1
25.0%
J 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7834
60.1%
ASCII 5198
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2440
46.9%
1 547
 
10.5%
- 353
 
6.8%
2 323
 
6.2%
4 236
 
4.5%
5 236
 
4.5%
3 228
 
4.4%
8 178
 
3.4%
6 173
 
3.3%
7 164
 
3.2%
Other values (8) 320
 
6.2%
Hangul
ValueCountFrequency (%)
792
 
10.1%
659
 
8.4%
657
 
8.4%
636
 
8.1%
626
 
8.0%
602
 
7.7%
601
 
7.7%
444
 
5.7%
365
 
4.7%
184
 
2.3%
Other values (120) 2268
29.0%

주용도
Categorical

Distinct: 20
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
단독주택: 216
동물및식물관련시설: 94
제2종근린생활시설: 81
창고시설: 79
제1종근린생활시설: 73
Other values (15): 58

Length

Max length: 10
Median length: 4
Mean length: 6.1181364
Min length: 2

Unique

Unique: 3
Unique (%): 0.5%

Sample

1st row: 제2종근린생활시설
2nd row: 단독주택
3rd row: 제2종근린생활시설
4th row: 동물및식물관련시설
5th row: 교육연구시설

Common Values

Value | Count | Frequency (%)
단독주택 | 216 | 35.9%
동물및식물관련시설 | 94 | 15.6%
제2종근린생활시설 | 81 | 13.5%
창고시설 | 79 | 13.1%
제1종근린생활시설 | 73 | 12.1%
공장 | 18 | 3.0%
교육연구시설 | 6 | 1.0%
노유자시설 | 4 | 0.7%
위험물저장및처리시설 | 4 | 0.7%
숙박시설 | 4 | 0.7%
Other values (10) | 22 | 3.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
단독주택 | 216 | 35.9%
동물및식물관련시설 | 94 | 15.6%
제2종근린생활시설 | 81 | 13.5%
창고시설 | 79 | 13.1%
제1종근린생활시설 | 73 | 12.1%
공장 | 18 | 3.0%
교육연구시설 | 6 | 1.0%
숙박시설 | 4 | 0.7%
의료시설 | 4 | 0.7%
위험물저장및처리시설 | 4 | 0.7%
Other values (10) | 22 | 3.7%

착공예정일
Date

Distinct: 348
Distinct (%): 58.0%
Missing: 1
Missing (%): 0.2%
Memory size: 4.8 KiB
Minimum: 2011-04-29 00:00:00
Maximum: 2023-07-26 00:00:00
Histogram with fixed size bins (bins=50)

착공처리일
Date

Distinct: 332
Distinct (%): 55.3%
Missing: 1
Missing (%): 0.2%
Memory size: 4.8 KiB
Minimum: 2011-04-27 00:00:00
Maximum: 2023-07-26 00:00:00
Histogram with fixed size bins (bins=50)

실제착공일
Date

MISSING 

Distinct: 326
Distinct (%): 61.4%
Missing: 70
Missing (%): 11.6%
Memory size: 4.8 KiB
Minimum: 2013-05-30 00:00:00
Maximum: 2023-09-05 00:00:00
Histogram with fixed size bins (bins=50)

사용승인일
Date

Distinct: 243
Distinct (%): 40.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
Minimum: 2022-07-01 00:00:00
Maximum: 2023-08-16 00:00:00
Histogram with fixed size bins (bins=50)

시공업체명
Text

MISSING 

Distinct: 77
Distinct (%): 77.0%
Missing: 501
Missing (%): 83.4%
Memory size: 4.8 KiB

Length

Max length: 11
Median length: 10
Mean length: 8.1
Min length: 4

Characters and Unicode

Total characters: 810
Distinct characters: 102
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 58
Unique (%): 58.0%

Sample

1st row: (주)가온건설
2nd row: (주)한울건업
3rd row: (주)에이치엘종합건설
4th row: (주)호제
5th row: 대림종합개발(주)

Value | Count | Frequency (%)
주식회사 | 7 | 6.5%
주)도현개발 | 4 | 3.7%
태형종합건설(주 | 3 | 2.8%
주식회사이건 | 3 | 2.8%
우대건설(주 | 2 | 1.9%
주식회사정원종합건설 | 2 | 1.9%
주)흥원종합건설 | 2 | 1.9%
주)세림종합건설 | 2 | 1.9%
주)지평종합건설 | 2 | 1.9%
주)한길건설 | 2 | 1.9%
Other values (68) | 78 | 72.9%

Most occurring characters

ValueCountFrequency (%)
100
 
12.3%
78
 
9.6%
72
 
8.9%
( 69
 
8.5%
) 69
 
8.5%
40
 
4.9%
40
 
4.9%
29
 
3.6%
29
 
3.6%
29
 
3.6%
Other values (92) 255
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 664
82.0%
Open Punctuation 69
 
8.5%
Close Punctuation 69
 
8.5%
Space Separator 7
 
0.9%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
15.1%
78
 
11.7%
72
 
10.8%
40
 
6.0%
40
 
6.0%
29
 
4.4%
29
 
4.4%
29
 
4.4%
10
 
1.5%
10
 
1.5%
Other values (88) 227
34.2%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 665
82.1%
Common 145
 
17.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
15.0%
78
 
11.7%
72
 
10.8%
40
 
6.0%
40
 
6.0%
29
 
4.4%
29
 
4.4%
29
 
4.4%
10
 
1.5%
10
 
1.5%
Other values (89) 228
34.3%
Common
ValueCountFrequency (%)
( 69
47.6%
) 69
47.6%
7
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 664
82.0%
ASCII 145
 
17.9%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
15.1%
78
 
11.7%
72
 
10.8%
40
 
6.0%
40
 
6.0%
29
 
4.4%
29
 
4.4%
29
 
4.4%
10
 
1.5%
10
 
1.5%
Other values (88) 227
34.2%
ASCII
ValueCountFrequency (%)
( 69
47.6%
) 69
47.6%
7
 
4.8%
None
ValueCountFrequency (%)
1
100.0%

시공업체 대표번호
Text

Distinct: 98
Distinct (%): 57.3%
Missing: 430
Missing (%): 71.5%
Memory size: 4.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12
Min length: 11

Characters and Unicode

Total characters: 2052
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 72
Unique (%): 42.1%

Sample

1st row: 054-850-6905
2nd row: 054-854-5703
3rd row: 054-856-2627
4th row: 054-854-9201
5th row: 054-857-3118

Value | Count | Frequency (%)
054-854-5703 | 15 | 8.8%
054-857-3118 | 15 | 8.8%
054-855-3200 | 11 | 6.4%
054-856-2627 | 6 | 3.5%
054-854-8826 | 6 | 3.5%
054-854-9201 | 4 | 2.3%
054-822-0070 | 3 | 1.8%
054-843-0661 | 3 | 1.8%
054-464-0043 | 2 | 1.2%
054-456-0456 | 2 | 1.2%
Other values (88) | 104 | 60.8%

Most occurring characters

ValueCountFrequency (%)
- 342
16.7%
5 325
15.8%
0 300
14.6%
4 249
12.1%
8 203
9.9%
3 150
7.3%
7 118
 
5.8%
2 111
 
5.4%
1 99
 
4.8%
6 83
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1710
83.3%
Dash Punctuation 342
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 325
19.0%
0 300
17.5%
4 249
14.6%
8 203
11.9%
3 150
8.8%
7 118
 
6.9%
2 111
 
6.5%
1 99
 
5.8%
6 83
 
4.9%
9 72
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 342
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2052
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 342
16.7%
5 325
15.8%
0 300
14.6%
4 249
12.1%
8 203
9.9%
3 150
7.3%
7 118
 
5.8%
2 111
 
5.4%
1 99
 
4.8%
6 83
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2052
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 342
16.7%
5 325
15.8%
0 300
14.6%
4 249
12.1%
8 203
9.9%
3 150
7.3%
7 118
 
5.8%
2 111
 
5.4%
1 99
 
4.8%
6 83
 
4.0%

감리사무소명
Text

MISSING 

Distinct: 75
Distinct (%): 41.2%
Missing: 419
Missing (%): 69.7%
Memory size: 4.8 KiB

Length

Max length: 18
Median length: 16.5
Mean length: 10.362637
Min length: 6

Characters and Unicode

Total characters: 1886
Distinct characters: 97
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 38
Unique (%): 20.9%

Sample

1st row: 경덕건축사사무소
2nd row: (주)선진엔지니어링종합건축사사무소
3rd row: 건축사사무소 우신건축
4th row: 올바른건축사사무소
5th row: 진영기술 건축사사무소

Value | Count | Frequency (%)
건축사사무소 | 99 | 31.9%
우진건축 | 11 | 3.5%
종합건축사사무소 | 11 | 3.5%
주식회사 | 10 | 3.2%
원건축 | 10 | 3.2%
용화 | 9 | 2.9%
동인 | 8 | 2.6%
건원건축 | 8 | 2.6%
건축사사무소반석 | 7 | 2.3%
가원건축사사무소 | 6 | 1.9%
Other values (67) | 131 | 42.3%

Most occurring characters

ValueCountFrequency (%)
369
19.6%
254
13.5%
245
13.0%
177
9.4%
177
9.4%
128
 
6.8%
38
 
2.0%
27
 
1.4%
25
 
1.3%
24
 
1.3%
Other values (87) 422
22.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1724
91.4%
Space Separator 128
 
6.8%
Close Punctuation 15
 
0.8%
Open Punctuation 15
 
0.8%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
369
21.4%
254
14.7%
245
14.2%
177
10.3%
177
10.3%
38
 
2.2%
27
 
1.6%
25
 
1.5%
24
 
1.4%
22
 
1.3%
Other values (81) 366
21.2%
Uppercase Letter
ValueCountFrequency (%)
M 2
50.0%
C 1
25.0%
S 1
25.0%
Space Separator
ValueCountFrequency (%)
128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1724
91.4%
Common 158
 
8.4%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
369
21.4%
254
14.7%
245
14.2%
177
10.3%
177
10.3%
38
 
2.2%
27
 
1.6%
25
 
1.5%
24
 
1.4%
22
 
1.3%
Other values (81) 366
21.2%
Common
ValueCountFrequency (%)
128
81.0%
) 15
 
9.5%
( 15
 
9.5%
Latin
ValueCountFrequency (%)
M 2
50.0%
C 1
25.0%
S 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1724
91.4%
ASCII 162
 
8.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
369
21.4%
254
14.7%
245
14.2%
177
10.3%
177
10.3%
38
 
2.2%
27
 
1.6%
25
 
1.5%
24
 
1.4%
22
 
1.3%
Other values (81) 366
21.2%
ASCII
ValueCountFrequency (%)
128
79.0%
) 15
 
9.3%
( 15
 
9.3%
M 2
 
1.2%
C 1
 
0.6%
S 1
 
0.6%

감리사무소 대표번호
Text

Distinct: 62
Distinct (%): 37.3%
Missing: 435
Missing (%): 72.4%
Memory size: 4.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12
Min length: 11

Characters and Unicode

Total characters: 1992
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 32
Unique (%): 19.3%

Sample

1st row: 054-842-0508
2nd row: 02-6333-3000
3rd row: 054-841-6200
4th row: 054-853-1933
5th row: 054-857-3118

Value | Count | Frequency (%)
054-852-3995 | 11 | 6.6%
054-841-8877 | 11 | 6.6%
054-853-5400 | 10 | 6.0%
054-841-4533 | 9 | 5.4%
054-854-5703 | 8 | 4.8%
054-857-8007 | 6 | 3.6%
054-842-0508 | 6 | 3.6%
054-842-0056 | 5 | 3.0%
054-857-3118 | 5 | 3.0%
054-859-7775 | 5 | 3.0%
Other values (52) | 90 | 54.2%

Most occurring characters

ValueCountFrequency (%)
5 363
18.2%
- 332
16.7%
0 307
15.4%
4 270
13.6%
8 217
10.9%
3 109
 
5.5%
7 106
 
5.3%
2 90
 
4.5%
1 87
 
4.4%
9 60
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1660
83.3%
Dash Punctuation 332
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 363
21.9%
0 307
18.5%
4 270
16.3%
8 217
13.1%
3 109
 
6.6%
7 106
 
6.4%
2 90
 
5.4%
1 87
 
5.2%
9 60
 
3.6%
6 51
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 332
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1992
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 363
18.2%
- 332
16.7%
0 307
15.4%
4 270
13.6%
8 217
10.9%
3 109
 
5.5%
7 106
 
5.3%
2 90
 
4.5%
1 87
 
4.4%
9 60
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1992
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 363
18.2%
- 332
16.7%
0 307
15.4%
4 270
13.6%
8 217
10.9%
3 109
 
5.5%
7 106
 
5.3%
2 90
 
4.5%
1 87
 
4.4%
9 60
 
3.0%

설계사무소명
Text

MISSING 

Distinct: 122
Distinct (%): 21.6%
Missing: 35
Missing (%): 5.8%
Memory size: 4.8 KiB

Length

Max length: 18
Median length: 17
Mean length: 10.166078
Min length: 5

Characters and Unicode

Total characters: 5754
Distinct characters: 131
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 66
Unique (%): 11.7%

Sample

1st row: 용화 건축사사무소
2nd row: 무이건축사사무소
3rd row: 용화 건축사사무소
4th row: 종합건축사사무소 건원건축
5th row: 건축사사무소 동인

Value | Count | Frequency (%)
건축사사무소 | 330 | 33.6%
주식회사 | 29 | 3.0%
종합건축사사무소 | 28 | 2.9%
우진건축 | 27 | 2.8%
건축사사무소반석 | 25 | 2.5%
성원 | 24 | 2.4%
원건축 | 24 | 2.4%
동인 | 24 | 2.4%
건원건축 | 23 | 2.3%
합동엔건축사사무소 | 22 | 2.2%
Other values (118) | 425 | 43.3%

Most occurring characters

ValueCountFrequency (%)
1149
20.0%
790
13.7%
764
13.3%
556
9.7%
555
9.6%
429
 
7.5%
125
 
2.2%
78
 
1.4%
63
 
1.1%
61
 
1.1%
Other values (121) 1184
20.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5255
91.3%
Space Separator 429
 
7.5%
Close Punctuation 32
 
0.6%
Open Punctuation 32
 
0.6%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1149
21.9%
790
15.0%
764
14.5%
556
10.6%
555
10.6%
125
 
2.4%
78
 
1.5%
63
 
1.2%
61
 
1.2%
61
 
1.2%
Other values (114) 1053
20.0%
Uppercase Letter
ValueCountFrequency (%)
M 2
33.3%
S 2
33.3%
C 1
16.7%
J 1
16.7%
Space Separator
ValueCountFrequency (%)
429
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5255
91.3%
Common 493
 
8.6%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1149
21.9%
790
15.0%
764
14.5%
556
10.6%
555
10.6%
125
 
2.4%
78
 
1.5%
63
 
1.2%
61
 
1.2%
61
 
1.2%
Other values (114) 1053
20.0%
Latin
ValueCountFrequency (%)
M 2
33.3%
S 2
33.3%
C 1
16.7%
J 1
16.7%
Common
ValueCountFrequency (%)
429
87.0%
) 32
 
6.5%
( 32
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5255
91.3%
ASCII 499
 
8.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1149
21.9%
790
15.0%
764
14.5%
556
10.6%
555
10.6%
125
 
2.4%
78
 
1.5%
63
 
1.2%
61
 
1.2%
61
 
1.2%
Other values (114) 1053
20.0%
ASCII
ValueCountFrequency (%)
429
86.0%
) 32
 
6.4%
( 32
 
6.4%
M 2
 
0.4%
S 2
 
0.4%
C 1
 
0.2%
J 1
 
0.2%

설계사무소 대표번호
Text

Distinct: 86
Distinct (%): 17.8%
Missing: 118
Missing (%): 19.6%
Memory size: 4.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.993789
Min length: 11

Characters and Unicode

Total characters: 5793
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 46
Unique (%): 9.5%

Sample

1st row: 054-841-8877
2nd row: 054-841-8877
3rd row: 054-841-4533
4th row: 054-854-5703
5th row: 054-853-1933

Value | Count | Frequency (%)
054-853-5400 | 28 | 5.8%
054-852-3995 | 27 | 5.6%
054-856-3297 | 26 | 5.4%
054-841-8877 | 25 | 5.2%
054-854-5703 | 24 | 5.0%
054-841-4533 | 23 | 4.8%
054-854-4444 | 22 | 4.6%
054-856-2627 | 20 | 4.1%
054-855-4521 | 19 | 3.9%
054-842-0056 | 18 | 3.7%
Other values (76) | 251 | 52.0%

Most occurring characters

ValueCountFrequency (%)
5 1050
18.1%
- 965
16.7%
4 887
15.3%
0 800
13.8%
8 623
10.8%
3 319
 
5.5%
2 318
 
5.5%
7 279
 
4.8%
1 213
 
3.7%
6 181
 
3.1%
Other values (2) 158
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4827
83.3%
Dash Punctuation 965
 
16.7%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1050
21.8%
4 887
18.4%
0 800
16.6%
8 623
12.9%
3 319
 
6.6%
2 318
 
6.6%
7 279
 
5.8%
1 213
 
4.4%
6 181
 
3.7%
9 157
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 965
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5793
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1050
18.1%
- 965
16.7%
4 887
15.3%
0 800
13.8%
8 623
10.8%
3 319
 
5.5%
2 318
 
5.5%
7 279
 
4.8%
1 213
 
3.7%
6 181
 
3.1%
Other values (2) 158
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5793
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1050
18.1%
- 965
16.7%
4 887
15.3%
0 800
13.8%
8 623
10.8%
3 319
 
5.5%
2 318
 
5.5%
7 279
 
4.8%
1 213
 
3.7%
6 181
 
3.1%
Other values (2) 158
 
2.7%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
2023-08-17: 601

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-08-17
2nd row: 2023-08-17
3rd row: 2023-08-17
4th row: 2023-08-17
5th row: 2023-08-17

Common Values

Value | Count | Frequency (%)
2023-08-17 | 601 | 100.0%

Length

Histogram of lengths of the category

Interactions


Correlations

Variable | 연번 | 구분 | 주용도 | 시공업체명 | 시공업체 대표번호 | 감리사무소명 | 감리사무소 대표번호 | 설계사무소 대표번호
연번 | 1.000 | 0.999 | 0.383 | 0.943 | 0.616 | 0.000 | 0.000 | 0.492
구분 | 0.999 | 1.000 | 0.472 | 0.837 | 0.774 | 0.716 | 0.708 | 0.435
주용도 | 0.383 | 0.472 | 1.000 | 0.903 | 0.975 | 0.913 | 0.952 | 0.914
시공업체명 | 0.943 | 0.837 | 0.903 | 1.000 | 1.000 | 0.983 | 0.984 | 0.991
시공업체 대표번호 | 0.616 | 0.774 | 0.975 | 1.000 | 1.000 | 0.983 | 0.979 | 0.999
감리사무소명 | 0.000 | 0.716 | 0.913 | 0.983 | 0.983 | 1.000 | 1.000 | 0.995
감리사무소 대표번호 | 0.000 | 0.708 | 0.952 | 0.984 | 0.979 | 1.000 | 1.000 | 0.994
설계사무소 대표번호 | 0.492 | 0.435 | 0.914 | 0.991 | 0.999 | 0.995 | 0.994 | 1.000

Variable | 구분 | 주용도
구분 | 1.000 | 0.369
주용도 | 0.369 | 1.000

Variable | 연번 | 구분 | 주용도
연번 | 1.000 | 0.969 | 0.128
구분 | 0.969 | 1.000 | 0.369
주용도 | 0.128 | 0.369 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
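
One common way to compute a nullity correlation is the Pearson correlation between per-column presence indicators. The sketch below applies that idea to 감리사무소명 and 감리사무소 대표번호 using only the 20 rows printed in the Sample section of this report (indices 0 to 9 and 591 to 600), so the result is illustrative and is not the value for the full 601-row dataset. The `pearson` helper is written here for the example and is not part of the profiler.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# 1 = value present, 0 = <NA>, read off the 20 sample rows of this report.
name_present = [0] * 10 + [1, 1, 0, 0, 1, 1, 1, 1, 1, 1]   # 감리사무소명
phone_present = [0] * 10 + [1] * 10                        # 감리사무소 대표번호

r = pearson(name_present, phone_present)
print(round(r, 3))
```

A value near 1 means the two columns tend to be filled in or left empty together; a value near -1 means one tends to be present exactly when the other is missing.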

Sample

index | 연번 | 구분 | 시공 대지위치 | 주용도 | 착공예정일 | 착공처리일 | 실제착공일 | 사용승인일 | 시공업체명 | 시공업체 대표번호 | 감리사무소명 | 감리사무소 대표번호 | 설계사무소명 | 설계사무소 대표번호 | 데이터기준일자
0 | 1 | 신고 | 경상북도 안동시 화성동 14-32 | 제2종근린생활시설 | 2023-07-26 | 2023-07-26 | 2023-07-26 | 2023-08-01 | <NA> | <NA> | <NA> | <NA> | 용화 건축사사무소 | 054-841-8877 | 2023-08-17
1 | 2 | 신고 | 경상북도 안동시 풍천면 가곡리 262-4 | 단독주택 | 2023-07-24 | 2023-07-19 | <NA> | 2023-08-02 | <NA> | <NA> | <NA> | <NA> | 무이건축사사무소 | <NA> | 2023-08-17
2 | 3 | 신고 | 경상북도 안동시 풍산읍 괴정리 626 외1필지 | 제2종근린생활시설 | 2023-06-23 | 2023-06-23 | 2023-06-23 | 2023-07-10 | <NA> | <NA> | <NA> | <NA> | 용화 건축사사무소 | 054-841-8877 | 2023-08-17
3 | 4 | 신고 | 경상북도 안동시 풍천면 구담리 287 | 동물및식물관련시설 | 2023-06-15 | 2023-06-16 | 2023-06-16 | 2023-07-26 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2023-08-17
4 | 5 | 신고 | 경상북도 안동시 풍산읍 매곡리 1138 | 교육연구시설 | 2023-06-10 | 2023-06-12 | <NA> | 2023-06-14 | <NA> | 054-850-6905 | <NA> | <NA> | 종합건축사사무소 건원건축 | 054-841-4533 | 2023-08-17
5 | 6 | 신고 | 경상북도 안동시 임하면 추목리 109 | 단독주택 | 2023-06-14 | 2023-06-14 | 2023-06-14 | 2023-08-16 | <NA> | 054-854-5703 | <NA> | <NA> | 건축사사무소 동인 | 054-854-5703 | 2023-08-17
6 | 7 | 신고 | 경상북도 안동시 남선면 원림리 816-1 외2필지 | 동물및식물관련시설 | 2023-06-20 | 2023-06-21 | 2023-06-21 | 2023-08-11 | <NA> | <NA> | <NA> | <NA> | 건축사사무소 안동건축 | <NA> | 2023-08-17
7 | 8 | 신고 | 경상북도 안동시 태화동 223-45 | 단독주택 | 2023-06-08 | 2023-06-08 | 2023-06-08 | 2023-07-27 | <NA> | <NA> | <NA> | <NA> | 건축사사무소 한림 | 054-853-1933 | 2023-08-17
8 | 9 | 신고 | 경상북도 안동시 풍산읍 괴정리 601 | 동물및식물관련시설 | 2023-06-09 | 2023-06-02 | 2023-06-02 | 2023-06-14 | <NA> | <NA> | <NA> | <NA> | 건축사사무소 둥지 | 054-655-8806 | 2023-08-17
9 | 10 | 신고 | 경상북도 안동시 풍산읍 괴정리 738 | 창고시설 | 2023-05-15 | 2023-05-12 | 2023-05-15 | 2023-06-07 | <NA> | <NA> | <NA> | <NA> | (주)종합건축사사무소 건원건축 | 054-841-4533 | 2023-08-17

index | 연번 | 구분 | 시공 대지위치 | 주용도 | 착공예정일 | 착공처리일 | 실제착공일 | 사용승인일 | 시공업체명 | 시공업체 대표번호 | 감리사무소명 | 감리사무소 대표번호 | 설계사무소명 | 설계사무소 대표번호 | 데이터기준일자
591 | 592 | 허가 | 경상북도 안동시 일직면 망호리 140 외1필지 | 동물및식물관련시설 | 2019-06-30 | 2019-06-20 | 2019-06-30 | 2023-05-04 | <NA> | <NA> | 주식회사 종합건축사사무소 원건축 | 054-853-5400 | 주식회사 종합건축사사무소 원건축 | 054-853-5400 | 2023-08-17
592 | 593 | 허가 | 경상북도 안동시 풍천면 갈전리 1226 | 단독주택 | 2018-11-15 | 2018-11-12 | 2018-11-15 | 2022-09-07 | <NA> | <NA> | 세다건축사사무소 | 054-852-9016 | 하마 건축사사무소 | 054-901-7262 | 2023-08-17
593 | 594 | 허가 | 경상북도 안동시 도산면 동부리 산 124-1 외140필지 | 문화및집회시설 | 2017-12-28 | 2018-05-01 | 2017-12-28 | 2022-08-19 | 계룡건설산업(주) | 042-480-7114 | <NA> | 054-650-3000 | (주)삼우종합건축사사무소 | 02-2184-5242 | 2023-08-17
594 | 595 | 허가 | 경상북도 안동시 송현동 344-8 | 단독주택 | 2017-10-16 | 2017-10-17 | 2017-10-16 | 2022-12-22 | <NA> | <NA> | <NA> | 054-854-9201 | 건축사사무소 장한건축 | 054-854-9201 | 2023-08-17
595 | 596 | 허가 | 경상북도 안동시 서후면 이송천리 138 외1필지 | 단독주택 | 2017-07-21 | 2017-07-21 | 2017-07-21 | 2023-06-08 | ㈜대신종합건설 | 054-853-7465 | 건축사사무소 우진건축 | 054-852-3995 | 건축사사무소 우진건축 | 054-852-3995 | 2023-08-17
596 | 597 | 허가 | 경상북도 안동시 안막동 산 91 외1필지 | 운동시설 | 2017-06-30 | 2017-06-30 | 2017-06-30 | 2023-07-14 | (주)일성종합건설 | 053-641-3804 | 종합건축사 건원건축 | 054-841-4533 | 건축사사무소애드이엔씨 | <NA> | 2023-08-17
597 | 598 | 허가 | 경상북도 안동시 풍산읍 매곡리 1114-4 | 단독주택 | 2016-11-11 | 2016-11-02 | 2016-11-11 | 2023-02-02 | <NA> | <NA> | 건축사사무소미래건축 | 054-854-8826 | 건축사사무소 한림 | 054-853-1933 | 2023-08-17
598 | 599 | 허가 | 경상북도 안동시 북후면 옹천리 531-17 | 단독주택 | 2015-05-01 | 2015-04-30 | 2015-05-01 | 2023-02-20 | <NA> | <NA> | (주)종합건축사사무소 건원건축 | 054-841-4533 | (주)종합건축사 건원건축 | 054-841-4533 | 2023-08-17
599 | 600 | 허가 | 경상북도 안동시 안막동 202-11 외1필지 | 단독주택 | 2015-12-23 | 2015-12-23 | 2015-12-23 | 2023-01-13 | <NA> | <NA> | 건축사사무소 우신건축 | 054-841-6200 | (주)경북건축사사무소 | 054-872-4832 | 2023-08-17
600 | 601 | 허가 | 경상북도 안동시 풍산읍 괴정리 1015 | 공장 | 2014-04-06 | 2014-04-07 | 2014-04-06 | 2022-11-20 | <NA> | 054-859-7718 | 용화건축사사무소 | 054-841-8877 | 주식회사 종합건축사사무소 원건축 | 054-853-5400 | 2023-08-17