Overview

Dataset statistics

Number of variables9
Number of observations1378
Missing cells2216
Missing cells (%)17.9%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory97.0 KiB
Average record size in memory72.1 B

Variable types

Text6
Categorical1
DateTime2

Dataset

Description하동군 건축허가 현황 대지위치,주용도,착공예정일,실제착공일, 착공처리일, 준공예정일, 시공자사무소명, 감리사무소명, 설계사무소명 등
Author경상남도 하동군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15036786

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
착공예정일 has 27 (2.0%) missing valuesMissing
착공처리일 has 22 (1.6%) missing valuesMissing
준공예정일(사용승인예정일) has 915 (66.4%) missing valuesMissing
시공자사무소명 has 991 (71.9%) missing valuesMissing
감리사무소명 has 257 (18.7%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:33:00.276179
Analysis finished2023-12-11 00:33:01.163292
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1281
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size10.9 KiB
2023-12-11T09:33:01.443565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length28
Mean length23.862119
Min length17

Characters and Unicode

Total characters32882
Distinct characters130
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1199 ?
Unique (%)87.0%

Sample

1st row경상남도 하동군 옥종면 북방리 551
2nd row경상남도 하동군 진교면 진교리 311-15
3rd row경상남도 하동군 옥종면 대곡리 432
4th row경상남도 하동군 옥종면 법대리 411-2 외2필지
5th row경상남도 하동군 진교면 안심리 229-1 외13필지
ValueCountFrequency (%)
경상남도 1378
18.3%
하동군 1378
18.3%
외1필지 325
 
4.3%
하동읍 293
 
3.9%
진교면 205
 
2.7%
옥종면 148
 
2.0%
금남면 121
 
1.6%
화개면 117
 
1.6%
읍내리 112
 
1.5%
진교리 102
 
1.4%
Other values (1278) 3367
44.6%
2023-12-11T09:33:01.892505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6168
18.8%
1713
 
5.2%
1678
 
5.1%
1506
 
4.6%
1411
 
4.3%
1 1406
 
4.3%
1395
 
4.2%
1378
 
4.2%
1378
 
4.2%
1373
 
4.2%
Other values (120) 13476
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19704
59.9%
Space Separator 6168
 
18.8%
Decimal Number 6042
 
18.4%
Dash Punctuation 968
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1713
 
8.7%
1678
 
8.5%
1506
 
7.6%
1411
 
7.2%
1395
 
7.1%
1378
 
7.0%
1378
 
7.0%
1373
 
7.0%
1085
 
5.5%
619
 
3.1%
Other values (108) 6168
31.3%
Decimal Number
ValueCountFrequency (%)
1 1406
23.3%
2 890
14.7%
3 741
12.3%
4 604
10.0%
5 451
 
7.5%
6 441
 
7.3%
0 425
 
7.0%
7 423
 
7.0%
9 333
 
5.5%
8 328
 
5.4%
Space Separator
ValueCountFrequency (%)
6168
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 968
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19704
59.9%
Common 13178
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1713
 
8.7%
1678
 
8.5%
1506
 
7.6%
1411
 
7.2%
1395
 
7.1%
1378
 
7.0%
1378
 
7.0%
1373
 
7.0%
1085
 
5.5%
619
 
3.1%
Other values (108) 6168
31.3%
Common
ValueCountFrequency (%)
6168
46.8%
1 1406
 
10.7%
- 968
 
7.3%
2 890
 
6.8%
3 741
 
5.6%
4 604
 
4.6%
5 451
 
3.4%
6 441
 
3.3%
0 425
 
3.2%
7 423
 
3.2%
Other values (2) 661
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19704
59.9%
ASCII 13178
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6168
46.8%
1 1406
 
10.7%
- 968
 
7.3%
2 890
 
6.8%
3 741
 
5.6%
4 604
 
4.6%
5 451
 
3.4%
6 441
 
3.3%
0 425
 
3.2%
7 423
 
3.2%
Other values (2) 661
 
5.0%
Hangul
ValueCountFrequency (%)
1713
 
8.7%
1678
 
8.5%
1506
 
7.6%
1411
 
7.2%
1395
 
7.1%
1378
 
7.0%
1378
 
7.0%
1373
 
7.0%
1085
 
5.5%
619
 
3.1%
Other values (108) 6168
31.3%

주용도
Categorical

Distinct31
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size10.9 KiB
동.식물관련시설
282 
제2종근린생활시설
238 
단독주택
206 
제1종근린생활시설
205 
창고시설
157 
Other values (26)
290 

Length

Max length10
Median length9
Mean length6.4811321
Min length2

Unique

Unique4 ?
Unique (%)0.3%

Sample

1st row동.식물관련시설
2nd row제2종근린생활시설
3rd row동.식물관련시설
4th row동.식물관련시설
5th row창고시설

Common Values

ValueCountFrequency (%)
동.식물관련시설 282
20.5%
제2종근린생활시설 238
17.3%
단독주택 206
14.9%
제1종근린생활시설 205
14.9%
창고시설 157
11.4%
공장 99
 
7.2%
숙박시설 26
 
1.9%
노유자시설 25
 
1.8%
공동주택 21
 
1.5%
교육연구및복지시설 16
 
1.2%
Other values (21) 103
 
7.5%

Length

2023-12-11T09:33:02.031214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
동.식물관련시설 282
20.5%
제2종근린생활시설 238
17.3%
단독주택 206
14.9%
제1종근린생활시설 205
14.9%
창고시설 157
11.4%
공장 99
 
7.2%
숙박시설 26
 
1.9%
노유자시설 25
 
1.8%
공동주택 21
 
1.5%
교육연구및복지시설 16
 
1.2%
Other values (21) 103
 
7.5%

착공예정일
Text

MISSING 

Distinct1149
Distinct (%)85.0%
Missing27
Missing (%)2.0%
Memory size10.9 KiB
2023-12-11T09:33:02.318638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9911177
Min length6

Characters and Unicode

Total characters13498
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique982 ?
Unique (%)72.7%

Sample

1st row2022-05-27
2nd row2022-01-21
3rd row2021-12-10
4th row2022-03-23
5th row2021-11-01
ValueCountFrequency (%)
2020-04-13 5
 
0.4%
2019-12-16 4
 
0.3%
2019-06-07 4
 
0.3%
2006-10-04 4
 
0.3%
2018-10-10 4
 
0.3%
2005-11-15 4
 
0.3%
2006-04-12 3
 
0.2%
2020-10-20 3
 
0.2%
2014-06-16 3
 
0.2%
2018-07-25 3
 
0.2%
Other values (1140) 1315
97.3%
2023-12-11T09:33:02.751652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3672
27.2%
- 2702
20.0%
2 2423
18.0%
1 1986
14.7%
3 442
 
3.3%
9 412
 
3.1%
8 401
 
3.0%
7 381
 
2.8%
5 369
 
2.7%
6 361
 
2.7%
Other values (2) 349
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10795
80.0%
Dash Punctuation 2702
 
20.0%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3672
34.0%
2 2423
22.4%
1 1986
18.4%
3 442
 
4.1%
9 412
 
3.8%
8 401
 
3.7%
7 381
 
3.5%
5 369
 
3.4%
6 361
 
3.3%
4 348
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 2702
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13498
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3672
27.2%
- 2702
20.0%
2 2423
18.0%
1 1986
14.7%
3 442
 
3.3%
9 412
 
3.1%
8 401
 
3.0%
7 381
 
2.8%
5 369
 
2.7%
6 361
 
2.7%
Other values (2) 349
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13498
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3672
27.2%
- 2702
20.0%
2 2423
18.0%
1 1986
14.7%
3 442
 
3.3%
9 412
 
3.1%
8 401
 
3.0%
7 381
 
2.8%
5 369
 
2.7%
6 361
 
2.7%
Other values (2) 349
 
2.6%
Distinct1186
Distinct (%)86.1%
Missing0
Missing (%)0.0%
Memory size10.9 KiB
2023-12-11T09:33:03.049201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9941945
Min length8

Characters and Unicode

Total characters13772
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1021 ?
Unique (%)74.1%

Sample

1st row2022-05-27
2nd row2022-01-21
3rd row2021-12-10
4th row2022-03-23
5th row2021-11-01
ValueCountFrequency (%)
2020-04-13 5
 
0.4%
2019-06-07 4
 
0.3%
2005-11-15 4
 
0.3%
2018-10-10 4
 
0.3%
2019-12-16 4
 
0.3%
2011-08-03 3
 
0.2%
2010-02-25 3
 
0.2%
2009-12-22 3
 
0.2%
2020-05-20 3
 
0.2%
2019-11-01 3
 
0.2%
Other values (1176) 1342
97.4%
2023-12-11T09:33:03.553490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3773
27.4%
- 2756
20.0%
2 2488
18.1%
1 2003
14.5%
3 442
 
3.2%
9 431
 
3.1%
8 400
 
2.9%
7 390
 
2.8%
5 373
 
2.7%
4 358
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11016
80.0%
Dash Punctuation 2756
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3773
34.3%
2 2488
22.6%
1 2003
18.2%
3 442
 
4.0%
9 431
 
3.9%
8 400
 
3.6%
7 390
 
3.5%
5 373
 
3.4%
4 358
 
3.2%
6 358
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 2756
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13772
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3773
27.4%
- 2756
20.0%
2 2488
18.1%
1 2003
14.5%
3 442
 
3.2%
9 431
 
3.1%
8 400
 
2.9%
7 390
 
2.8%
5 373
 
2.7%
4 358
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13772
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3773
27.4%
- 2756
20.0%
2 2488
18.1%
1 2003
14.5%
3 442
 
3.2%
9 431
 
3.1%
8 400
 
2.9%
7 390
 
2.8%
5 373
 
2.7%
4 358
 
2.6%

착공처리일
Date

MISSING 

Distinct1137
Distinct (%)83.8%
Missing22
Missing (%)1.6%
Memory size10.9 KiB
Minimum1982-10-26 00:00:00
Maximum2022-05-26 00:00:00
2023-12-11T09:33:03.720704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:33:03.876831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct277
Distinct (%)59.8%
Missing915
Missing (%)66.4%
Memory size10.9 KiB
Minimum2016-04-30 00:00:00
Maximum2025-03-03 00:00:00
2023-12-11T09:33:04.045868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:33:04.255308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시공자사무소명
Text

MISSING 

Distinct282
Distinct (%)72.9%
Missing991
Missing (%)71.9%
Memory size10.9 KiB
2023-12-11T09:33:04.522718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.3074935
Min length4

Characters and Unicode

Total characters3215
Distinct characters175
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)60.7%

Sample

1st row(주)추성
2nd row청양종합건설주식회사
3rd row정우건설(주)
4th row(주)수반건설
5th row금오산업(주)
ValueCountFrequency (%)
주)세진종합건설 25
 
6.0%
주식회사 25
 
6.0%
명진종합건설(주 12
 
2.9%
명진종합건설주식회사 6
 
1.4%
주)아라한건설 6
 
1.4%
푸르미건설주식회사 5
 
1.2%
관성종합건설(주 3
 
0.7%
주)유원종합건설 3
 
0.7%
세진종합건설 3
 
0.7%
청도종합건설(주 3
 
0.7%
Other values (275) 327
78.2%
2023-12-11T09:33:05.229225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
383
 
11.9%
338
 
10.5%
317
 
9.9%
) 287
 
8.9%
( 285
 
8.9%
179
 
5.6%
178
 
5.5%
91
 
2.8%
87
 
2.7%
85
 
2.6%
Other values (165) 985
30.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2612
81.2%
Close Punctuation 287
 
8.9%
Open Punctuation 285
 
8.9%
Space Separator 31
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
383
14.7%
338
 
12.9%
317
 
12.1%
179
 
6.9%
178
 
6.8%
91
 
3.5%
87
 
3.3%
85
 
3.3%
63
 
2.4%
38
 
1.5%
Other values (162) 853
32.7%
Close Punctuation
ValueCountFrequency (%)
) 287
100.0%
Open Punctuation
ValueCountFrequency (%)
( 285
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2612
81.2%
Common 603
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
383
14.7%
338
 
12.9%
317
 
12.1%
179
 
6.9%
178
 
6.8%
91
 
3.5%
87
 
3.3%
85
 
3.3%
63
 
2.4%
38
 
1.5%
Other values (162) 853
32.7%
Common
ValueCountFrequency (%)
) 287
47.6%
( 285
47.3%
31
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2612
81.2%
ASCII 603
 
18.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
383
14.7%
338
 
12.9%
317
 
12.1%
179
 
6.9%
178
 
6.8%
91
 
3.5%
87
 
3.3%
85
 
3.3%
63
 
2.4%
38
 
1.5%
Other values (162) 853
32.7%
ASCII
ValueCountFrequency (%)
) 287
47.6%
( 285
47.3%
31
 
5.1%

감리사무소명
Text

MISSING 

Distinct304
Distinct (%)27.1%
Missing257
Missing (%)18.7%
Memory size10.9 KiB
2023-12-11T09:33:05.515664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length8
Mean length8.5771632
Min length2

Characters and Unicode

Total characters9615
Distinct characters200
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)17.8%

Sample

1st row유성건축사사무소
2nd row모두이종록건축사사무소
3rd row으뜸건축사사무소
4th row건축사사무소오름
5th row주식회사길림건축사사무소
ValueCountFrequency (%)
건축사사무소 131
 
9.9%
강림건축사사무소 112
 
8.5%
건축사사무소성림 102
 
7.7%
다원건축사사무소 75
 
5.7%
태화건축사사무소 56
 
4.2%
건축사사무소오름 44
 
3.3%
강림건축사(사 40
 
3.0%
유성건축사사무소 29
 
2.2%
사무소 23
 
1.7%
건축사 23
 
1.7%
Other values (293) 688
52.0%
2023-12-11T09:33:05.894584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2214
23.0%
1121
11.7%
1109
11.5%
996
10.4%
992
10.3%
337
 
3.5%
202
 
2.1%
198
 
2.1%
158
 
1.6%
) 155
 
1.6%
Other values (190) 2133
22.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9065
94.3%
Space Separator 202
 
2.1%
Close Punctuation 155
 
1.6%
Open Punctuation 155
 
1.6%
Decimal Number 17
 
0.2%
Uppercase Letter 11
 
0.1%
Other Punctuation 8
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2214
24.4%
1121
12.4%
1109
12.2%
996
11.0%
992
10.9%
337
 
3.7%
198
 
2.2%
158
 
1.7%
128
 
1.4%
96
 
1.1%
Other values (172) 1716
18.9%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
S 2
18.2%
M 2
18.2%
N 1
 
9.1%
E 1
 
9.1%
K 1
 
9.1%
Y 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 7
41.2%
2 4
23.5%
0 3
17.6%
5 3
17.6%
Other Punctuation
ValueCountFrequency (%)
. 5
62.5%
& 3
37.5%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
202
100.0%
Close Punctuation
ValueCountFrequency (%)
) 155
100.0%
Open Punctuation
ValueCountFrequency (%)
( 155
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9065
94.3%
Common 539
 
5.6%
Latin 11
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2214
24.4%
1121
12.4%
1109
12.2%
996
11.0%
992
10.9%
337
 
3.7%
198
 
2.2%
158
 
1.7%
128
 
1.4%
96
 
1.1%
Other values (172) 1716
18.9%
Common
ValueCountFrequency (%)
202
37.5%
) 155
28.8%
( 155
28.8%
1 7
 
1.3%
. 5
 
0.9%
2 4
 
0.7%
0 3
 
0.6%
5 3
 
0.6%
& 3
 
0.6%
> 1
 
0.2%
Latin
ValueCountFrequency (%)
C 3
27.3%
S 2
18.2%
M 2
18.2%
N 1
 
9.1%
E 1
 
9.1%
K 1
 
9.1%
Y 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9065
94.3%
ASCII 550
 
5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2214
24.4%
1121
12.4%
1109
12.2%
996
11.0%
992
10.9%
337
 
3.7%
198
 
2.2%
158
 
1.7%
128
 
1.4%
96
 
1.1%
Other values (172) 1716
18.9%
ASCII
ValueCountFrequency (%)
202
36.7%
) 155
28.2%
( 155
28.2%
1 7
 
1.3%
. 5
 
0.9%
2 4
 
0.7%
0 3
 
0.5%
C 3
 
0.5%
5 3
 
0.5%
& 3
 
0.5%
Other values (8) 10
 
1.8%
Distinct355
Distinct (%)25.8%
Missing4
Missing (%)0.3%
Memory size10.9 KiB
2023-12-11T09:33:06.152908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length8
Mean length8.7256186
Min length2

Characters and Unicode

Total characters11989
Distinct characters211
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)17.3%

Sample

1st row유성건축사사무소
2nd row건축사사무소 돌채
3rd row다올H건축사사무소
4th row모두이종록건축사사무소
5th row으뜸건축사사무소
ValueCountFrequency (%)
건축사사무소 152
 
9.4%
건축사사무소성림 120
 
7.4%
강림건축사사무소 114
 
7.1%
다원건축사사무소 75
 
4.6%
모두이종록건축사사무소 70
 
4.3%
태화건축사사무소 61
 
3.8%
건축사사무소오름 57
 
3.5%
강림건축사(사 49
 
3.0%
유성건축사사무소 37
 
2.3%
정원건축사사무소 31
 
1.9%
Other values (344) 847
52.5%
2023-12-11T09:33:06.588627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2721
22.7%
1383
11.5%
1370
11.4%
1223
10.2%
1218
10.2%
376
 
3.1%
239
 
2.0%
226
 
1.9%
) 200
 
1.7%
( 200
 
1.7%
Other values (201) 2833
23.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11292
94.2%
Space Separator 239
 
2.0%
Close Punctuation 200
 
1.7%
Open Punctuation 200
 
1.7%
Decimal Number 25
 
0.2%
Uppercase Letter 23
 
0.2%
Other Punctuation 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2721
24.1%
1383
12.2%
1370
12.1%
1223
10.8%
1218
10.8%
376
 
3.3%
226
 
2.0%
173
 
1.5%
146
 
1.3%
136
 
1.2%
Other values (182) 2320
20.5%
Uppercase Letter
ValueCountFrequency (%)
C 4
17.4%
E 3
13.0%
S 3
13.0%
M 3
13.0%
K 2
8.7%
Y 2
8.7%
A 2
8.7%
H 2
8.7%
N 1
 
4.3%
D 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 11
44.0%
2 8
32.0%
0 3
 
12.0%
5 3
 
12.0%
Other Punctuation
ValueCountFrequency (%)
. 6
60.0%
& 4
40.0%
Space Separator
ValueCountFrequency (%)
239
100.0%
Close Punctuation
ValueCountFrequency (%)
) 200
100.0%
Open Punctuation
ValueCountFrequency (%)
( 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11292
94.2%
Common 674
 
5.6%
Latin 23
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2721
24.1%
1383
12.2%
1370
12.1%
1223
10.8%
1218
10.8%
376
 
3.3%
226
 
2.0%
173
 
1.5%
146
 
1.3%
136
 
1.2%
Other values (182) 2320
20.5%
Latin
ValueCountFrequency (%)
C 4
17.4%
E 3
13.0%
S 3
13.0%
M 3
13.0%
K 2
8.7%
Y 2
8.7%
A 2
8.7%
H 2
8.7%
N 1
 
4.3%
D 1
 
4.3%
Common
ValueCountFrequency (%)
239
35.5%
) 200
29.7%
( 200
29.7%
1 11
 
1.6%
2 8
 
1.2%
. 6
 
0.9%
& 4
 
0.6%
0 3
 
0.4%
5 3
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11292
94.2%
ASCII 697
 
5.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2721
24.1%
1383
12.2%
1370
12.1%
1223
10.8%
1218
10.8%
376
 
3.3%
226
 
2.0%
173
 
1.5%
146
 
1.3%
136
 
1.2%
Other values (182) 2320
20.5%
ASCII
ValueCountFrequency (%)
239
34.3%
) 200
28.7%
( 200
28.7%
1 11
 
1.6%
2 8
 
1.1%
. 6
 
0.9%
& 4
 
0.6%
C 4
 
0.6%
E 3
 
0.4%
S 3
 
0.4%
Other values (9) 19
 
2.7%

Missing values

2023-12-11T09:33:00.819479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:33:00.961854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:33:01.085209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

대지위치주용도착공예정일실제착공일착공처리일준공예정일(사용승인예정일)시공자사무소명감리사무소명설계사무소명
0경상남도 하동군 옥종면 북방리 551동.식물관련시설2022-05-272022-05-272022-05-262022-06-30<NA><NA>유성건축사사무소
1경상남도 하동군 진교면 진교리 311-15제2종근린생활시설2022-01-212022-01-212022-01-252022-04-30<NA>유성건축사사무소건축사사무소 돌채
2경상남도 하동군 옥종면 대곡리 432동.식물관련시설2021-12-102021-12-102021-12-082022-12-09<NA><NA>다올H건축사사무소
3경상남도 하동군 옥종면 법대리 411-2 외2필지동.식물관련시설2022-03-232022-03-232022-03-282025-03-03<NA>모두이종록건축사사무소모두이종록건축사사무소
4경상남도 하동군 진교면 안심리 229-1 외13필지창고시설2021-11-012021-11-012021-11-052021-12-31(주)추성으뜸건축사사무소으뜸건축사사무소
5경상남도 하동군 화개면 삼신리 231제1종근린생활시설2021-10-282021-10-282021-11-012022-04-28청양종합건설주식회사건축사사무소오름건축사사무소오름
6경상남도 하동군 금남면 대송리 352-10 외3필지동.식물관련시설2021-10-262021-10-262021-10-252021-12-30<NA><NA>건축사사무소성림
7경상남도 하동군 악양면 정서리 256-1단독주택2021-11-202021-11-202021-11-152022-11-19<NA>주식회사길림건축사사무소주식회사길림건축사사무소
8경상남도 하동군 고전면 범아리 312-3제1종근린생활시설2021-10-052021-10-052021-10-062021-12-31<NA><NA>건축사사무소오름
9경상남도 하동군 진교면 고이리 494-1동.식물관련시설2021-09-272021-09-272021-09-242021-09-30<NA><NA>유성건축사사무소
대지위치주용도착공예정일실제착공일착공처리일준공예정일(사용승인예정일)시공자사무소명감리사무소명설계사무소명
1368경상남도 하동군 화개면 덕은리 821 외1필지숙박시설1998-03-131998-03-131998-03-12<NA><NA><NA>강림건축사사무소
1369경상남도 하동군 화개면 탑리 565 외1필지숙박시설1997-05-011997-05-01<NA><NA>오흥종합건설(주)토담건축사사무소토담건축사사무소
1370경상남도 하동군 화개면 운수리 384-17 외1필지숙박시설1995-10-301995-10-301995-10-30<NA>주식회사이랜드건설태화건축사사무소유중엔지니어링 건축사사무소
1371경상남도 하동군 하동읍 읍내리 224-14제1종근린생활시설1994-11-081994-11-08<NA><NA><NA><NA><NA>
1372경상남도 하동군 하동읍 광평리 299-17제2종근린생활시설2004-11-021994-01-142004-10-30<NA><NA>태화건축사사무소태화건축사사무소
1373경상남도 하동군 하동읍 광평리 296-3제2종근린생활시설1991-12-261991-12-26<NA><NA><NA><NA>태화건축사사무소
1374경상남도 하동군 하동읍 광평리 216-5단독주택1989-11-091989-11-071989-11-09<NA><NA>태화설계사무소태화설계사무소
1375경상남도 하동군 화개면 탑리 640 외1필지제2종근린생활시설1982-10-261982-10-261982-10-26<NA><NA>고광건축사사무소고광건축사사무소
1376경상남도 하동군 금남면 노량리 521-2제2종근린생활시설<NA>2000-03-091999-10-11<NA><NA>태화태화건축사사무소
1377경상남도 하동군 악양면 정서리 570공장2006-05-272006-05-272006-05-26<NA><NA>강림건축사사무소강림건축사사무소

Duplicate rows

Most frequently occurring

대지위치주용도착공예정일실제착공일착공처리일준공예정일(사용승인예정일)시공자사무소명감리사무소명설계사무소명# duplicates
0경상남도 하동군 하동읍 읍내리 233-22 외2필지제1종근린생활시설2006-10-042006-10-022006-10-02<NA><NA>강림건축사사무소강림건축사사무소2