Overview

Dataset statistics

Number of variables7
Number of observations6390
Missing cells12332
Missing cells (%)27.6%
Duplicate rows20
Duplicate rows (%)0.3%
Total size in memory349.6 KiB
Average record size in memory56.0 B

Variable types

Text5
Categorical1
DateTime1

Dataset

Description경상남도 남해군 건설현장 시공정보 현황입니다. 항목으로는 시공대지 위치, 주용도, 착공일, 준공일, 시공업체명, 감리사무소, 설계사무소명과 전화번호 등을 포함한 정보입니다.
Author경상남도 남해군
URLhttps://www.data.go.kr/data/15035684/fileData.do

Alerts

Dataset has 20 (0.3%) duplicate rowsDuplicates
주용도 is highly imbalanced (56.4%)Imbalance
착공일 has 307 (4.8%) missing valuesMissing
준공일 has 495 (7.7%) missing valuesMissing
시공업체명(전화번호) has 5927 (92.8%) missing valuesMissing
감리사무소명(전화번호) has 5509 (86.2%) missing valuesMissing
설계사무소명(전화번호) has 94 (1.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:45:08.557706
Analysis finished2023-12-12 07:45:09.949791
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct5834
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size50.1 KiB
2023-12-12T16:45:10.349569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length28
Mean length22.788419
Min length17

Characters and Unicode

Total characters145618
Distinct characters108
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5348 ?
Unique (%)83.7%

Sample

1st row경상남도 남해군 서면 연죽리 1307
2nd row경상남도 남해군 설천면 문항리 1480
3rd row경상남도 남해군 설천면 금음리 825
4th row경상남도 남해군 고현면 도마리 700-9 외3필지
5th row경상남도 남해군 서면 작장리 1466
ValueCountFrequency (%)
경상남도 6390
18.8%
남해군 6390
18.8%
외1필지 1214
 
3.6%
창선면 1084
 
3.2%
남해읍 1043
 
3.1%
삼동면 774
 
2.3%
남면 768
 
2.3%
이동면 629
 
1.8%
설천면 529
 
1.6%
고현면 509
 
1.5%
Other values (4263) 14711
43.2%
2023-12-12T16:45:10.993538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27651
19.0%
14843
 
10.2%
7433
 
5.1%
7088
 
4.9%
1 6621
 
4.5%
6567
 
4.5%
6390
 
4.4%
6390
 
4.4%
6389
 
4.4%
5371
 
3.7%
Other values (98) 50875
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87690
60.2%
Space Separator 27651
 
19.0%
Decimal Number 26514
 
18.2%
Dash Punctuation 3763
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14843
16.9%
7433
 
8.5%
7088
 
8.1%
6567
 
7.5%
6390
 
7.3%
6390
 
7.3%
6389
 
7.3%
5371
 
6.1%
2232
 
2.5%
1947
 
2.2%
Other values (86) 23040
26.3%
Decimal Number
ValueCountFrequency (%)
1 6621
25.0%
2 3434
13.0%
3 2606
 
9.8%
4 2513
 
9.5%
5 2169
 
8.2%
6 2116
 
8.0%
8 1821
 
6.9%
7 1816
 
6.8%
0 1800
 
6.8%
9 1618
 
6.1%
Space Separator
ValueCountFrequency (%)
27651
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3763
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87690
60.2%
Common 57928
39.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14843
16.9%
7433
 
8.5%
7088
 
8.1%
6567
 
7.5%
6390
 
7.3%
6390
 
7.3%
6389
 
7.3%
5371
 
6.1%
2232
 
2.5%
1947
 
2.2%
Other values (86) 23040
26.3%
Common
ValueCountFrequency (%)
27651
47.7%
1 6621
 
11.4%
- 3763
 
6.5%
2 3434
 
5.9%
3 2606
 
4.5%
4 2513
 
4.3%
5 2169
 
3.7%
6 2116
 
3.7%
8 1821
 
3.1%
7 1816
 
3.1%
Other values (2) 3418
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87690
60.2%
ASCII 57928
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27651
47.7%
1 6621
 
11.4%
- 3763
 
6.5%
2 3434
 
5.9%
3 2606
 
4.5%
4 2513
 
4.3%
5 2169
 
3.7%
6 2116
 
3.7%
8 1821
 
3.1%
7 1816
 
3.1%
Other values (2) 3418
 
5.9%
Hangul
ValueCountFrequency (%)
14843
16.9%
7433
 
8.5%
7088
 
8.1%
6567
 
7.5%
6390
 
7.3%
6390
 
7.3%
6389
 
7.3%
5371
 
6.1%
2232
 
2.5%
1947
 
2.2%
Other values (86) 23040
26.3%

주용도
Categorical

IMBALANCE 

Distinct28
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size50.1 KiB
단독주택
3834 
창고시설
790 
제1종근린생활시설
556 
제2종근린생활시설
459 
동.식물관련시설
403 
Other values (23)
 
348

Length

Max length10
Median length4
Mean length5.1
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row창고시설
2nd row단독주택
3rd row창고시설
4th row제1종근린생활시설
5th row단독주택

Common Values

ValueCountFrequency (%)
단독주택 3834
60.0%
창고시설 790
 
12.4%
제1종근린생활시설 556
 
8.7%
제2종근린생활시설 459
 
7.2%
동.식물관련시설 403
 
6.3%
숙박시설 55
 
0.9%
공장 37
 
0.6%
공동주택 36
 
0.6%
노유자시설 27
 
0.4%
업무시설 20
 
0.3%
Other values (18) 173
 
2.7%

Length

2023-12-12T16:45:11.221579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 3834
60.0%
창고시설 790
 
12.4%
제1종근린생활시설 556
 
8.7%
제2종근린생활시설 459
 
7.2%
동.식물관련시설 403
 
6.3%
숙박시설 55
 
0.9%
공장 37
 
0.6%
공동주택 36
 
0.6%
노유자시설 27
 
0.4%
업무시설 20
 
0.3%
Other values (18) 173
 
2.7%

착공일
Date

MISSING 

Distinct3081
Distinct (%)50.6%
Missing307
Missing (%)4.8%
Memory size50.1 KiB
Minimum1993-06-25 00:00:00
Maximum2022-11-15 00:00:00
2023-12-12T16:45:11.427429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:45:11.604128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

준공일
Text

MISSING 

Distinct2586
Distinct (%)43.9%
Missing495
Missing (%)7.7%
Memory size50.1 KiB
2023-12-12T16:45:11.920707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters58950
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1059 ?
Unique (%)18.0%

Sample

1st row2022-10-17
2nd row2022-10-19
3rd row2022-10-17
4th row2022-10-19
5th row2022-10-12
ValueCountFrequency (%)
2021-12-31 66
 
1.1%
2021-12-30 48
 
0.8%
2021-06-30 32
 
0.5%
2021-05-30 16
 
0.3%
2021-03-31 16
 
0.3%
2021-07-30 13
 
0.2%
2021-09-30 13
 
0.2%
2020-12-31 12
 
0.2%
2021-04-30 12
 
0.2%
2017-09-06 11
 
0.2%
Other values (2576) 5656
95.9%
2023-12-12T16:45:12.386317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 14055
23.8%
- 11790
20.0%
2 11312
19.2%
1 10397
17.6%
3 2044
 
3.5%
9 1744
 
3.0%
8 1740
 
3.0%
7 1663
 
2.8%
6 1469
 
2.5%
5 1421
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 47160
80.0%
Dash Punctuation 11790
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 14055
29.8%
2 11312
24.0%
1 10397
22.0%
3 2044
 
4.3%
9 1744
 
3.7%
8 1740
 
3.7%
7 1663
 
3.5%
6 1469
 
3.1%
5 1421
 
3.0%
4 1315
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 11790
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 58950
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 14055
23.8%
- 11790
20.0%
2 11312
19.2%
1 10397
17.6%
3 2044
 
3.5%
9 1744
 
3.0%
8 1740
 
3.0%
7 1663
 
2.8%
6 1469
 
2.5%
5 1421
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58950
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14055
23.8%
- 11790
20.0%
2 11312
19.2%
1 10397
17.6%
3 2044
 
3.5%
9 1744
 
3.0%
8 1740
 
3.0%
7 1663
 
2.8%
6 1469
 
2.5%
5 1421
 
2.4%
Distinct348
Distinct (%)75.2%
Missing5927
Missing (%)92.8%
Memory size50.1 KiB
2023-12-12T16:45:12.706313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length30
Mean length20.546436
Min length5

Characters and Unicode

Total characters9513
Distinct characters206
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique287 ?
Unique (%)62.0%

Sample

1st row(주)민수종합건설(055-867-8438)
2nd row덕진토건㈜(055-268-6100)
3rd row(주)민수종합건설(055-867-8438)
4th row한라종합건설㈜(055-253-5823)
5th row동휘종합건설㈜(055-863-4561)
ValueCountFrequency (%)
주식회사 13
 
2.7%
정남종합건설(주)(055-863-2523 12
 
2.4%
주)금강종합건설(055-864-0599 10
 
2.0%
주)와이비(055-862-0488 10
 
2.0%
한라종합건설(주)(055-253-5823 8
 
1.6%
수광종합건설(주)(055-758-9000 6
 
1.2%
거성종합건설(주)(055-864-8043 4
 
0.8%
주)민수종합건설(055-867-8438 4
 
0.8%
용봉종합건설(주)(053-633-5795 4
 
0.8%
한남종합건설주식회사(055-863-2523 4
 
0.8%
Other values (351) 415
84.7%
2023-12-12T16:45:13.189558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1001
 
10.5%
- 800
 
8.4%
0 704
 
7.4%
) 691
 
7.3%
( 689
 
7.2%
398
 
4.2%
389
 
4.1%
374
 
3.9%
3 374
 
3.9%
8 359
 
3.8%
Other values (196) 3734
39.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4065
42.7%
Other Letter 3182
33.4%
Dash Punctuation 800
 
8.4%
Close Punctuation 691
 
7.3%
Open Punctuation 689
 
7.2%
Other Symbol 57
 
0.6%
Space Separator 27
 
0.3%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
398
 
12.5%
389
 
12.2%
374
 
11.8%
267
 
8.4%
264
 
8.3%
106
 
3.3%
106
 
3.3%
100
 
3.1%
54
 
1.7%
48
 
1.5%
Other values (179) 1076
33.8%
Decimal Number
ValueCountFrequency (%)
5 1001
24.6%
0 704
17.3%
3 374
 
9.2%
8 359
 
8.8%
2 341
 
8.4%
6 306
 
7.5%
4 290
 
7.1%
7 270
 
6.6%
1 253
 
6.2%
9 167
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
N 1
50.0%
H 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 800
100.0%
Close Punctuation
ValueCountFrequency (%)
) 691
100.0%
Open Punctuation
ValueCountFrequency (%)
( 689
100.0%
Other Symbol
ValueCountFrequency (%)
57
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6272
65.9%
Hangul 3239
34.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
398
 
12.3%
389
 
12.0%
374
 
11.5%
267
 
8.2%
264
 
8.2%
106
 
3.3%
106
 
3.3%
100
 
3.1%
57
 
1.8%
54
 
1.7%
Other values (180) 1124
34.7%
Common
ValueCountFrequency (%)
5 1001
16.0%
- 800
12.8%
0 704
11.2%
) 691
11.0%
( 689
11.0%
3 374
 
6.0%
8 359
 
5.7%
2 341
 
5.4%
6 306
 
4.9%
4 290
 
4.6%
Other values (4) 717
11.4%
Latin
ValueCountFrequency (%)
N 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6274
66.0%
Hangul 3182
33.4%
None 57
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1001
16.0%
- 800
12.8%
0 704
11.2%
) 691
11.0%
( 689
11.0%
3 374
 
6.0%
8 359
 
5.7%
2 341
 
5.4%
6 306
 
4.9%
4 290
 
4.6%
Other values (6) 719
11.5%
Hangul
ValueCountFrequency (%)
398
 
12.5%
389
 
12.2%
374
 
11.8%
267
 
8.4%
264
 
8.3%
106
 
3.3%
106
 
3.3%
100
 
3.1%
54
 
1.7%
48
 
1.5%
Other values (179) 1076
33.8%
None
ValueCountFrequency (%)
57
100.0%
Distinct216
Distinct (%)24.5%
Missing5509
Missing (%)86.2%
Memory size50.1 KiB
2023-12-12T16:45:13.457249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length31
Mean length22.438138
Min length2

Characters and Unicode

Total characters19768
Distinct characters194
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)17.1%

Sample

1st row장문호건축사사무소(055-864-7400)
2nd row예터건축사사무소(070-7504-0153)
3rd row남해건축사(사)(055-863-4441)
4th row남해건축사(사)(055-863-4441)
5th row건축사사무소동성(055-862-5900)
ValueCountFrequency (%)
김윤섭건축사(사)(055-864-3315 104
 
10.8%
건축사사무소동성(055-862-5900 91
 
9.5%
남해건축사(사)(055-863-4441 85
 
8.9%
장문호건축사(사)(055-864-7400 63
 
6.6%
장문호건축사사무소(055-864-7400 58
 
6.0%
김윤섭건축사사무소(055-864-3315 55
 
5.7%
고원건축사사무소(055-863-4300 46
 
4.8%
건축사사무소 28
 
2.9%
도하건축사사무소(055-863-4182 24
 
2.5%
예터건축사사무소(070-7504-0153 24
 
2.5%
Other values (234) 381
39.7%
2023-12-12T16:45:13.829303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 2034
 
10.3%
1745
 
8.8%
- 1678
 
8.5%
0 1674
 
8.5%
) 1182
 
6.0%
( 1180
 
6.0%
4 956
 
4.8%
881
 
4.5%
876
 
4.4%
8 784
 
4.0%
Other values (184) 6778
34.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8501
43.0%
Other Letter 7123
36.0%
Dash Punctuation 1678
 
8.5%
Close Punctuation 1182
 
6.0%
Open Punctuation 1180
 
6.0%
Space Separator 78
 
0.4%
Uppercase Letter 23
 
0.1%
Other Punctuation 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1745
24.5%
881
12.4%
876
12.3%
580
 
8.1%
579
 
8.1%
169
 
2.4%
168
 
2.4%
166
 
2.3%
129
 
1.8%
127
 
1.8%
Other values (161) 1703
23.9%
Decimal Number
ValueCountFrequency (%)
5 2034
23.9%
0 1674
19.7%
4 956
11.2%
8 784
 
9.2%
3 772
 
9.1%
6 768
 
9.0%
1 542
 
6.4%
7 418
 
4.9%
2 370
 
4.4%
9 183
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
C 5
21.7%
A 5
21.7%
M 3
13.0%
S 3
13.0%
T 3
13.0%
E 3
13.0%
L 1
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 1678
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1182
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1180
100.0%
Space Separator
ValueCountFrequency (%)
78
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12622
63.9%
Hangul 7120
36.0%
Latin 23
 
0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1745
24.5%
881
12.4%
876
12.3%
580
 
8.1%
579
 
8.1%
169
 
2.4%
168
 
2.4%
166
 
2.3%
129
 
1.8%
127
 
1.8%
Other values (158) 1700
23.9%
Common
ValueCountFrequency (%)
5 2034
16.1%
- 1678
13.3%
0 1674
13.3%
) 1182
9.4%
( 1180
9.3%
4 956
7.6%
8 784
 
6.2%
3 772
 
6.1%
6 768
 
6.1%
1 542
 
4.3%
Other values (6) 1052
8.3%
Latin
ValueCountFrequency (%)
C 5
21.7%
A 5
21.7%
M 3
13.0%
S 3
13.0%
T 3
13.0%
E 3
13.0%
L 1
 
4.3%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12645
64.0%
Hangul 7120
36.0%
CJK 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 2034
16.1%
- 1678
13.3%
0 1674
13.2%
) 1182
9.3%
( 1180
9.3%
4 956
7.6%
8 784
 
6.2%
3 772
 
6.1%
6 768
 
6.1%
1 542
 
4.3%
Other values (13) 1075
8.5%
Hangul
ValueCountFrequency (%)
1745
24.5%
881
12.4%
876
12.3%
580
 
8.1%
579
 
8.1%
169
 
2.4%
168
 
2.4%
166
 
2.3%
129
 
1.8%
127
 
1.8%
Other values (158) 1700
23.9%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct645
Distinct (%)10.2%
Missing94
Missing (%)1.5%
Memory size50.1 KiB
2023-12-12T16:45:14.120784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length32
Mean length20.480464
Min length2

Characters and Unicode

Total characters128945
Distinct characters283
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique427 ?
Unique (%)6.8%

Sample

1st row남해건축사(사)
2nd row남해건축사(사)
3rd row건축사사무소동성
4th row건축사사무소동성
5th row남해건축사(사)
ValueCountFrequency (%)
김윤섭건축사(사)(055-864-3315 864
 
13.1%
남해건축사(사)(055-863-4441 810
 
12.3%
건축사사무소동성(055-862-5900 644
 
9.8%
고원건축사사무소(055-863-4300 584
 
8.9%
장문호건축사(사)(055-864-7400 460
 
7.0%
장문호건축사사무소(055-864-7400 438
 
6.7%
김윤섭건축사사무소(055-864-3315 184
 
2.8%
건축사사무소 155
 
2.4%
장문호건축사사무소 129
 
2.0%
부산종합건축사(사)(055-863-1156 128
 
1.9%
Other values (687) 2185
33.2%
2023-12-12T16:45:14.550342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 12797
 
9.9%
12626
 
9.8%
0 10579
 
8.2%
- 10486
 
8.1%
) 8031
 
6.2%
( 8021
 
6.2%
4 6625
 
5.1%
6358
 
4.9%
6345
 
4.9%
3 5258
 
4.1%
Other values (273) 41819
32.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 52715
40.9%
Other Letter 49310
38.2%
Dash Punctuation 10486
 
8.1%
Close Punctuation 8031
 
6.2%
Open Punctuation 8021
 
6.2%
Space Separator 286
 
0.2%
Uppercase Letter 81
 
0.1%
Other Punctuation 11
 
< 0.1%
Lowercase Letter 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12626
25.6%
6358
12.9%
6345
12.9%
3702
 
7.5%
3697
 
7.5%
1212
 
2.5%
1199
 
2.4%
1195
 
2.4%
1047
 
2.1%
1038
 
2.1%
Other values (240) 10891
22.1%
Uppercase Letter
ValueCountFrequency (%)
A 15
18.5%
L 12
14.8%
M 11
13.6%
S 9
11.1%
E 9
11.1%
C 7
8.6%
T 6
 
7.4%
D 3
 
3.7%
O 2
 
2.5%
P 2
 
2.5%
Other values (4) 5
 
6.2%
Decimal Number
ValueCountFrequency (%)
5 12797
24.3%
0 10579
20.1%
4 6625
12.6%
3 5258
10.0%
6 4998
 
9.5%
8 4965
 
9.4%
1 3124
 
5.9%
7 1947
 
3.7%
2 1483
 
2.8%
9 939
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 6
54.5%
. 5
45.5%
Lowercase Letter
ValueCountFrequency (%)
m 2
66.7%
c 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 10486
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8031
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8021
100.0%
Space Separator
ValueCountFrequency (%)
286
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 79551
61.7%
Hangul 49307
38.2%
Latin 84
 
0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12626
25.6%
6358
12.9%
6345
12.9%
3702
 
7.5%
3697
 
7.5%
1212
 
2.5%
1199
 
2.4%
1195
 
2.4%
1047
 
2.1%
1038
 
2.1%
Other values (237) 10888
22.1%
Common
ValueCountFrequency (%)
5 12797
16.1%
0 10579
13.3%
- 10486
13.2%
) 8031
10.1%
( 8021
10.1%
4 6625
8.3%
3 5258
6.6%
6 4998
 
6.3%
8 4965
 
6.2%
1 3124
 
3.9%
Other values (7) 4667
 
5.9%
Latin
ValueCountFrequency (%)
A 15
17.9%
L 12
14.3%
M 11
13.1%
S 9
10.7%
E 9
10.7%
C 7
8.3%
T 6
 
7.1%
D 3
 
3.6%
O 2
 
2.4%
P 2
 
2.4%
Other values (6) 8
9.5%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79635
61.8%
Hangul 49307
38.2%
CJK 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 12797
16.1%
0 10579
13.3%
- 10486
13.2%
) 8031
10.1%
( 8021
10.1%
4 6625
8.3%
3 5258
6.6%
6 4998
 
6.3%
8 4965
 
6.2%
1 3124
 
3.9%
Other values (23) 4751
 
6.0%
Hangul
ValueCountFrequency (%)
12626
25.6%
6358
12.9%
6345
12.9%
3702
 
7.5%
3697
 
7.5%
1212
 
2.5%
1199
 
2.4%
1195
 
2.4%
1047
 
2.1%
1038
 
2.1%
Other values (237) 10888
22.1%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

Missing values

2023-12-12T16:45:09.625018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:45:09.747918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T16:45:09.876209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시공 대지위치주용도착공일준공일시공업체명(전화번호)감리사무소명(전화번호)설계사무소명(전화번호)
0경상남도 남해군 서면 연죽리 1307창고시설2022-11-15<NA><NA><NA><NA>
1경상남도 남해군 설천면 문항리 1480단독주택2022-11-08<NA><NA><NA>남해건축사(사)
2경상남도 남해군 설천면 금음리 825창고시설2022-11-05<NA><NA><NA>남해건축사(사)
3경상남도 남해군 고현면 도마리 700-9 외3필지제1종근린생활시설2022-11-04<NA><NA><NA>건축사사무소동성
4경상남도 남해군 서면 작장리 1466단독주택2022-11-03<NA><NA><NA>건축사사무소동성
5경상남도 남해군 창선면 서대리 670단독주택2022-11-01<NA><NA><NA>남해건축사(사)
6경상남도 남해군 상주면 양아리 900-1단독주택2022-10-31<NA><NA><NA>건축사사무소동성
7경상남도 남해군 미조면 송정리 125-4제1종근린생활시설2022-10-31<NA><NA><NA>건축사사무소동성
8경상남도 남해군 이동면 난음리 619단독주택2022-10-29<NA><NA><NA>종합건축사사무소혁성
9경상남도 남해군 창선면 옥천리 698단독주택2022-10-27<NA><NA><NA>남해건축사(사)
시공 대지위치주용도착공일준공일시공업체명(전화번호)감리사무소명(전화번호)설계사무소명(전화번호)
6380경상남도 남해군 창선면 지족리 414-1단독주택<NA><NA><NA><NA>도담 건축사사무소(051-760-8290)
6381경상남도 남해군 남면 평산리 1835-18제2종근린생활시설<NA><NA>(주)와이비(055-862-0488)<NA>건축사사무소동성(055-862-5900)
6382경상남도 남해군 이동면 초음리 348창고시설<NA><NA><NA><NA>장문호건축사사무소(055-864-7400)
6383경상남도 남해군 이동면 초음리 376창고시설<NA><NA><NA><NA>장문호건축사사무소(055-864-7400)
6384경상남도 남해군 이동면 무림리 1701 외1필지단독주택<NA><NA><NA><NA>예터건축사사무소(070-7504-0153)
6385경상남도 남해군 서면 중현리 76 외2필지단독주택<NA><NA><NA><NA>남해건축사(사)(055-863-4441)
6386경상남도 남해군 창선면 진동리 678 외5필지제1종근린생활시설<NA><NA>금명종합건설㈜(055-755-1234)(주)건화(02-6938-7786)주식회사오월건축건축사사무소(02-549-5080)
6387경상남도 남해군 미조면 미조리 861-1 외13필지제1종근린생활시설<NA><NA>동남건설㈜(055-854-0301)(주)도화엔지니어링(02-6323-3134)(주)마이건축사사무소(02-540-0707)
6388경상남도 남해군 남면 평산리 1783-7제2종근린생활시설<NA>2016-04-19<NA><NA>김윤섭건축사(사)(055-864-3315)
6389경상남도 남해군 삼동면 물건리 989-2단독주택<NA>2010-05-25<NA><NA>부산종합건축사(사)(055-863-1156)

Duplicate rows

Most frequently occurring

시공 대지위치주용도착공일준공일시공업체명(전화번호)감리사무소명(전화번호)설계사무소명(전화번호)# duplicates
8경상남도 남해군 상주면 상주리 1056-5 외1필지단독주택2012-02-232012-07-31<NA>건축사사무소바로(055-748-5771)건축사사무소바로(055-748-5771)3
14경상남도 남해군 창선면 부윤리 157단독주택<NA><NA><NA><NA>남해건축사(사)(055-863-4441)3
0경상남도 남해군 고현면 대사리 산 80-7단독주택<NA><NA><NA><NA>건축사사무소이현(055-743-0017)2
1경상남도 남해군 고현면 도마리 1094-1단독주택2019-01-142019-02-28<NA><NA>고원건축사사무소(055-863-4300)2
2경상남도 남해군 남면 당항리 31-1단독주택2017-06-072018-08-10<NA><NA>건축사사무소동성(055-862-5900)2
3경상남도 남해군 남면 홍현리 산 8 외1필지단독주택<NA><NA><NA><NA>남해건축사(사)(055-863-4441)2
4경상남도 남해군 남해읍 선소리 67-6단독주택2019-01-112019-01-23<NA><NA>건축사사무소동성(055-862-5900)2
5경상남도 남해군 남해읍 선소리 67-7단독주택2019-01-102019-01-23<NA><NA>고원건축사사무소(055-863-4300)2
6경상남도 남해군 남해읍 평리 1680창고시설2019-02-182019-02-25<NA><NA>남해건축사(사)(055-863-4441)2
7경상남도 남해군 삼동면 물건리 산 21-9 외1필지단독주택2020-11-192021-11-17<NA><NA>고원건축사사무소2