Overview

Dataset statistics

Number of variables11
Number of observations2673
Missing cells7728
Missing cells (%)26.3%
Duplicate rows13
Duplicate rows (%)0.5%
Total size in memory229.8 KiB
Average record size in memory88.0 B

Variable types

Text7
Categorical1
DateTime3

Dataset

Description경상남도 사천시 건설현장 시공정보입니다. (위치, 주용도, 착공예정일, 실제착공일, 시공업체명을 볼 수 있습니다.)
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15048899

Alerts

Dataset has 13 (0.5%) duplicate rowsDuplicates
착공예정일 has 172 (6.4%) missing valuesMissing
실제 착공일(착공처리일) has 144 (5.4%) missing valuesMissing
준공예정일 has 2639 (98.7%) missing valuesMissing
시공업체명(빈칸은 건축주 직영) has 1876 (70.2%) missing valuesMissing
시공업체 전화번호 has 2042 (76.4%) missing valuesMissing
감리사무소명 has 228 (8.5%) missing valuesMissing
감리사무소 전화번호 has 391 (14.6%) missing valuesMissing
설계사무소 전화번호 has 216 (8.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 23:05:18.870695
Analysis finished2023-12-10 23:05:19.891811
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2376
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2023-12-11T08:05:20.190339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length29
Mean length21.65245
Min length14

Characters and Unicode

Total characters57877
Distinct characters127
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2165 ?
Unique (%)81.0%

Sample

1st row경상남도 사천시 사남면 월성리 551-4
2nd row경상남도 사천시 서포면 조도리 363
3rd row경상남도 사천시 동금동 85-14
4th row경상남도 사천시 정동면 풍정리 382-2
5th row경상남도 사천시 벌리동 253-6
ValueCountFrequency (%)
경상남도 2673
20.2%
사천시 2673
20.2%
사천읍 601
 
4.5%
외1필지 438
 
3.3%
사남면 422
 
3.2%
향촌동 231
 
1.7%
외2필지 179
 
1.4%
벌리동 178
 
1.3%
수석리 150
 
1.1%
월성리 143
 
1.1%
Other values (2289) 5544
41.9%
2023-12-11T08:05:20.974065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10559
18.2%
3816
 
6.6%
3434
 
5.9%
3137
 
5.4%
2681
 
4.6%
2674
 
4.6%
2673
 
4.6%
2673
 
4.6%
1 2487
 
4.3%
- 2029
 
3.5%
Other values (117) 21714
37.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34139
59.0%
Decimal Number 11150
 
19.3%
Space Separator 10559
 
18.2%
Dash Punctuation 2029
 
3.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3816
11.2%
3434
 
10.1%
3137
 
9.2%
2681
 
7.9%
2674
 
7.8%
2673
 
7.8%
2673
 
7.8%
1828
 
5.4%
1385
 
4.1%
1052
 
3.1%
Other values (105) 8786
25.7%
Decimal Number
ValueCountFrequency (%)
1 2487
22.3%
2 1443
12.9%
4 1165
10.4%
3 1136
10.2%
6 1009
9.0%
5 963
 
8.6%
7 800
 
7.2%
8 784
 
7.0%
9 682
 
6.1%
0 681
 
6.1%
Space Separator
ValueCountFrequency (%)
10559
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2029
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34139
59.0%
Common 23738
41.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3816
11.2%
3434
 
10.1%
3137
 
9.2%
2681
 
7.9%
2674
 
7.8%
2673
 
7.8%
2673
 
7.8%
1828
 
5.4%
1385
 
4.1%
1052
 
3.1%
Other values (105) 8786
25.7%
Common
ValueCountFrequency (%)
10559
44.5%
1 2487
 
10.5%
- 2029
 
8.5%
2 1443
 
6.1%
4 1165
 
4.9%
3 1136
 
4.8%
6 1009
 
4.3%
5 963
 
4.1%
7 800
 
3.4%
8 784
 
3.3%
Other values (2) 1363
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34139
59.0%
ASCII 23738
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10559
44.5%
1 2487
 
10.5%
- 2029
 
8.5%
2 1443
 
6.1%
4 1165
 
4.9%
3 1136
 
4.8%
6 1009
 
4.3%
5 963
 
4.1%
7 800
 
3.4%
8 784
 
3.3%
Other values (2) 1363
 
5.7%
Hangul
ValueCountFrequency (%)
3816
11.2%
3434
 
10.1%
3137
 
9.2%
2681
 
7.9%
2674
 
7.8%
2673
 
7.8%
2673
 
7.8%
1828
 
5.4%
1385
 
4.1%
1052
 
3.1%
Other values (105) 8786
25.7%

주용도
Categorical

Distinct28
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
단독주택
789 
제2종근린생활시설
495 
공장
402 
제1종근린생활시설
332 
창고시설
132 
Other values (23)
523 

Length

Max length10
Median length9
Mean length5.5028058
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row단독주택
2nd row창고시설
3rd row제1종근린생활시설
4th row제2종근린생활시설
5th row제1종근린생활시설

Common Values

ValueCountFrequency (%)
단독주택 789
29.5%
제2종근린생활시설 495
18.5%
공장 402
15.0%
제1종근린생활시설 332
12.4%
창고시설 132
 
4.9%
공동주택 98
 
3.7%
동.식물관련시설 83
 
3.1%
노유자시설 72
 
2.7%
숙박시설 51
 
1.9%
업무시설 50
 
1.9%
Other values (18) 169
 
6.3%

Length

2023-12-11T08:05:21.104439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 789
29.5%
제2종근린생활시설 495
18.5%
공장 402
15.0%
제1종근린생활시설 332
12.4%
창고시설 132
 
4.9%
공동주택 98
 
3.7%
동.식물관련시설 83
 
3.1%
노유자시설 72
 
2.7%
숙박시설 51
 
1.9%
업무시설 50
 
1.9%
Other values (18) 169
 
6.3%

착공예정일
Date

MISSING 

Distinct1499
Distinct (%)59.9%
Missing172
Missing (%)6.4%
Memory size21.0 KiB
Minimum1900-01-02 00:00:00
Maximum2019-01-28 00:00:00
2023-12-11T08:05:21.231876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:21.382340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1505
Distinct (%)59.5%
Missing144
Missing (%)5.4%
Memory size21.0 KiB
Minimum1900-01-02 00:00:00
Maximum2019-01-28 00:00:00
2023-12-11T08:05:21.498063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:21.619272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

준공예정일
Date

MISSING 

Distinct31
Distinct (%)91.2%
Missing2639
Missing (%)98.7%
Memory size21.0 KiB
Minimum2002-12-04 00:00:00
Maximum2019-01-22 00:00:00
2023-12-11T08:05:21.728572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:21.822853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
Distinct486
Distinct (%)61.0%
Missing1876
Missing (%)70.2%
Memory size21.0 KiB
2023-12-11T08:05:22.039611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length8.6988708
Min length3

Characters and Unicode

Total characters6933
Distinct characters234
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique366 ?
Unique (%)45.9%

Sample

1st row청솔종합건설(주)
2nd row(주)보림종합건설
3rd row(주)메건종합건설
4th row태산종합건설 주식회사
5th row(주)보림종합건설
ValueCountFrequency (%)
주식회사 38
 
4.3%
태산종합건설(주 25
 
2.9%
가양종합건설(주 23
 
2.6%
유)동우종합건설 18
 
2.1%
주)청운건설 16
 
1.8%
주)상일종합건설 13
 
1.5%
수광종합건설(주 11
 
1.3%
청솔종합건설(주 9
 
1.0%
주)해창종합건설 9
 
1.0%
주)삼호건설 8
 
0.9%
Other values (499) 706
80.6%
2023-12-11T08:05:22.377986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
750
 
10.8%
713
 
10.3%
686
 
9.9%
) 619
 
8.9%
( 617
 
8.9%
454
 
6.5%
448
 
6.5%
157
 
2.3%
151
 
2.2%
143
 
2.1%
Other values (224) 2195
31.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5606
80.9%
Close Punctuation 619
 
8.9%
Open Punctuation 617
 
8.9%
Space Separator 79
 
1.1%
Uppercase Letter 8
 
0.1%
Lowercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
750
 
13.4%
713
 
12.7%
686
 
12.2%
454
 
8.1%
448
 
8.0%
157
 
2.8%
151
 
2.7%
143
 
2.6%
72
 
1.3%
63
 
1.1%
Other values (215) 1969
35.1%
Uppercase Letter
ValueCountFrequency (%)
H 3
37.5%
G 3
37.5%
S 1
 
12.5%
M 1
 
12.5%
Lowercase Letter
ValueCountFrequency (%)
h 2
50.0%
g 2
50.0%
Close Punctuation
ValueCountFrequency (%)
) 619
100.0%
Open Punctuation
ValueCountFrequency (%)
( 617
100.0%
Space Separator
ValueCountFrequency (%)
79
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5606
80.9%
Common 1315
 
19.0%
Latin 12
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
750
 
13.4%
713
 
12.7%
686
 
12.2%
454
 
8.1%
448
 
8.0%
157
 
2.8%
151
 
2.7%
143
 
2.6%
72
 
1.3%
63
 
1.1%
Other values (215) 1969
35.1%
Latin
ValueCountFrequency (%)
H 3
25.0%
G 3
25.0%
h 2
16.7%
g 2
16.7%
S 1
 
8.3%
M 1
 
8.3%
Common
ValueCountFrequency (%)
) 619
47.1%
( 617
46.9%
79
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5606
80.9%
ASCII 1327
 
19.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
750
 
13.4%
713
 
12.7%
686
 
12.2%
454
 
8.1%
448
 
8.0%
157
 
2.8%
151
 
2.7%
143
 
2.6%
72
 
1.3%
63
 
1.1%
Other values (215) 1969
35.1%
ASCII
ValueCountFrequency (%)
) 619
46.6%
( 617
46.5%
79
 
6.0%
H 3
 
0.2%
G 3
 
0.2%
h 2
 
0.2%
g 2
 
0.2%
S 1
 
0.1%
M 1
 
0.1%
Distinct349
Distinct (%)55.3%
Missing2042
Missing (%)76.4%
Memory size21.0 KiB
2023-12-11T08:05:22.594915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.992076
Min length11

Characters and Unicode

Total characters7567
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique261 ?
Unique (%)41.4%

Sample

1st row055-853-5207
2nd row055-757-1999
3rd row053-593-1476
4th row055-854-5010
5th row055-757-1999
ValueCountFrequency (%)
055-854-5010 29
 
4.6%
055-853-8910 24
 
3.8%
055-853-2161 22
 
3.5%
055-854-9166 18
 
2.9%
055-853-5207 12
 
1.9%
055-834-3351 12
 
1.9%
055-758-9000 10
 
1.6%
055-748-0581 9
 
1.4%
055-855-7338 9
 
1.4%
055-757-1999 7
 
1.1%
Other values (339) 479
75.9%
2023-12-11T08:05:22.934107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1751
23.1%
- 1262
16.7%
0 1127
14.9%
8 534
 
7.1%
7 485
 
6.4%
1 478
 
6.3%
2 439
 
5.8%
4 423
 
5.6%
3 420
 
5.6%
6 370
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6305
83.3%
Dash Punctuation 1262
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1751
27.8%
0 1127
17.9%
8 534
 
8.5%
7 485
 
7.7%
1 478
 
7.6%
2 439
 
7.0%
4 423
 
6.7%
3 420
 
6.7%
6 370
 
5.9%
9 278
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 1262
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7567
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1751
23.1%
- 1262
16.7%
0 1127
14.9%
8 534
 
7.1%
7 485
 
6.4%
1 478
 
6.3%
2 439
 
5.8%
4 423
 
5.6%
3 420
 
5.6%
6 370
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7567
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1751
23.1%
- 1262
16.7%
0 1127
14.9%
8 534
 
7.1%
7 485
 
6.4%
1 478
 
6.3%
2 439
 
5.8%
4 423
 
5.6%
3 420
 
5.6%
6 370
 
4.9%

감리사무소명
Text

MISSING 

Distinct381
Distinct (%)15.6%
Missing228
Missing (%)8.5%
Memory size21.0 KiB
2023-12-11T08:05:23.196915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length8
Mean length9.1578732
Min length1

Characters and Unicode

Total characters22391
Distinct characters206
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique211 ?
Unique (%)8.6%

Sample

1st row예당건축사사무소
2nd row건축사사무소 지음
3rd row예건건축사사무소
4th row원 종합건축사사무소
5th row건축사사무소가인
ValueCountFrequency (%)
건축사사무소 274
 
9.7%
홍인건축사사무소 93
 
3.3%
건축사사무소가인 91
 
3.2%
예당건축사사무소 86
 
3.0%
예건건축사사무소 84
 
3.0%
우미건축사(사 81
 
2.9%
동우건축사사무소 80
 
2.8%
건축사(사)동서건축 76
 
2.7%
수안건축사사무소 69
 
2.4%
희림건축사사무소 62
 
2.2%
Other values (355) 1837
64.8%
2023-12-11T08:05:23.521156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4872
21.8%
2743
12.3%
2575
11.5%
1997
 
8.9%
1993
 
8.9%
( 580
 
2.6%
) 580
 
2.6%
424
 
1.9%
307
 
1.4%
289
 
1.3%
Other values (196) 6031
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20653
92.2%
Open Punctuation 580
 
2.6%
Close Punctuation 580
 
2.6%
Space Separator 424
 
1.9%
Uppercase Letter 110
 
0.5%
Decimal Number 40
 
0.2%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4872
23.6%
2743
13.3%
2575
12.5%
1997
 
9.7%
1993
 
9.6%
307
 
1.5%
289
 
1.4%
288
 
1.4%
283
 
1.4%
246
 
1.2%
Other values (181) 5060
24.5%
Uppercase Letter
ValueCountFrequency (%)
M 50
45.5%
E 46
41.8%
S 5
 
4.5%
O 4
 
3.6%
A 2
 
1.8%
T 1
 
0.9%
N 1
 
0.9%
G 1
 
0.9%
Decimal Number
ValueCountFrequency (%)
2 20
50.0%
1 20
50.0%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
& 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 580
100.0%
Close Punctuation
ValueCountFrequency (%)
) 580
100.0%
Space Separator
ValueCountFrequency (%)
424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20653
92.2%
Common 1628
 
7.3%
Latin 110
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4872
23.6%
2743
13.3%
2575
12.5%
1997
 
9.7%
1993
 
9.6%
307
 
1.5%
289
 
1.4%
288
 
1.4%
283
 
1.4%
246
 
1.2%
Other values (181) 5060
24.5%
Latin
ValueCountFrequency (%)
M 50
45.5%
E 46
41.8%
S 5
 
4.5%
O 4
 
3.6%
A 2
 
1.8%
T 1
 
0.9%
N 1
 
0.9%
G 1
 
0.9%
Common
ValueCountFrequency (%)
( 580
35.6%
) 580
35.6%
424
26.0%
2 20
 
1.2%
1 20
 
1.2%
. 3
 
0.2%
& 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20653
92.2%
ASCII 1738
 
7.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4872
23.6%
2743
13.3%
2575
12.5%
1997
 
9.7%
1993
 
9.6%
307
 
1.5%
289
 
1.4%
288
 
1.4%
283
 
1.4%
246
 
1.2%
Other values (181) 5060
24.5%
ASCII
ValueCountFrequency (%)
( 580
33.4%
) 580
33.4%
424
24.4%
M 50
 
2.9%
E 46
 
2.6%
2 20
 
1.2%
1 20
 
1.2%
S 5
 
0.3%
O 4
 
0.2%
. 3
 
0.2%
Other values (5) 6
 
0.3%
Distinct264
Distinct (%)11.6%
Missing391
Missing (%)14.6%
Memory size21.0 KiB
2023-12-11T08:05:23.736171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.003067
Min length8

Characters and Unicode

Total characters27391
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)6.6%

Sample

1st row055-854-5626
2nd row055-833-9668
3rd row055-852-2303
4th row055-832-1301
5th row055-833-3724
ValueCountFrequency (%)
055-833-3724 111
 
4.9%
055-854-0205 102
 
4.5%
055-852-0941 99
 
4.3%
055-832-9005 93
 
4.1%
055-835-1727 92
 
4.0%
055-852-2303 91
 
4.0%
055-852-0071 90
 
3.9%
055-832-8010 82
 
3.6%
055-832-3650 79
 
3.5%
055-835-0770 72
 
3.2%
Other values (254) 1371
60.1%
2023-12-11T08:05:24.045489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 6476
23.6%
- 4560
16.6%
0 4317
15.8%
8 2485
 
9.1%
3 2126
 
7.8%
2 2052
 
7.5%
7 1578
 
5.8%
4 1215
 
4.4%
1 1202
 
4.4%
6 770
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22831
83.4%
Dash Punctuation 4560
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 6476
28.4%
0 4317
18.9%
8 2485
 
10.9%
3 2126
 
9.3%
2 2052
 
9.0%
7 1578
 
6.9%
4 1215
 
5.3%
1 1202
 
5.3%
6 770
 
3.4%
9 610
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 4560
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27391
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 6476
23.6%
- 4560
16.6%
0 4317
15.8%
8 2485
 
9.1%
3 2126
 
7.8%
2 2052
 
7.5%
7 1578
 
5.8%
4 1215
 
4.4%
1 1202
 
4.4%
6 770
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27391
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 6476
23.6%
- 4560
16.6%
0 4317
15.8%
8 2485
 
9.1%
3 2126
 
7.8%
2 2052
 
7.5%
7 1578
 
5.8%
4 1215
 
4.4%
1 1202
 
4.4%
6 770
 
2.8%
Distinct448
Distinct (%)16.9%
Missing20
Missing (%)0.7%
Memory size21.0 KiB
2023-12-11T08:05:24.276682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length8
Mean length9.2148511
Min length3

Characters and Unicode

Total characters24447
Distinct characters236
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique232 ?
Unique (%)8.7%

Sample

1st row건축사사무소으뜸
2nd row홍인건축사사무소
3rd row주식회사 동서이앤씨건축사사무소
4th row예당건축사사무소
5th row건축사사무소 예림
ValueCountFrequency (%)
건축사사무소 212
 
7.0%
건축사사무소가인 126
 
4.2%
우미건축사(사 122
 
4.0%
예당건축사사무소 103
 
3.4%
건축사(사)동서건축 100
 
3.3%
홍인건축사사무소 89
 
3.0%
동우건축사사무소 86
 
2.9%
예림 72
 
2.4%
희림건축사사무소 69
 
2.3%
예건건축사사무소 66
 
2.2%
Other values (457) 1969
65.3%
2023-12-11T08:05:24.776273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5317
21.7%
2964
12.1%
2823
11.5%
2109
 
8.6%
2090
 
8.5%
) 730
 
3.0%
( 728
 
3.0%
361
 
1.5%
330
 
1.3%
312
 
1.3%
Other values (226) 6683
27.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22442
91.8%
Close Punctuation 730
 
3.0%
Open Punctuation 728
 
3.0%
Space Separator 361
 
1.5%
Uppercase Letter 95
 
0.4%
Decimal Number 74
 
0.3%
Other Punctuation 14
 
0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5317
23.7%
2964
13.2%
2823
12.6%
2109
 
9.4%
2090
 
9.3%
330
 
1.5%
312
 
1.4%
303
 
1.4%
302
 
1.3%
280
 
1.2%
Other values (208) 5612
25.0%
Uppercase Letter
ValueCountFrequency (%)
M 29
30.5%
E 23
24.2%
A 13
13.7%
S 10
 
10.5%
C 10
 
10.5%
D 3
 
3.2%
T 3
 
3.2%
G 3
 
3.2%
N 1
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 37
50.0%
2 37
50.0%
Other Punctuation
ValueCountFrequency (%)
& 11
78.6%
. 3
 
21.4%
Lowercase Letter
ValueCountFrequency (%)
p 2
66.7%
m 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 730
100.0%
Open Punctuation
ValueCountFrequency (%)
( 728
100.0%
Space Separator
ValueCountFrequency (%)
361
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22442
91.8%
Common 1907
 
7.8%
Latin 98
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5317
23.7%
2964
13.2%
2823
12.6%
2109
 
9.4%
2090
 
9.3%
330
 
1.5%
312
 
1.4%
303
 
1.4%
302
 
1.3%
280
 
1.2%
Other values (208) 5612
25.0%
Latin
ValueCountFrequency (%)
M 29
29.6%
E 23
23.5%
A 13
13.3%
S 10
 
10.2%
C 10
 
10.2%
D 3
 
3.1%
T 3
 
3.1%
G 3
 
3.1%
p 2
 
2.0%
N 1
 
1.0%
Common
ValueCountFrequency (%)
) 730
38.3%
( 728
38.2%
361
18.9%
1 37
 
1.9%
2 37
 
1.9%
& 11
 
0.6%
. 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22442
91.8%
ASCII 2005
 
8.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5317
23.7%
2964
13.2%
2823
12.6%
2109
 
9.4%
2090
 
9.3%
330
 
1.5%
312
 
1.4%
303
 
1.4%
302
 
1.3%
280
 
1.2%
Other values (208) 5612
25.0%
ASCII
ValueCountFrequency (%)
) 730
36.4%
( 728
36.3%
361
18.0%
1 37
 
1.8%
2 37
 
1.8%
M 29
 
1.4%
E 23
 
1.1%
A 13
 
0.6%
& 11
 
0.5%
S 10
 
0.5%
Other values (8) 26
 
1.3%
Distinct319
Distinct (%)13.0%
Missing216
Missing (%)8.1%
Memory size21.0 KiB
2023-12-11T08:05:25.052870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.997965
Min length9

Characters and Unicode

Total characters29479
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)6.8%

Sample

1st row055-835-3312
2nd row055-854-0205
3rd row055-834-0404
4th row055-854-5626
5th row055-835-1727
ValueCountFrequency (%)
055-833-3724 126
 
5.1%
055-852-0941 122
 
5.0%
055-835-1727 109
 
4.4%
055-832-9005 104
 
4.2%
055-854-0205 98
 
4.0%
055-832-8010 83
 
3.4%
055-852-0071 73
 
3.0%
055-832-1301 70
 
2.8%
055-832-3788 69
 
2.8%
055-754-5100 68
 
2.8%
Other values (309) 1535
62.5%
2023-12-11T08:05:25.485463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 6893
23.4%
- 4912
16.7%
0 4499
15.3%
8 2349
 
8.0%
2 2317
 
7.9%
3 2182
 
7.4%
7 1875
 
6.4%
4 1425
 
4.8%
1 1317
 
4.5%
6 953
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24567
83.3%
Dash Punctuation 4912
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 6893
28.1%
0 4499
18.3%
8 2349
 
9.6%
2 2317
 
9.4%
3 2182
 
8.9%
7 1875
 
7.6%
4 1425
 
5.8%
1 1317
 
5.4%
6 953
 
3.9%
9 757
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 4912
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29479
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 6893
23.4%
- 4912
16.7%
0 4499
15.3%
8 2349
 
8.0%
2 2317
 
7.9%
3 2182
 
7.4%
7 1875
 
6.4%
4 1425
 
4.8%
1 1317
 
4.5%
6 953
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29479
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 6893
23.4%
- 4912
16.7%
0 4499
15.3%
8 2349
 
8.0%
2 2317
 
7.9%
3 2182
 
7.4%
7 1875
 
6.4%
4 1425
 
4.8%
1 1317
 
4.5%
6 953
 
3.2%

Correlations

2023-12-11T08:05:25.603808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주용도준공예정일
주용도1.000NaN
준공예정일NaN1.000

Missing values

2023-12-11T08:05:19.493578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:05:19.638656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:05:19.778192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시공 대지위치주용도착공예정일실제 착공일(착공처리일)준공예정일시공업체명(빈칸은 건축주 직영)시공업체 전화번호감리사무소명감리사무소 전화번호설계사무소명설계사무소 전화번호
0경상남도 사천시 사남면 월성리 551-4단독주택2019-01-282019-01-28<NA><NA><NA>예당건축사사무소055-854-5626건축사사무소으뜸055-835-3312
1경상남도 사천시 서포면 조도리 363창고시설2019-01-092019-01-09<NA><NA><NA><NA><NA>홍인건축사사무소055-854-0205
2경상남도 사천시 동금동 85-14제1종근린생활시설2018-12-242018-12-24<NA><NA><NA><NA><NA>주식회사 동서이앤씨건축사사무소055-834-0404
3경상남도 사천시 정동면 풍정리 382-2제2종근린생활시설2018-12-242018-12-24<NA>청솔종합건설(주)055-853-5207건축사사무소 지음055-833-9668예당건축사사무소055-854-5626
4경상남도 사천시 벌리동 253-6제1종근린생활시설2018-11-062018-11-06<NA><NA><NA>예건건축사사무소055-852-2303건축사사무소 예림055-835-1727
5경상남도 사천시 실안동 1247-3제1종근린생활시설2018-10-312018-10-31<NA><NA><NA>원 종합건축사사무소055-832-1301건축사사무소거명055-759-7751
6경상남도 사천시 축동면 구호리 75 외2필지공장2018-10-252018-10-25<NA>(주)보림종합건설055-757-1999건축사사무소가인055-833-3724예당건축사사무소055-854-5626
7경상남도 사천시 사천읍 두량리 1562-8창고시설2018-10-242018-10-26<NA><NA><NA>원종합건축사사무소055-832-1301건축사사무소 예림055-835-1727
8경상남도 사천시 정동면 예수리 162-2제1종근린생활시설2018-10-192018-10-19<NA><NA><NA>세움건축사사무소055-853-2003EM건축사사무소055-855-0977
9경상남도 사천시 곤명면 은사리 657-2 외2필지동.식물관련시설1997-05-051997-03-05<NA><NA><NA><NA><NA>홍인건축사사무소055-854-0205
시공 대지위치주용도착공예정일실제 착공일(착공처리일)준공예정일시공업체명(빈칸은 건축주 직영)시공업체 전화번호감리사무소명감리사무소 전화번호설계사무소명설계사무소 전화번호
2663경상남도 사천시 서포면 자혜리 478공동주택2015-08-19<NA>2018-07-31세한(주) 박용형<NA><NA>동우건축사사무소 김동우<NA>
2664경상남도 사천시 사남면 죽천리 866공동주택2016-07-22<NA>2016-09-27(주)청운건설 이상영<NA>(주)상언엔지니어링건축사<NA>우미건축사사무소 임성민<NA>
2665경상남도 사천시 용현면 신촌리 산 28-1공동주택2015-08-04<NA>2017-06-28(주)민용종합건설 윤을순<NA><NA>진원건축사사무소 문재옥<NA>
2666경상남도 사천시 사남면 유천리 928공동주택2016-02-11<NA>2018-03-30흥한건설(주) 김회조<NA>(주)무영씨엠건축사사무소<NA>(주)유민종합건축사사무소 신의규<NA>
2667경상남도 사천시 동금동 582공동주택2016-02-15<NA>2017-12-22가양종합건설(주) 김용국<NA>수안건축사사무소<NA>수안건축사사무소 정홍진<NA>
2668경상남도 사천시 사천읍 구암리 1450공동주택2016-04-12<NA>2017-12-22대화건설주식회사 대화건설주식회사<NA>이O운<NA>(주)조은이종칠건축사사무소 이종칠<NA>
2669경상남도 사천시 사천읍 구암리 1450공동주택2016-04-12<NA>2017-12-22대화건설주식회사 대화건설주식회사<NA>이O운<NA>(주)조은이종칠건축사사무소 이종칠<NA>
2670경상남도 사천시 용강동 550-1공동주택2017-06-15<NA>2019-01-22정우개발(주) 조철현<NA>(주)나우종합건축사사무소<NA>이노종합건축사사무소 전원배<NA>
2671경상남도 사천시 사천읍 사주리 482공동주택2016-12-12<NA>2018-12-14(주)한화건설 최광호<NA>(주) 한국건설관리공사<NA>주식회사 건축사사무소유에이디 안상훈<NA>
2672경상남도 사천시 용강동 923공동주택2017-05-22<NA>2018-06-27우진탑종합건설(주) 공희영<NA>(주)나우종합건축사사무소<NA>건축사사무소예림 안현생<NA>

Duplicate rows

Most frequently occurring

시공 대지위치주용도착공예정일실제 착공일(착공처리일)준공예정일시공업체명(빈칸은 건축주 직영)시공업체 전화번호감리사무소명감리사무소 전화번호설계사무소명설계사무소 전화번호# duplicates
2경상남도 사천시 벌리동 270-3업무시설<NA><NA><NA><NA><NA><NA><NA>가원종합건축사사무소055-852-52873
12경상남도 사천시 정동면 고읍리 438-18업무시설<NA><NA><NA><NA><NA><NA><NA>우미건축사(사)055-852-09413
0경상남도 사천시 노룡동 368-5 외1필지제2종근린생활시설2018-05-032018-05-03<NA><NA><NA>건축사사무소가인055-833-3724건축사사무소인하우스<NA>2
1경상남도 사천시 동림동 407-21단독주택2016-05-182016-05-18<NA><NA><NA>건축사사무소 돌채055-743-5200건축사사무소 돌채055-743-52002
3경상남도 사천시 벌리동 46-7단독주택2015-11-162015-11-16<NA><NA><NA>두양건축사(사)055-742-2355두양건축사(사)055-742-23552
4경상남도 사천시 사남면 월성리 418공장1900-01-021900-01-02<NA><NA><NA>건축사(사)21세기055-762-2423건축사(사)21세기055-762-24232
5경상남도 사천시 사천읍 구암리 1450공동주택2016-04-12<NA>2017-12-22대화건설주식회사 대화건설주식회사<NA>이O운<NA>(주)조은이종칠건축사사무소 이종칠<NA>2
6경상남도 사천시 사천읍 선인리 428-1 외1필지단독주택2015-11-062015-11-06<NA><NA><NA>건축사사무소으뜸055-835-6990윤건축사사무소055-746-99152
7경상남도 사천시 사천읍 수석리 268-7 외2필지제2종근린생활시설<NA><NA><NA><NA><NA><NA><NA><NA><NA>2
8경상남도 사천시 사천읍 수석리 370-1교정및군사시설<NA>2011-05-01<NA><NA><NA><NA><NA>(주)카스종합건축사사무소02-572-16362