Overview

Dataset statistics

Number of variables11
Number of observations941
Missing cells915
Missing cells (%)8.8%
Duplicate rows3
Duplicate rows (%)0.3%
Total size in memory81.0 KiB
Average record size in memory88.1 B

Variable types

Categorical2
Text5
DateTime4

Dataset

Description경상남도 사천시 건설현장 시공정보입니다. (위치, 주용도, 착공예정일, 실제착공일, 시공업체명을 볼 수 있습니다.)
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15048899

Alerts

Dataset has 3 (0.3%) duplicate rowsDuplicates
건축구분 is highly imbalanced (54.9%)Imbalance
설계사무소명 has 52 (5.5%) missing valuesMissing
시공자사무소명 has 855 (90.9%) missing valuesMissing

Reproduction

Analysis started2023-12-10 23:05:36.432974
Analysis finished2023-12-10 23:05:37.645171
Duration1.21 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

건축구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
신축
573 
증축
358 
대수선
 
7
재축
 
2
이전
 
1

Length

Max length3
Median length2
Mean length2.0074389
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row증축
2nd row신축
3rd row증축
4th row증축
5th row증축

Common Values

ValueCountFrequency (%)
신축 573
60.9%
증축 358
38.0%
대수선 7
 
0.7%
재축 2
 
0.2%
이전 1
 
0.1%

Length

2023-12-11T08:05:37.723412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:05:37.827240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신축 573
60.9%
증축 358
38.0%
대수선 7
 
0.7%
재축 2
 
0.2%
이전 1
 
0.1%
Distinct875
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-11T08:05:38.169410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length21.622742
Min length15

Characters and Unicode

Total characters20347
Distinct characters128
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique815 ?
Unique (%)86.6%

Sample

1st row경상남도 사천시 향촌동 1324
2nd row경상남도 사천시 사천읍 정의리 437-16
3rd row경상남도 사천시 서포면 선전리 927
4th row경상남도 사천시 사남면 유천리 897
5th row경상남도 사천시 사남면 유천리 901 외1필지
ValueCountFrequency (%)
경상남도 941
20.0%
사천시 941
20.0%
서포면 159
 
3.4%
외1필지 142
 
3.0%
사천읍 100
 
2.1%
정동면 91
 
1.9%
사남면 78
 
1.7%
곤양면 77
 
1.6%
용현면 72
 
1.5%
곤명면 60
 
1.3%
Other values (947) 2036
43.3%
2023-12-11T08:05:38.633465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3756
18.5%
1161
 
5.7%
1083
 
5.3%
1038
 
5.1%
956
 
4.7%
941
 
4.6%
941
 
4.6%
941
 
4.6%
1 843
 
4.1%
700
 
3.4%
Other values (118) 7987
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12139
59.7%
Decimal Number 3799
 
18.7%
Space Separator 3756
 
18.5%
Dash Punctuation 653
 
3.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1161
 
9.6%
1083
 
8.9%
1038
 
8.6%
956
 
7.9%
941
 
7.8%
941
 
7.8%
941
 
7.8%
700
 
5.8%
575
 
4.7%
420
 
3.5%
Other values (106) 3383
27.9%
Decimal Number
ValueCountFrequency (%)
1 843
22.2%
2 497
13.1%
3 412
10.8%
4 395
10.4%
5 342
9.0%
7 287
 
7.6%
6 283
 
7.4%
8 275
 
7.2%
0 254
 
6.7%
9 211
 
5.6%
Space Separator
ValueCountFrequency (%)
3756
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 653
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12139
59.7%
Common 8208
40.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1161
 
9.6%
1083
 
8.9%
1038
 
8.6%
956
 
7.9%
941
 
7.8%
941
 
7.8%
941
 
7.8%
700
 
5.8%
575
 
4.7%
420
 
3.5%
Other values (106) 3383
27.9%
Common
ValueCountFrequency (%)
3756
45.8%
1 843
 
10.3%
- 653
 
8.0%
2 497
 
6.1%
3 412
 
5.0%
4 395
 
4.8%
5 342
 
4.2%
7 287
 
3.5%
6 283
 
3.4%
8 275
 
3.4%
Other values (2) 465
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12139
59.7%
ASCII 8208
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3756
45.8%
1 843
 
10.3%
- 653
 
8.0%
2 497
 
6.1%
3 412
 
5.0%
4 395
 
4.8%
5 342
 
4.2%
7 287
 
3.5%
6 283
 
3.4%
8 275
 
3.4%
Other values (2) 465
 
5.7%
Hangul
ValueCountFrequency (%)
1161
 
9.6%
1083
 
8.9%
1038
 
8.6%
956
 
7.9%
941
 
7.8%
941
 
7.8%
941
 
7.8%
700
 
5.8%
575
 
4.7%
420
 
3.5%
Other values (106) 3383
27.9%
Distinct571
Distinct (%)60.7%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
Minimum1995-04-21 00:00:00
Maximum2023-07-07 00:00:00
2023-12-11T08:05:38.807321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:38.932006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct535
Distinct (%)56.9%
Missing1
Missing (%)0.1%
Memory size7.5 KiB
Minimum2019-07-04 00:00:00
Maximum2023-08-04 00:00:00
2023-12-11T08:05:39.060292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:39.200274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct574
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
Minimum2019-11-13 00:00:00
Maximum2023-10-23 00:00:00
2023-12-11T08:05:39.329218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:39.485867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주용도
Categorical

Distinct23
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
단독주택
376 
제2종근린생활시설
150 
동물및식물관련시설
105 
창고시설
98 
제1종근린생활시설
88 
Other values (18)
124 

Length

Max length10
Median length4
Mean length5.848034
Min length2

Unique

Unique6 ?
Unique (%)0.6%

Sample

1st row공장
2nd row단독주택
3rd row제1종근린생활시설
4th row공장
5th row공장

Common Values

ValueCountFrequency (%)
단독주택 376
40.0%
제2종근린생활시설 150
 
15.9%
동물및식물관련시설 105
 
11.2%
창고시설 98
 
10.4%
제1종근린생활시설 88
 
9.4%
공장 45
 
4.8%
노유자시설 18
 
1.9%
숙박시설 9
 
1.0%
자동차관련시설 8
 
0.9%
야영장시설 8
 
0.9%
Other values (13) 36
 
3.8%

Length

2023-12-11T08:05:39.615297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 376
40.0%
제2종근린생활시설 150
 
15.9%
동물및식물관련시설 105
 
11.2%
창고시설 98
 
10.4%
제1종근린생활시설 88
 
9.4%
공장 45
 
4.8%
노유자시설 18
 
1.9%
숙박시설 9
 
1.0%
자동차관련시설 8
 
0.9%
야영장시설 8
 
0.9%
Other values (13) 36
 
3.8%

설계사무소명
Text

MISSING 

Distinct151
Distinct (%)17.0%
Missing52
Missing (%)5.5%
Memory size7.5 KiB
2023-12-11T08:05:39.796827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length8
Mean length8.9336333
Min length7

Characters and Unicode

Total characters7942
Distinct characters158
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)8.2%

Sample

1st row(주)수안건축사사무소
2nd row이가건축사사무소
3rd row건축사사무소가인
4th row(주)다온건축사사무소
5th row(주)동원건축사사무소
ValueCountFrequency (%)
건축사사무소 113
 
11.1%
건축사사무소가인 60
 
5.9%
예림 46
 
4.5%
홍인건축사사무소 45
 
4.4%
으뜸건축사사무소 42
 
4.1%
강현 38
 
3.7%
예당건축사사무소 35
 
3.4%
주)수안건축사사무소 34
 
3.3%
우미건축사(사 33
 
3.2%
건축사(사)동서건축 33
 
3.2%
Other values (144) 539
52.9%
2023-12-11T08:05:40.090995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1793
22.6%
960
12.1%
929
11.7%
819
10.3%
815
10.3%
( 133
 
1.7%
133
 
1.7%
) 133
 
1.7%
129
 
1.6%
127
 
1.6%
Other values (148) 1971
24.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7497
94.4%
Open Punctuation 133
 
1.7%
Close Punctuation 133
 
1.7%
Space Separator 129
 
1.6%
Uppercase Letter 42
 
0.5%
Decimal Number 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1793
23.9%
960
12.8%
929
12.4%
819
10.9%
815
10.9%
133
 
1.8%
127
 
1.7%
103
 
1.4%
101
 
1.3%
77
 
1.0%
Other values (134) 1640
21.9%
Uppercase Letter
ValueCountFrequency (%)
M 17
40.5%
E 13
31.0%
S 6
 
14.3%
H 1
 
2.4%
D 1
 
2.4%
A 1
 
2.4%
G 1
 
2.4%
T 1
 
2.4%
O 1
 
2.4%
Decimal Number
ValueCountFrequency (%)
1 4
50.0%
2 4
50.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Close Punctuation
ValueCountFrequency (%)
) 133
100.0%
Space Separator
ValueCountFrequency (%)
129
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7497
94.4%
Common 403
 
5.1%
Latin 42
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1793
23.9%
960
12.8%
929
12.4%
819
10.9%
815
10.9%
133
 
1.8%
127
 
1.7%
103
 
1.4%
101
 
1.3%
77
 
1.0%
Other values (134) 1640
21.9%
Latin
ValueCountFrequency (%)
M 17
40.5%
E 13
31.0%
S 6
 
14.3%
H 1
 
2.4%
D 1
 
2.4%
A 1
 
2.4%
G 1
 
2.4%
T 1
 
2.4%
O 1
 
2.4%
Common
ValueCountFrequency (%)
( 133
33.0%
) 133
33.0%
129
32.0%
1 4
 
1.0%
2 4
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7497
94.4%
ASCII 445
 
5.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1793
23.9%
960
12.8%
929
12.4%
819
10.9%
815
10.9%
133
 
1.8%
127
 
1.7%
103
 
1.4%
101
 
1.3%
77
 
1.0%
Other values (134) 1640
21.9%
ASCII
ValueCountFrequency (%)
( 133
29.9%
) 133
29.9%
129
29.0%
M 17
 
3.8%
E 13
 
2.9%
S 6
 
1.3%
1 4
 
0.9%
2 4
 
0.9%
H 1
 
0.2%
D 1
 
0.2%
Other values (4) 4
 
0.9%
Distinct202
Distinct (%)21.5%
Missing3
Missing (%)0.3%
Memory size7.5 KiB
2023-12-11T08:05:40.278816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length42
Mean length27.487207
Min length15

Characters and Unicode

Total characters25783
Distinct characters246
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)11.9%

Sample

1st row경상남도 사천시 용현면 시청2길 9-5, 서원래온시티 202호
2nd row경상남도 사천시 정동면 진삼로 1386, 사천3차 한보아파트 102동 상가204호
3rd row경상남도 사천시 사천읍 읍내1길 117, 2층
4th row경상남도 함안군 가야읍 함안대로 735, 2층(함안상공회의소)
5th row경기도 시흥시 군자천로 335, 305호
ValueCountFrequency (%)
경상남도 916
 
16.7%
사천시 698
 
12.7%
사천읍 278
 
5.1%
용현면 169
 
3.1%
진주시 154
 
2.8%
2층 150
 
2.7%
3층 128
 
2.3%
진삼로 103
 
1.9%
1층 93
 
1.7%
21 68
 
1.2%
Other values (447) 2734
49.8%
2023-12-11T08:05:40.606179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4562
 
17.7%
1188
 
4.6%
1 1120
 
4.3%
1065
 
4.1%
1051
 
4.1%
1015
 
3.9%
1006
 
3.9%
931
 
3.6%
928
 
3.6%
2 787
 
3.1%
Other values (236) 12130
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15307
59.4%
Space Separator 4562
 
17.7%
Decimal Number 4257
 
16.5%
Other Punctuation 773
 
3.0%
Open Punctuation 330
 
1.3%
Close Punctuation 330
 
1.3%
Dash Punctuation 160
 
0.6%
Uppercase Letter 63
 
0.2%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1188
 
7.8%
1065
 
7.0%
1051
 
6.9%
1015
 
6.6%
1006
 
6.6%
931
 
6.1%
928
 
6.1%
527
 
3.4%
496
 
3.2%
460
 
3.0%
Other values (211) 6640
43.4%
Decimal Number
ValueCountFrequency (%)
1 1120
26.3%
2 787
18.5%
3 507
11.9%
4 482
11.3%
0 370
 
8.7%
5 267
 
6.3%
8 208
 
4.9%
6 207
 
4.9%
9 174
 
4.1%
7 135
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
I 20
31.7%
K 18
28.6%
B 13
20.6%
T 4
 
6.3%
A 3
 
4.8%
H 2
 
3.2%
S 1
 
1.6%
J 1
 
1.6%
F 1
 
1.6%
Space Separator
ValueCountFrequency (%)
4562
100.0%
Other Punctuation
ValueCountFrequency (%)
, 773
100.0%
Open Punctuation
ValueCountFrequency (%)
( 330
100.0%
Close Punctuation
ValueCountFrequency (%)
) 330
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 160
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15307
59.4%
Common 10412
40.4%
Latin 64
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1188
 
7.8%
1065
 
7.0%
1051
 
6.9%
1015
 
6.6%
1006
 
6.6%
931
 
6.1%
928
 
6.1%
527
 
3.4%
496
 
3.2%
460
 
3.0%
Other values (211) 6640
43.4%
Common
ValueCountFrequency (%)
4562
43.8%
1 1120
 
10.8%
2 787
 
7.6%
, 773
 
7.4%
3 507
 
4.9%
4 482
 
4.6%
0 370
 
3.6%
( 330
 
3.2%
) 330
 
3.2%
5 267
 
2.6%
Other values (5) 884
 
8.5%
Latin
ValueCountFrequency (%)
I 20
31.2%
K 18
28.1%
B 13
20.3%
T 4
 
6.2%
A 3
 
4.7%
H 2
 
3.1%
S 1
 
1.6%
J 1
 
1.6%
1
 
1.6%
F 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15307
59.4%
ASCII 10475
40.6%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4562
43.6%
1 1120
 
10.7%
2 787
 
7.5%
, 773
 
7.4%
3 507
 
4.8%
4 482
 
4.6%
0 370
 
3.5%
( 330
 
3.2%
) 330
 
3.2%
5 267
 
2.5%
Other values (14) 947
 
9.0%
Hangul
ValueCountFrequency (%)
1188
 
7.8%
1065
 
7.0%
1051
 
6.9%
1015
 
6.6%
1006
 
6.6%
931
 
6.1%
928
 
6.1%
527
 
3.4%
496
 
3.2%
460
 
3.0%
Other values (211) 6640
43.4%
Number Forms
ValueCountFrequency (%)
1
100.0%

시공자사무소명
Text

MISSING 

Distinct65
Distinct (%)75.6%
Missing855
Missing (%)90.9%
Memory size7.5 KiB
2023-12-11T08:05:40.812261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length12
Mean length8.4069767
Min length4

Characters and Unicode

Total characters723
Distinct characters118
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)64.0%

Sample

1st row주식회사 지평종합건설
2nd row(주)바르도종합건설
3rd row(주)이엔에프건설
4th row가양종합건설(주)
5th row일성종합건설주식회사
ValueCountFrequency (%)
주)상일종합건설 8
 
9.0%
두강건설(주 4
 
4.5%
청솔종합건설(주 3
 
3.4%
극동글로벌(주 3
 
3.4%
직영공사 3
 
3.4%
주식회사 3
 
3.4%
금명종합건설(주 2
 
2.2%
주)더온종합건설 2
 
2.2%
동남건설(주 2
 
2.2%
호산종합건설(주 2
 
2.2%
Other values (56) 57
64.0%
2023-12-11T08:05:41.241926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
80
 
11.1%
69
 
9.5%
68
 
9.4%
( 67
 
9.3%
) 67
 
9.3%
45
 
6.2%
44
 
6.1%
17
 
2.4%
14
 
1.9%
13
 
1.8%
Other values (108) 239
33.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 586
81.1%
Open Punctuation 67
 
9.3%
Close Punctuation 67
 
9.3%
Space Separator 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
13.7%
69
 
11.8%
68
 
11.6%
45
 
7.7%
44
 
7.5%
17
 
2.9%
14
 
2.4%
13
 
2.2%
10
 
1.7%
9
 
1.5%
Other values (105) 217
37.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 586
81.1%
Common 137
 
18.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
13.7%
69
 
11.8%
68
 
11.6%
45
 
7.7%
44
 
7.5%
17
 
2.9%
14
 
2.4%
13
 
2.2%
10
 
1.7%
9
 
1.5%
Other values (105) 217
37.0%
Common
ValueCountFrequency (%)
( 67
48.9%
) 67
48.9%
3
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 586
81.1%
ASCII 137
 
18.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
80
 
13.7%
69
 
11.8%
68
 
11.6%
45
 
7.7%
44
 
7.5%
17
 
2.9%
14
 
2.4%
13
 
2.2%
10
 
1.7%
9
 
1.5%
Other values (105) 217
37.0%
ASCII
ValueCountFrequency (%)
( 67
48.9%
) 67
48.9%
3
 
2.2%
Distinct846
Distinct (%)90.3%
Missing4
Missing (%)0.4%
Memory size7.5 KiB
2023-12-11T08:05:41.980056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length47
Mean length26.467449
Min length14

Characters and Unicode

Total characters24800
Distinct characters378
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique781 ?
Unique (%)83.4%

Sample

1st row경상남도 창원시 진해구 진해대로 626-1 (경화동)
2nd row경상남도 사천시 사천읍 용당길 71-16
3rd row부산광역시 부산진구 성지로61번길 14, 1층
4th row경상남도 창원시 의창구 대봉로26번길 4-5, 601호(봉림동,YG빌딩)
5th row경상남도 사천시 축동면 운계길 81, 0
ValueCountFrequency (%)
경상남도 846
 
16.7%
사천시 683
 
13.4%
사천읍 105
 
2.1%
진주시 98
 
1.9%
서포면 79
 
1.6%
사남면 65
 
1.3%
정동면 64
 
1.3%
용현면 53
 
1.0%
곤양면 50
 
1.0%
부산광역시 36
 
0.7%
Other values (1638) 3001
59.1%
2023-12-11T08:05:42.595497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4149
 
16.7%
1 1125
 
4.5%
957
 
3.9%
955
 
3.9%
945
 
3.8%
895
 
3.6%
895
 
3.6%
891
 
3.6%
887
 
3.6%
0 702
 
2.8%
Other values (368) 12399
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14448
58.3%
Decimal Number 4795
 
19.3%
Space Separator 4149
 
16.7%
Other Punctuation 492
 
2.0%
Dash Punctuation 313
 
1.3%
Open Punctuation 281
 
1.1%
Close Punctuation 281
 
1.1%
Uppercase Letter 38
 
0.2%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
957
 
6.6%
955
 
6.6%
945
 
6.5%
895
 
6.2%
895
 
6.2%
891
 
6.2%
887
 
6.1%
651
 
4.5%
544
 
3.8%
488
 
3.4%
Other values (337) 6340
43.9%
Uppercase Letter
ValueCountFrequency (%)
A 16
42.1%
C 7
18.4%
K 4
 
10.5%
G 2
 
5.3%
I 2
 
5.3%
P 2
 
5.3%
T 1
 
2.6%
R 1
 
2.6%
L 1
 
2.6%
Y 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 1125
23.5%
0 702
14.6%
2 700
14.6%
3 492
10.3%
4 383
 
8.0%
5 329
 
6.9%
6 325
 
6.8%
7 263
 
5.5%
9 245
 
5.1%
8 231
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 489
99.4%
' 1
 
0.2%
@ 1
 
0.2%
. 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
c 2
66.7%
k 1
33.3%
Space Separator
ValueCountFrequency (%)
4149
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 313
100.0%
Open Punctuation
ValueCountFrequency (%)
( 281
100.0%
Close Punctuation
ValueCountFrequency (%)
) 281
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14448
58.3%
Common 10311
41.6%
Latin 41
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
957
 
6.6%
955
 
6.6%
945
 
6.5%
895
 
6.2%
895
 
6.2%
891
 
6.2%
887
 
6.1%
651
 
4.5%
544
 
3.8%
488
 
3.4%
Other values (337) 6340
43.9%
Common
ValueCountFrequency (%)
4149
40.2%
1 1125
 
10.9%
0 702
 
6.8%
2 700
 
6.8%
3 492
 
4.8%
, 489
 
4.7%
4 383
 
3.7%
5 329
 
3.2%
6 325
 
3.2%
- 313
 
3.0%
Other values (8) 1304
 
12.6%
Latin
ValueCountFrequency (%)
A 16
39.0%
C 7
17.1%
K 4
 
9.8%
G 2
 
4.9%
I 2
 
4.9%
P 2
 
4.9%
c 2
 
4.9%
T 1
 
2.4%
R 1
 
2.4%
L 1
 
2.4%
Other values (3) 3
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14448
58.3%
ASCII 10352
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4149
40.1%
1 1125
 
10.9%
0 702
 
6.8%
2 700
 
6.8%
3 492
 
4.8%
, 489
 
4.7%
4 383
 
3.7%
5 329
 
3.2%
6 325
 
3.1%
- 313
 
3.0%
Other values (21) 1345
 
13.0%
Hangul
ValueCountFrequency (%)
957
 
6.6%
955
 
6.6%
945
 
6.5%
895
 
6.2%
895
 
6.2%
891
 
6.2%
887
 
6.1%
651
 
4.5%
544
 
3.8%
488
 
3.4%
Other values (337) 6340
43.9%
Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
Minimum2023-06-01 00:00:00
Maximum2023-10-31 00:00:00
2023-12-11T08:05:42.704593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:05:42.810755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

Correlations

2023-12-11T08:05:42.917210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축구분주용도시공자사무소명데이터기준일자
건축구분1.0000.3230.0000.000
주용도0.3231.0000.6830.162
시공자사무소명0.0000.6831.0000.868
데이터기준일자0.0000.1620.8681.000
2023-12-11T08:05:43.035371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주용도건축구분
주용도1.0000.163
건축구분0.1631.000
2023-12-11T08:05:43.120730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축구분주용도
건축구분1.0000.163
주용도0.1631.000

Missing values

2023-12-11T08:05:37.283926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:05:37.435581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:05:37.575473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

건축구분대지위치허가일착공처리일사용승인일주용도설계사무소명설계자 도로명주소시공자사무소명시공자 도로명주소데이터기준일자
0증축경상남도 사천시 향촌동 13242023-07-072023-08-042023-09-20공장(주)수안건축사사무소경상남도 사천시 용현면 시청2길 9-5, 서원래온시티 202호주식회사 지평종합건설경상남도 창원시 진해구 진해대로 626-1 (경화동)2023-10-31
1신축경상남도 사천시 사천읍 정의리 437-162023-06-122023-06-162023-10-17단독주택이가건축사사무소경상남도 사천시 정동면 진삼로 1386, 사천3차 한보아파트 102동 상가204호<NA>경상남도 사천시 사천읍 용당길 71-162023-10-31
2증축경상남도 사천시 서포면 선전리 9272023-06-052023-06-142023-08-14제1종근린생활시설건축사사무소가인경상남도 사천시 사천읍 읍내1길 117, 2층(주)바르도종합건설부산광역시 부산진구 성지로61번길 14, 1층2023-10-31
3증축경상남도 사천시 사남면 유천리 8972023-05-262023-06-132023-10-12공장(주)다온건축사사무소경상남도 함안군 가야읍 함안대로 735, 2층(함안상공회의소)(주)이엔에프건설경상남도 창원시 의창구 대봉로26번길 4-5, 601호(봉림동,YG빌딩)2023-10-31
4증축경상남도 사천시 사남면 유천리 901 외1필지2023-05-232023-06-092023-10-23공장(주)동원건축사사무소경기도 시흥시 군자천로 335, 305호가양종합건설(주)경상남도 사천시 축동면 운계길 81, 02023-10-31
5증축경상남도 사천시 대방동 185-16 외1필지2023-05-172023-05-262023-07-25공장건축사사무소 강현경상남도 사천시 사천읍 서재농청길 49, 1층<NA>경상남도 사천시 대방길 60-49 (대방동)2023-10-31
6신축경상남도 사천시 서금동 144-12023-05-022023-05-082023-07-12제2종근린생활시설이가건축사사무소경상남도 사천시 정동면 진삼로 1386, 사천3차 한보아파트 102동 상가204호<NA>경상남도 사천시 팔포3길 65-32023-10-31
7신축경상남도 사천시 용현면 신복리 산 123-172023-05-022023-06-022023-09-22창고시설건축사사무소뜰경상남도 진주시 북장대로63번길 10 (2층)일성종합건설주식회사경상남도 김해시 진례면 고모로324번안길 90, 2층2023-10-31
8신축경상남도 사천시 죽림동 851-22023-04-212023-05-032023-08-07창고시설이가건축사사무소경상남도 사천시 정동면 진삼로 1386, 사천3차 한보아파트 102동 상가204호<NA>경상남도 사천시 문화안길 67-82023-10-31
9증축경상남도 사천시 백천동 108-12023-03-282023-05-242023-06-27종교시설건축사사무소 강현경상남도 사천시 사천읍 서재농청길 49, 1층<NA>경상남도 사천시 백천길 326-22023-10-31
건축구분대지위치허가일착공처리일사용승인일주용도설계사무소명설계자 도로명주소시공자사무소명시공자 도로명주소데이터기준일자
931신축경상남도 사천시 서포면 비토리 12-172019-03-192021-04-082021-11-04단독주택건축사사무소가토경상남도 진주시 말티고개로123번길 8-1, 3층<NA>경상남도 사천시 사남면 월성1길 89, 111동1305호(엘아이지)2023-06-01
932신축경상남도 사천시 서포면 비토리 12-162019-03-192021-07-302023-01-20단독주택건축사사무소가토경상남도 진주시 말티고개로123번길 8-1, 3층<NA>경상남도 사천시 사천읍 사천향교로 13, 101동404호2023-06-01
933신축경상남도 사천시 곤양면 송전리 7862019-03-182020-11-242021-01-12동물및식물관련시설건축사사무소가인경상남도 사천시 사천읍 읍내1길 117, 2층<NA>경상남도 사천시 곤양면 포곡길 2622023-06-01
934증축경상남도 사천시 축동면 탑리 1 외1필지2018-12-312021-01-202021-07-14공장아담건축사사무소경상남도 사천시 사천읍 구암두문로 46 (3층)<NA>경상남도 진주시 새평거로 30, 109동 902호(평거동,엠코타운더프라하)2023-06-01
935신축경상남도 사천시 사남면 죽천리 694-42018-12-192020-12-082021-11-09제2종근린생활시설건축사사무소 나우경상남도 진주시 충의로 119, A동 204호 (충무공동, 아슬란몰)<NA>부산광역시 강서구 공항로743번길 48-52, 대저2동2023-06-01
936신축경상남도 사천시 사천읍 구암리 산 55-22018-11-082019-11-132020-02-25단독주택건축사사무소가인경상남도 사천시 사천읍 읍내1길 117<NA>경상남도 사천시 사천읍 사천대로 1996-72023-06-01
937증축경상남도 사천시 곤양면 대진리 산 86-12018-10-082021-03-122021-11-04동물및식물관련시설예당건축사사무소경상남도 사천시 사천읍 사천대로 1882, 3층<NA>경상남도 사천시 곤양면 가화길 1092023-06-01
938신축경상남도 사천시 곤양면 묵곡리 산 1532018-09-052020-01-312022-04-13제2종근린생활시설(주)제이와이엔지니어링건축사사무소경상남도 진주시 동부로169번길 12, B동 604호(충무공동, 윙스타워)농업회사법인청산주식회사경상남도 사천시 곤명면 묵성로 3712023-06-01
939증축경상남도 사천시 서동 72-12018-03-162020-01-302020-02-26의료시설더원건축사사무소경상남도 창원시 마산합포구 장군천로 32 (장군동3가)(주)상일종합건설경상남도 사천시 구미2길 119-92023-06-01
940신축경상남도 사천시 축동면 배춘리 732-391995-04-212020-07-022020-07-20단독주택건축사사무소가인경상남도 사천시 사천읍 읍내1길 117<NA>경상남도 사천시 축동면 길평3길 10-152023-06-01

Duplicate rows

Most frequently occurring

건축구분대지위치허가일착공처리일사용승인일주용도설계사무소명설계자 도로명주소시공자사무소명시공자 도로명주소데이터기준일자# duplicates
0신축경상남도 사천시 곤명면 추천리 6052019-11-012019-11-072020-06-05단독주택건축사사무소모아SM경상남도 진주시 솔밭로 142<NA>경상남도 진주시 명석면 광제산로 11-21, 104동 1503호(동신아파트)2023-06-012
1신축경상남도 사천시 서포면 자혜리 4782020-05-262020-07-172020-09-09단독주택건축사사무소가인경상남도 사천시 사천읍 읍내1길 117<NA>경상남도 사천시 사천대로 728, A동 107호(노룡동)2023-06-012
2신축경상남도 사천시 서포면 자혜리 478-29 외1필지2020-06-102020-08-062020-12-23단독주택건축사사무소가인경상남도 사천시 사천읍 읍내1길 117<NA>경상남도 사천시 사천대로 728, A동 107호(노룡동)2023-06-012