Overview

Dataset statistics

Number of variables: 8
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.5 KiB
Average record size in memory: 71.8 B

Variable types

Numeric: 4
Text: 1
DateTime: 1
Categorical: 2

Dataset

Description: Data on general buildings in Chungcheongnam-do whose construction has been suspended (location, permit year, structure, number of buildings, use, progress rate, etc.). As of 2022-12-31.
URL: https://www.data.go.kr/data/15096329/fileData.do

Alerts

중단기간(개월) is highly overall correlated with 구 조 (High correlation)
구 조 is highly overall correlated with 중단기간(개월) (High correlation)
구 조 is highly imbalanced (68.4%) (Imbalance)
연번 has unique values (Unique)
위 치 has unique values (Unique)
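The 68.4% imbalance figure for 구 조 can be reproduced from its class counts (33 and 2, as listed under that variable). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by its maximum (log2 of the number of classes). That assumption reproduces the reported value; the function name `imbalance_score` is illustrative and does not come from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max for a list of class counts.

    Assumed definition: 0 means perfectly balanced classes,
    values near 1 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # The normalization is undefined for a single class;
        # returning 0.0 here is a convention chosen for this sketch.
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)

score = imbalance_score([33, 2])  # 철근콘크리트: 33, 철골철근콘크리트: 2
print(f"{score:.1%}")
```

A perfectly balanced split such as `[10, 10]` scores 0 under this definition, so the alert reflects how far the 33-to-2 split is from an even one.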

Reproduction

Analysis started: 2023-12-12 07:09:25.617255
Analysis finished: 2023-12-12 07:09:28.033309
Duration: 2.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
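A report of this kind can be regenerated roughly as follows. This is a minimal sketch, not the exact configuration used here: the function name `build_profile`, the file paths, the `cp949` encoding, and the report title are all assumptions, and the `config.json` referenced above is not applied.

```python
def build_profile(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so the function can be defined even when the
    # packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is a CSV file. Korean public-data files are
    # often cp949-encoded, so adjust `encoding` to the actual file.
    df = pd.read_csv(csv_path, encoding="cp949", parse_dates=["허가년도"])
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(html_path)
    return profile
```

Calling `build_profile("data.csv")` with a locally saved copy of the dataset would write `report.html` next to the script. Results may differ slightly across ydata-profiling versions.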

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18
Minimum: 1
Maximum: 35
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.7
Q1: 9.5
median: 18
Q3: 26.5
95-th percentile: 33.3
Maximum: 35
Range: 34
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 10.246951
Coefficient of variation (CV): 0.56927504
Kurtosis: -1.2
Mean: 18
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 630
Variance: 105
Monotonicity: Strictly increasing
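Because 연번 is simply the sequence 1 to 35, several of these figures can be checked by hand. The sketch below uses only the Python standard library and assumes the report uses the sample (n - 1) variance, which is what matches the value 105.

```python
import statistics

values = list(range(1, 36))  # 연번 runs from 1 to 35 with no gaps

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of distances from the median.
mad = statistics.median(abs(v - median) for v in values)
# Strictly increasing: every value is smaller than its successor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 6), round(cv, 8), mad, strictly_increasing)
```

The CV is the standard deviation divided by the mean, so it is only meaningful here as a consistency check; for a row counter it carries no analytical weight.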
Histogram with fixed size bins (bins=35)

Common values

Value  Count  Frequency (%)
1  1  2.9%
2  1  2.9%
21  1  2.9%
22  1  2.9%
23  1  2.9%
24  1  2.9%
25  1  2.9%
26  1  2.9%
27  1  2.9%
28  1  2.9%
Other values (25)  25  71.4%

Minimum 10 values

Value  Count  Frequency (%)
1  1  2.9%
2  1  2.9%
3  1  2.9%
4  1  2.9%
5  1  2.9%
6  1  2.9%
7  1  2.9%
8  1  2.9%
9  1  2.9%
10  1  2.9%

Maximum 10 values

Value  Count  Frequency (%)
35  1  2.9%
34  1  2.9%
33  1  2.9%
32  1  2.9%
31  1  2.9%
30  1  2.9%
29  1  2.9%
28  1  2.9%
27  1  2.9%
26  1  2.9%

위 치
Text

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 26
Median length: 22
Mean length: 17.971429
Min length: 11

Characters and Unicode

Total characters: 629
Distinct characters: 100
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
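As an illustration of these character properties, the sketch below counts the Unicode general category of each character in the first sample value of 위 치, using Python's standard `unicodedata` module. The category codes map to the names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `Pd` is Dash Punctuation.

```python
import unicodedata
from collections import Counter

# First sample value of 위 치 listed in this report.
sample = "천안시 동남구 목천읍 신계리 125-10외 10"

# Tally the general category (two-letter code) of every character.
categories = Counter(unicodedata.category(ch) for ch in sample)
print(len(sample), dict(categories))
```

Summing such tallies over all 35 values would give the per-category totals reported for this variable.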

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 천안시 동남구 목천읍 신계리 125-10외 10
2nd row: 공주시 계룡면 구왕리 918-8외 3
3rd row: 공주시 계룡면 중장리 24-1외 5
4th row: 보령시 남포면 봉덕리 28-1외 2
5th row: 보령시 남포면 삼현리 700-4외 1

Value  Count  Frequency (%)
당진시  5  3.2%
아산시  5  3.2%
1  5  3.2%
청양군  5  3.2%
2  5  3.2%
예산군  4  2.5%
적곡리  4  2.5%
장평면  4  2.5%
계룡면  3  1.9%
태안군  3  1.9%
Other values (107)  115  72.8%

Most occurring characters

ValueCountFrequency (%)
123
19.6%
1 41
 
6.5%
30
 
4.8%
- 28
 
4.5%
4 24
 
3.8%
2 23
 
3.7%
22
 
3.5%
22
 
3.5%
5 19
 
3.0%
18
 
2.9%
Other values (90) 279
44.4%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  323  51.4%
Decimal Number  155  24.6%
Space Separator  123  19.6%
Dash Punctuation  28  4.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
9.3%
22
 
6.8%
22
 
6.8%
18
 
5.6%
17
 
5.3%
17
 
5.3%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
Other values (78) 165
51.1%
Decimal Number
ValueCountFrequency (%)
1 41
26.5%
4 24
15.5%
2 23
14.8%
5 19
12.3%
3 13
 
8.4%
8 9
 
5.8%
0 9
 
5.8%
9 9
 
5.8%
6 4
 
2.6%
7 4
 
2.6%
Space Separator
ValueCountFrequency (%)
123
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  323  51.4%
Common  306  48.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
9.3%
22
 
6.8%
22
 
6.8%
18
 
5.6%
17
 
5.3%
17
 
5.3%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
Other values (78) 165
51.1%
Common
ValueCountFrequency (%)
123
40.2%
1 41
 
13.4%
- 28
 
9.2%
4 24
 
7.8%
2 23
 
7.5%
5 19
 
6.2%
3 13
 
4.2%
8 9
 
2.9%
0 9
 
2.9%
9 9
 
2.9%
Other values (2) 8
 
2.6%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  323  51.4%
ASCII  306  48.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
123
40.2%
1 41
 
13.4%
- 28
 
9.2%
4 24
 
7.8%
2 23
 
7.5%
5 19
 
6.2%
3 13
 
4.2%
8 9
 
2.9%
0 9
 
2.9%
9 9
 
2.9%
Other values (2) 8
 
2.6%
Hangul
ValueCountFrequency (%)
30
 
9.3%
22
 
6.8%
22
 
6.8%
18
 
5.6%
17
 
5.3%
17
 
5.3%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
Other values (78) 165
51.1%

허가년도
DateTime

Distinct: 31
Distinct (%): 88.6%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 1987-09-01 00:00:00
Maximum: 2017-11-01 00:00:00
Histogram with fixed size bins (bins=31)

구 조
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 5.7%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 8
Median length: 6
Mean length: 6.1142857
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 철근콘크리트
2nd row: 철근콘크리트
3rd row: 철근콘크리트
4th row: 철근콘크리트
5th row: 철근콘크리트

Common Values

Value  Count  Frequency (%)
철근콘크리트  33  94.3%
철골철근콘크리트  2  5.7%

Length

Histogram of lengths of the category

동수
Real number (ℝ)

Distinct: 9
Distinct (%): 25.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.1714286
Minimum: 1
Maximum: 16
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 4
95-th percentile: 10.5
Maximum: 16
Range: 15
Interquartile range (IQR): 3
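The quantiles of 동수 can be checked by rebuilding the column from the value counts this report lists for it. The sketch below assumes percentiles are computed with linear interpolation between the two closest ranks, which reproduces the reported 95-th percentile of 10.5. The helper `percentile` is illustrative, not part of the profiling tool.

```python
def percentile(sorted_values, q):
    """Percentile (q in [0, 100]) with linear interpolation between ranks."""
    pos = (len(sorted_values) - 1) * q / 100
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

# Value -> count for 동수, taken from the value counts in this report.
counts = {1: 16, 2: 5, 3: 4, 4: 4, 5: 2, 8: 1, 9: 1, 14: 1, 16: 1}
data = sorted(v for v, c in counts.items() for _ in range(c))

q1, med, q3 = (percentile(data, q) for q in (25, 50, 75))
p95 = percentile(data, 95)
print(q1, med, q3, p95, q3 - q1)
```

With 35 observations, the 95-th percentile sits at position 32.3 (zero-based) in the sorted data, between the values 9 and 14, which is why it is not itself an observed value.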

Descriptive statistics

Standard deviation: 3.5602851
Coefficient of variation (CV): 1.1226124
Kurtosis: 6.0905309
Mean: 3.1714286
Median Absolute Deviation (MAD): 1
Skewness: 2.4429223
Sum: 111
Variance: 12.67563
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)

Common values

Value  Count  Frequency (%)
1  16  45.7%
2  5  14.3%
3  4  11.4%
4  4  11.4%
5  2  5.7%
9  1  2.9%
14  1  2.9%
8  1  2.9%
16  1  2.9%

Minimum 10 values

Value  Count  Frequency (%)
1  16  45.7%
2  5  14.3%
3  4  11.4%
4  4  11.4%
5  2  5.7%
8  1  2.9%
9  1  2.9%
14  1  2.9%
16  1  2.9%

Maximum 10 values

Value  Count  Frequency (%)
16  1  2.9%
14  1  2.9%
9  1  2.9%
8  1  2.9%
5  2  5.7%
4  4  11.4%
3  4  11.4%
2  5  14.3%
1  16  45.7%

용 도
Categorical

Distinct: 6
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 2
Unique (%): 5.7%

Sample

1st row: 공동주택
2nd row: 노인요양
3rd row: 숙박시설
4th row: 공동주택
5th row: 공동주택

Common Values

Value  Count  Frequency (%)
공동주택  18  51.4%
숙박시설  8  22.9%
판매시설  5  14.3%
기타시설  2  5.7%
노인요양  1  2.9%
의료시설  1  2.9%
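The table above also explains the "Distinct" and "Unique" figures for 용 도. As a small sketch (the variable names are illustrative), "distinct" counts the categories that occur at all, "unique" counts those that occur exactly once, and each frequency is the count divided by the 35 observations:

```python
from collections import Counter

# Counts per category for 용 도, as listed in the Common Values table.
counts = Counter({"공동주택": 18, "숙박시설": 8, "판매시설": 5,
                  "기타시설": 2, "노인요양": 1, "의료시설": 1})
n = sum(counts.values())

distinct = len(counts)                              # categories that occur at all
unique = sum(1 for c in counts.values() if c == 1)  # categories seen exactly once
shares = {k: round(100 * c / n, 1) for k, c in counts.items()}

print(n, distinct, unique, shares)
```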

Length

Histogram of lengths of the category

공정율(퍼센트)
Real number (ℝ)

Distinct: 13
Distinct (%): 37.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.685714
Minimum: 1
Maximum: 97
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.8
Q1: 22.5
median: 30
Q3: 70
95-th percentile: 86.5
Maximum: 97
Range: 96
Interquartile range (IQR): 47.5

Descriptive statistics

Standard deviation: 29.085848
Coefficient of variation (CV): 0.68139537
Kurtosis: -1.1836079
Mean: 42.685714
Median Absolute Deviation (MAD): 20
Skewness: 0.22597455
Sum: 1494
Variance: 845.98655
Monotonicity: Not monotonic
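These summary figures can be recomputed from the frequency tables alone, without the raw rows. The sketch below works on grouped data (value and count pairs) and assumes the sample (n - 1) variance. The three values that the report folds into "Other values (3)" are taken to be 10, 90, and 97, read from the lowest-value and highest-value tables for this variable.

```python
import math

# Value -> count for 공정율(퍼센트), assembled from this report's value tables.
counts = {1: 2, 5: 4, 10: 1, 20: 2, 25: 3, 30: 6, 50: 6,
          65: 1, 70: 3, 80: 4, 85: 1, 90: 1, 97: 1}

n = sum(counts.values())
total = sum(v * c for v, c in counts.items())
mean = total / n
# Sample variance (n - 1 denominator), weighting each squared deviation
# by how often the value occurs.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)

print(n, total, round(mean, 6), round(variance, 5), round(std, 6))
```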
Histogram with fixed size bins (bins=13)

Common values

Value  Count  Frequency (%)
30  6  17.1%
50  6  17.1%
5  4  11.4%
80  4  11.4%
70  3  8.6%
25  3  8.6%
20  2  5.7%
1  2  5.7%
65  1  2.9%
85  1  2.9%
Other values (3)  3  8.6%

Minimum 10 values

Value  Count  Frequency (%)
1  2  5.7%
5  4  11.4%
10  1  2.9%
20  2  5.7%
25  3  8.6%
30  6  17.1%
50  6  17.1%
65  1  2.9%
70  3  8.6%
80  4  11.4%

Maximum 10 values

Value  Count  Frequency (%)
97  1  2.9%
90  1  2.9%
85  1  2.9%
80  4  11.4%
70  3  8.6%
65  1  2.9%
50  6  17.1%
30  6  17.1%
25  3  8.6%
20  2  5.7%

중단기간(개월)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 30
Distinct (%): 85.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 242.05714
Minimum: 61
Maximum: 378
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 61
5-th percentile: 97.4
Q1: 186
median: 265
Q3: 297
95-th percentile: 356.6
Maximum: 378
Range: 317
Interquartile range (IQR): 111

Descriptive statistics

Standard deviation: 79.807468
Coefficient of variation (CV): 0.32970507
Kurtosis: -0.28818119
Mean: 242.05714
Median Absolute Deviation (MAD): 57
Skewness: -0.44546954
Sum: 8472
Variance: 6369.2319
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)

Common values

Value  Count  Frequency (%)
297  4  11.4%
194  2  5.7%
182  2  5.7%
350  1  2.9%
211  1  2.9%
157  1  2.9%
61  1  2.9%
227  1  2.9%
223  1  2.9%
187  1  2.9%
Other values (20)  20  57.1%

Minimum 10 values

Value  Count  Frequency (%)
61  1  2.9%
68  1  2.9%
110  1  2.9%
153  1  2.9%
157  1  2.9%
175  1  2.9%
182  2  5.7%
185  1  2.9%
187  1  2.9%
194  2  5.7%

Maximum 10 values

Value  Count  Frequency (%)
378  1  2.9%
372  1  2.9%
350  1  2.9%
324  1  2.9%
322  1  2.9%
321  1  2.9%
307  1  2.9%
303  1  2.9%
297  4  11.4%
291  1  2.9%

Interactions

[Figures: 16 pairwise interaction plots; images not preserved in this text version.]

Correlations

Variable  연번  위 치  허가년도  구 조  동수  용 도  공정율(퍼센트)  중단기간(개월)
연번  1.000  1.000  0.885  0.000  0.000  0.522  0.000  0.558
위 치  1.000  1.000  1.000  1.000  1.000  1.000  1.000  1.000
허가년도  0.885  1.000  1.000  1.000  0.851  0.977  0.790  0.972
구 조  0.000  1.000  1.000  1.000  0.000  0.000  0.289  0.888
동수  0.000  1.000  0.851  0.000  1.000  0.328  0.000  0.000
용 도  0.522  1.000  0.977  0.000  0.328  1.000  0.504  0.366
공정율(퍼센트)  0.000  1.000  0.790  0.289  0.000  0.504  1.000  0.127
중단기간(개월)  0.558  1.000  0.972  0.888  0.000  0.366  0.127  1.000

Variable  용 도  구 조
용 도  1.000  0.000
구 조  0.000  1.000

Variable  연번  동수  공정율(퍼센트)  중단기간(개월)  구 조  용 도
연번  1.000  0.128  -0.115  -0.388  0.000  0.311
동수  0.128  1.000  0.081  -0.065  0.000  0.184
공정율(퍼센트)  -0.115  0.081  1.000  -0.153  0.244  0.249
중단기간(개월)  -0.388  -0.065  -0.153  1.000  0.627  0.164
구 조  0.000  0.000  0.244  0.627  1.000  0.000
용 도  0.311  0.184  0.249  0.164  0.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  연번  위 치  허가년도  구 조  동수  용 도  공정율(퍼센트)  중단기간(개월)
0  1  천안시 동남구 목천읍 신계리 125-10외 10  1997-09-01  철근콘크리트  9  공동주택  30  350
1  2  공주시 계룡면 구왕리 918-8외 3  1998-04-01  철근콘크리트  1  노인요양  70  291
2  3  공주시 계룡면 중장리 24-1외 5  1987-09-01  철근콘크리트  1  숙박시설  65  372
3  4  보령시 남포면 봉덕리 28-1외 2  1999-08-01  철근콘크리트  3  공동주택  30  267
4  5  보령시 남포면 삼현리 700-4외 1  1993-11-01  철근콘크리트  14  공동주택  50  307
5  6  아산시 모종동 558-8외 1  2006-03-01  철근콘크리트  1  판매시설  20  182
6  7  아산시 배방읍 세출리 341  2007-01-01  철근콘크리트  1  판매시설  5  175
7  8  아산시 온천동 84-1외 7  1990-06-01  철근콘크리트  1  판매시설  1  378
8  9  아산시 용화동 423  2003-12-01  철근콘크리트  3  공동주택  85  185
9  10  아산시 선장면 궁평리 산54-1외 2  1992-08-01  철근콘크리트  2  기타시설  25  282

Last rows

Index  연번  위 치  허가년도  구 조  동수  용 도  공정율(퍼센트)  중단기간(개월)
25  26  청양군 장평면 적곡리 596  1994-10-01  철근콘크리트  1  숙박시설  70  322
26  27  홍성군 광천읍 상정리 12외 4  1995-04-01  철근콘크리트  4  공동주택  50  68
27  28  홍성군 홍성읍 남장리 344-4  2007-08-01  철근콘크리트  3  공동주택  30  187
28  29  예산군 덕산면 사동리 116-4외 2  2004-03-01  철근콘크리트  4  공동주택  5  223
29  30  예산군 신암면 신종리 210-5외 19  1998-11-01  철근콘크리트  2  공동주택  5  227
30  31  예산군 예산읍 창소리 58외 9  2005-05-01  철근콘크리트  4  공동주택  30  194
31  32  예산군 응봉면 주령리 122-5외 3  2001-07-01  철근콘크리트  2  공동주택  1  194
32  33  태안군 남면 몽산리 465-21외 9  2008-08-01  철근콘크리트  16  기타시설  80  182
33  34  태안군 고남면 고남리 81-6외 2  2017-11-01  철근콘크리트  1  의료시설  80  61
34  35  태안군 삭선리 420-5외 1  2010-11-01  철근콘크리트  5  공동주택  25  157