Overview

Dataset statistics

Number of variables: 21
Number of observations: 64
Missing cells: 155
Missing cells (%): 11.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.4 KiB
Average record size in memory: 182.1 B
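As a quick consistency check, the headline figures can be re-derived from one another. The sketch below uses only numbers that appear in this report (the per-column missing counts come from the Alerts section); the variable names are illustrative and not part of the profiling tool.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 21
n_observations = 64
missing_cells = 155
avg_record_bytes = 182.1

# Per-column missing counts as listed in the Alerts section.
missing_by_column = {"증축연면적": 48, "사용승인일": 30, "부속용도": 13, "기타구조": 64}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"sum of per-column missing counts: {sum(missing_by_column.values())}")
print(f"approximate size in memory: {total_kib:.1f} KiB")
```

The four columns flagged as having missing values account for all 155 missing cells.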

Variable types

Categorical: 7
Text: 3
Numeric: 7
DateTime: 3
Unsupported: 1

Dataset

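Description: Construction commencement reports for buildings in Yeonsu-gu, Incheon Metropolitan City (building site location, permit date, site area (㎡), building area (㎡), total floor area (㎡), use approval date, main use, ancillary use)
Author: 인천광역시 연수구 (Yeonsu-gu, Incheon Metropolitan City)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15029299&srcSe=7661IVAWM27C61E190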

Alerts

세대수 is highly imbalanced (74.5%) [Imbalance]
호수 is highly imbalanced (59.9%) [Imbalance]
가구수 is highly imbalanced (58.6%) [Imbalance]
증축연면적 has 48 (75.0%) missing values [Missing]
사용승인일 has 30 (46.9%) missing values [Missing]
부속용도 has 13 (20.3%) missing values [Missing]
기타구조 has 64 (100.0%) missing values [Missing]
허가번호 has unique values [Unique]
대지위치 has unique values [Unique]
대지면적 has unique values [Unique]
건축면적 has unique values [Unique]
연면적 has unique values [Unique]
기타구조 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
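
The report does not state how the imbalance percentages are computed. One formula that reproduces all three figures is one minus the normalised Shannon entropy of the category counts. The sketch below assumes that definition; `imbalance_score` is a hypothetical helper name, and the counts are copied from the Common Values tables of the three flagged variables.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Category counts as reported for each variable (64 rows in total).
category_counts = {
    "세대수": [58, 2, 1, 1, 1, 1],
    "호수": [53, 4, 2, 2, 2, 1],
    "가구수": [55, 8, 1],
}

for name, counts in category_counts.items():
    print(f"{name}: {100 * imbalance_score(counts):.1f}%")
```

Under this assumption the script prints 74.5%, 59.9% and 58.6%, matching the alerts. A score of 0% would mean perfectly balanced categories, and a score close to 100% means almost all rows fall into a single category (here the literal value `<NA>`).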

Reproduction

Analysis started: 2024-03-18 05:25:57.524933
Analysis finished: 2024-03-18 05:25:57.751567
Duration: 0.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

건축구분
Categorical

Distinct: 3
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
신축: 35
증축: 17
대수선: 12

Length

Max length: 3
Median length: 2
Mean length: 2.1875
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신축
2nd row: 신축
3rd row: 신축
4th row: 신축
5th row: 증축

Common Values

Value | Count | Frequency (%)
신축 | 35 | 54.7%
증축 | 17 | 26.6%
대수선 | 12 | 18.8%
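
The frequency percentages and the mean label length for 건축구분 follow directly from the three category counts. A minimal sketch, using only the counts listed in the Common Values table above (the variable names are illustrative):

```python
from collections import Counter

# Category counts for 건축구분 as listed in the Common Values table.
counts = Counter({"신축": 35, "증축": 17, "대수선": 12})

total = sum(counts.values())  # number of observations

# Relative frequency of each category, in percent.
frequency_pct = {value: 100 * n / total for value, n in counts.items()}

# Mean label length in characters, weighted by how often each label occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

for value, pct in frequency_pct.items():
    print(f"{value}: {pct:.1f}%")
print(f"mean length: {mean_length}")
```

The weighted mean works out to (35 × 2 + 17 × 2 + 12 × 3) / 64 = 2.1875, the Mean length reported for this variable.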

Length

Histogram of lengths of the category


허가번호
Text

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B

Length

Max length: 17
Median length: 16
Mean length: 15.65625
Min length: 15

Characters and Unicode

Total characters: 1002
Distinct characters: 27
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
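
The category breakdowns in this section are counts of characters grouped by Unicode property. As a minimal sketch (using Python's standard `unicodedata` module, which is not necessarily what the profiler uses internally), the general-category counts for a single 허가번호 value can be obtained as follows; `category_counts` is an illustrative helper name.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "2021-건축과-신축허가-15"  # first sample value of 허가번호
counts = category_counts(sample)

print(len(sample), dict(counts))
```

`Lo`, `Nd` and `Pd` are the short codes for Other Letter, Decimal Number and Dash Punctuation, the three categories reported for this variable.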

Unique

Unique: 64
Unique (%): 100.0%

Sample

1st row: 2021-건축과-신축허가-15
2nd row: 2021-건축과-신축허가-14
3rd row: 2021-건축과-신축허가-13
4th row: 2021-건축과-신축허가-12
5th row: 2021-건축과-증축허가-2
ValueCountFrequency (%)
2021-건축과-신축허가-15 1
 
1.6%
2021-건축과-신축허가-14 1
 
1.6%
2020-건축과-대수선허가-3 1
 
1.6%
2020-건축과-증축허가-5 1
 
1.6%
2020-건축과-대수선허가-5 1
 
1.6%
2020-건축과-신축허가-30 1
 
1.6%
2020-건축과-신축허가-29 1
 
1.6%
2020-건축과-신축허가-28 1
 
1.6%
2020-건축과-신축허가-27 1
 
1.6%
2020-건축과-신축허가-24 1
 
1.6%
Other values (54) 54
84.4%

Most occurring characters

ValueCountFrequency (%)
- 192
19.2%
2 137
13.7%
117
11.7%
0 106
10.6%
70
 
7.0%
64
 
6.4%
51
 
5.1%
51
 
5.1%
1 47
 
4.7%
42
 
4.2%
Other values (17) 125
12.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 465
46.4%
Decimal Number 345
34.4%
Dash Punctuation 192
19.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
25.2%
70
15.1%
64
13.8%
51
11.0%
51
11.0%
42
 
9.0%
12
 
2.6%
11
 
2.4%
11
 
2.4%
11
 
2.4%
Other values (6) 25
 
5.4%
Decimal Number
ValueCountFrequency (%)
2 137
39.7%
0 106
30.7%
1 47
 
13.6%
3 17
 
4.9%
7 8
 
2.3%
9 8
 
2.3%
8 7
 
2.0%
5 6
 
1.7%
4 5
 
1.4%
6 4
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 192
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 537
53.6%
Hangul 465
46.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
25.2%
70
15.1%
64
13.8%
51
11.0%
51
11.0%
42
 
9.0%
12
 
2.6%
11
 
2.4%
11
 
2.4%
11
 
2.4%
Other values (6) 25
 
5.4%
Common
ValueCountFrequency (%)
- 192
35.8%
2 137
25.5%
0 106
19.7%
1 47
 
8.8%
3 17
 
3.2%
7 8
 
1.5%
9 8
 
1.5%
8 7
 
1.3%
5 6
 
1.1%
4 5
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 537
53.6%
Hangul 465
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 192
35.8%
2 137
25.5%
0 106
19.7%
1 47
 
8.8%
3 17
 
3.2%
7 8
 
1.5%
9 8
 
1.5%
8 7
 
1.3%
5 6
 
1.1%
4 5
 
0.9%
Hangul
ValueCountFrequency (%)
117
25.2%
70
15.1%
64
13.8%
51
11.0%
51
11.0%
42
 
9.0%
12
 
2.6%
11
 
2.4%
11
 
2.4%
11
 
2.4%
Other values (6) 25
 
5.4%

대지위치
Text

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B

Length

Max length: 35
Median length: 27
Mean length: 20.6875
Min length: 17

Characters and Unicode

Total characters: 1324
Distinct characters: 41
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 64
Unique (%): 100.0%

Sample

1st row: 인천광역시 연수구 동춘동 동춘1도시개발사업구역 26블럭 13로트
2nd row: 인천광역시 연수구 동춘동 810-1
3rd row: 인천광역시 연수구 동춘동 동춘1도시개발사업구역 25블럭 14루트
4th row: 인천광역시 연수구 옥련동 573-3
5th row: 인천광역시 연수구 옥련동 405-21
ValueCountFrequency (%)
인천광역시 64
22.9%
연수구 64
22.9%
동춘동 22
 
7.9%
외1필지 16
 
5.7%
선학동 13
 
4.6%
옥련동 12
 
4.3%
연수동 11
 
3.9%
청학동 6
 
2.1%
동춘1도시개발사업구역 2
 
0.7%
527-15 1
 
0.4%
Other values (69) 69
24.6%

Most occurring characters

ValueCountFrequency (%)
216
16.3%
89
 
6.7%
75
 
5.7%
75
 
5.7%
67
 
5.1%
67
 
5.1%
66
 
5.0%
64
 
4.8%
64
 
4.8%
64
 
4.8%
Other values (31) 477
36.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 788
59.5%
Decimal Number 270
 
20.4%
Space Separator 216
 
16.3%
Dash Punctuation 50
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
11.3%
75
9.5%
75
9.5%
67
8.5%
67
8.5%
66
8.4%
64
8.1%
64
8.1%
64
8.1%
25
 
3.2%
Other values (19) 132
16.8%
Decimal Number
ValueCountFrequency (%)
1 60
22.2%
5 31
11.5%
4 26
9.6%
6 25
9.3%
3 25
9.3%
2 25
9.3%
9 22
 
8.1%
7 21
 
7.8%
8 21
 
7.8%
0 14
 
5.2%
Space Separator
ValueCountFrequency (%)
216
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 788
59.5%
Common 536
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
11.3%
75
9.5%
75
9.5%
67
8.5%
67
8.5%
66
8.4%
64
8.1%
64
8.1%
64
8.1%
25
 
3.2%
Other values (19) 132
16.8%
Common
ValueCountFrequency (%)
216
40.3%
1 60
 
11.2%
- 50
 
9.3%
5 31
 
5.8%
4 26
 
4.9%
6 25
 
4.7%
3 25
 
4.7%
2 25
 
4.7%
9 22
 
4.1%
7 21
 
3.9%
Other values (2) 35
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 788
59.5%
ASCII 536
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
216
40.3%
1 60
 
11.2%
- 50
 
9.3%
5 31
 
5.8%
4 26
 
4.9%
6 25
 
4.7%
3 25
 
4.7%
2 25
 
4.7%
9 22
 
4.1%
7 21
 
3.9%
Other values (2) 35
 
6.5%
Hangul
ValueCountFrequency (%)
89
11.3%
75
9.5%
75
9.5%
67
8.5%
67
8.5%
66
8.4%
64
8.1%
64
8.1%
64
8.1%
25
 
3.2%
Other values (19) 132
16.8%

위도
Real number (ℝ)

Distinct: 62
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.419924
Minimum: 37.39575
Maximum: 37.435586
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 37.39575
5-th percentile: 37.404676
Q1: 37.414364
Median: 37.42078
Q3: 37.427248
95-th percentile: 37.433054
Maximum: 37.435586
Range: 0.039836
Interquartile range (IQR): 0.01288425

Descriptive statistics

Standard deviation: 0.0096714341
Coefficient of variation (CV): 0.0002584568
Kurtosis: -0.38292086
Mean: 37.419924
Median Absolute Deviation (MAD): 0.006464
Skewness: -0.40086157
Sum: 2394.8752
Variance: 9.3536637 × 10^-5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.404676 3
 
4.7%
37.4152 1
 
1.6%
37.430257 1
 
1.6%
37.430689 1
 
1.6%
37.408724 1
 
1.6%
37.421259 1
 
1.6%
37.404991 1
 
1.6%
37.417038 1
 
1.6%
37.410386 1
 
1.6%
37.415952 1
 
1.6%
Other values (52) 52
81.2%
ValueCountFrequency (%)
37.39575 1
 
1.6%
37.395903 1
 
1.6%
37.404676 3
4.7%
37.404991 1
 
1.6%
37.407803 1
 
1.6%
37.4079 1
 
1.6%
37.408315 1
 
1.6%
37.408418 1
 
1.6%
37.408724 1
 
1.6%
37.410386 1
 
1.6%
ValueCountFrequency (%)
37.435586 1
1.6%
37.435554 1
1.6%
37.434149 1
1.6%
37.433093 1
1.6%
37.432836 1
1.6%
37.432189 1
1.6%
37.432078 1
1.6%
37.43205 1
1.6%
37.431811 1
1.6%
37.431732 1
1.6%

경도
Real number (ℝ)

Distinct: 62
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.67171
Minimum: 126.63597
Maximum: 126.70126
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 126.63597
5-th percentile: 126.64674
Q1: 126.65702
Median: 126.67087
Q3: 126.68231
95-th percentile: 126.69963
Maximum: 126.70126
Range: 0.065284
Interquartile range (IQR): 0.025289

Descriptive statistics

Standard deviation: 0.01754081
Coefficient of variation (CV): 0.00013847456
Kurtosis: -1.0308337
Mean: 126.67171
Median Absolute Deviation (MAD): 0.013294
Skewness: 0.20708024
Sum: 8106.9895
Variance: 0.00030768001
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.680942 3
 
4.7%
126.657615 1
 
1.6%
126.697278 1
 
1.6%
126.663253 1
 
1.6%
126.65565 1
 
1.6%
126.653434 1
 
1.6%
126.657541 1
 
1.6%
126.656535 1
 
1.6%
126.68108 1
 
1.6%
126.657186 1
 
1.6%
Other values (52) 52
81.2%
ValueCountFrequency (%)
126.635971 1
1.6%
126.644311 1
1.6%
126.644834 1
1.6%
126.646079 1
1.6%
126.650478 1
1.6%
126.651659 1
1.6%
126.652089 1
1.6%
126.652127 1
1.6%
126.653434 1
1.6%
126.654771 1
1.6%
ValueCountFrequency (%)
126.701255 1
1.6%
126.700518 1
1.6%
126.700134 1
1.6%
126.699676 1
1.6%
126.699358 1
1.6%
126.699168 1
1.6%
126.699149 1
1.6%
126.699115 1
1.6%
126.697368 1
1.6%
126.697278 1
1.6%

대지면적
Real number (ℝ)

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6546.0791
Minimum: 134
Maximum: 242484.7
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 134
5-th percentile: 212.175
Q1: 331.6
Median: 609.3
Q3: 1231.225
95-th percentile: 13385.34
Maximum: 242484.7
Range: 242350.7
Interquartile range (IQR): 899.625

Descriptive statistics

Standard deviation: 31074.438
Coefficient of variation (CV): 4.7470307
Kurtosis: 54.974531
Mean: 6546.0791
Median Absolute Deviation (MAD): 361.75
Skewness: 7.2352425
Sum: 418949.06
Variance: 9.6562071 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
325.6 1
 
1.6%
560.3 1
 
1.6%
535.0 1
 
1.6%
295.3 1
 
1.6%
1075.4 1
 
1.6%
904.0 1
 
1.6%
668.0 1
 
1.6%
407.0 1
 
1.6%
617.8 1
 
1.6%
12278.3 1
 
1.6%
Other values (54) 54
84.4%
ValueCountFrequency (%)
134.0 1
1.6%
209.8 1
1.6%
210.6 1
1.6%
211.5 1
1.6%
216.0 1
1.6%
228.26 1
1.6%
230.1 1
1.6%
231.9 1
1.6%
242.5 1
1.6%
247.0 1
1.6%
ValueCountFrequency (%)
242484.7 1
1.6%
50097.2 1
1.6%
41908.8 1
1.6%
13580.7 1
1.6%
12278.3 1
1.6%
9377.0 1
1.6%
4552.0 1
1.6%
4471.0 1
1.6%
3633.0 1
1.6%
2017.7 1
1.6%
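
Several of the statistics reported for 대지면적 are simple functions of the others, which makes them easy to sanity-check. The sketch below recomputes the range, IQR, coefficient of variation and variance from the figures above. The outlier fence at the end is a common rule of thumb (Tukey's 1.5 × IQR), added here as an assumption for illustration; the report itself does not apply it.

```python
# Summary figures for 대지면적 copied from the statistics above.
minimum, maximum = 134.0, 242484.7
q1, median, q3 = 331.6, 609.3, 1231.225
mean, std = 6546.0791, 31074.438

value_range = maximum - minimum  # reported as Range
iqr = q3 - q1                    # reported as Interquartile range (IQR)
cv = std / mean                  # reported as Coefficient of variation (CV)
variance = std ** 2              # reported as Variance

# Tukey's upper fence: values above it are often treated as outliers.
upper_fence = q3 + 1.5 * iqr

print(f"range={value_range:.1f} iqr={iqr:.3f} cv={cv:.4f} variance={variance:.4e}")
print(f"upper fence={upper_fence:.2f} mean/median={mean / median:.1f}")
```

The mean is roughly ten times the median, and nine of the ten largest values listed above exceed the fence, which is consistent with the strong positive skewness (7.2) reported for this variable.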

건축면적
Real number (ℝ)

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 752.85305
Minimum: 24
Maximum: 21233.98
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 24
5-th percentile: 82.39275
Q1: 146.7525
Median: 238.705
Q3: 544.9275
95-th percentile: 1377.9025
Maximum: 21233.98
Range: 21209.98
Interquartile range (IQR): 398.175

Descriptive statistics

Standard deviation: 2657.6605
Coefficient of variation (CV): 3.5301185
Kurtosis: 58.456727
Mean: 752.85305
Median Absolute Deviation (MAD): 115.505
Skewness: 7.5136418
Sum: 48182.595
Variance: 7063159.3
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
147.77 1
 
1.6%
299.32 1
 
1.6%
320.46 1
 
1.6%
148.0 1
 
1.6%
642.2 1
 
1.6%
189.8 1
 
1.6%
392.26 1
 
1.6%
121.04 1
 
1.6%
233.78 1
 
1.6%
3491.4 1
 
1.6%
Other values (54) 54
84.4%
ValueCountFrequency (%)
24.0 1
1.6%
77.0 1
1.6%
79.14 1
1.6%
81.24 1
1.6%
88.925 1
1.6%
95.9 1
1.6%
120.75 1
1.6%
121.04 1
1.6%
122.08 1
1.6%
124.32 1
1.6%
ValueCountFrequency (%)
21233.98 1
1.6%
3491.4 1
1.6%
2349.5 1
1.6%
1401.46 1
1.6%
1244.41 1
1.6%
1084.18 1
1.6%
956.19 1
1.6%
883.21 1
1.6%
774.12 1
1.6%
722.84 1
1.6%

연면적
Real number (ℝ)

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2190.7319
Minimum: 24
Maximum: 34421.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 24
5-th percentile: 140.3415
Q1: 431.425
Median: 732.36
Q3: 1660.8675
95-th percentile: 7276.813
Maximum: 34421.5
Range: 34397.5
Interquartile range (IQR): 1229.4425

Descriptive statistics

Standard deviation: 5103.7893
Coefficient of variation (CV): 2.3297188
Kurtosis: 28.21624
Mean: 2190.7319
Median Absolute Deviation (MAD): 450.865
Skewness: 5.0422997
Sum: 140206.84
Variance: 26048665
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
268.52 1
 
1.6%
1073.49 1
 
1.6%
1331.59 1
 
1.6%
148.0 1
 
1.6%
1974.05 1
 
1.6%
639.56 1
 
1.6%
1989.56 1
 
1.6%
324.72 1
 
1.6%
294.47 1
 
1.6%
8471.98 1
 
1.6%
Other values (54) 54
84.4%
ValueCountFrequency (%)
24.0 1
1.6%
74.54 1
1.6%
77.0 1
1.6%
138.99 1
1.6%
148.0 1
1.6%
167.4 1
1.6%
191.8 1
1.6%
197.82 1
1.6%
259.85 1
1.6%
268.52 1
1.6%
ValueCountFrequency (%)
34421.5 1
1.6%
21410.4 1
1.6%
8471.98 1
1.6%
7318.81 1
1.6%
7038.83 1
1.6%
5152.11 1
1.6%
4968.77 1
1.6%
3458.0 1
1.6%
3036.02 1
1.6%
2587.52 1
1.6%

증축연면적
Real number (ℝ)

MISSING 

Distinct: 16
Distinct (%): 100.0%
Missing: 48
Missing (%): 75.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 455.86625
Minimum: 2.52
Maximum: 3263.94
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 2.52
5-th percentile: 4.8375
Q1: 36.9175
Median: 183.755
Q3: 526.895
95-th percentile: 1481.625
Maximum: 3263.94
Range: 3261.42
Interquartile range (IQR): 489.9775

Descriptive statistics

Standard deviation: 798.01115
Coefficient of variation (CV): 1.7505379
Kurtosis: 11.632044
Mean: 455.86625
Median Absolute Deviation (MAD): 175.215
Skewness: 3.2521279
Sum: 7293.86
Variance: 636821.8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
497.0 1
 
1.6%
2.52 1
 
1.6%
11.47 1
 
1.6%
5.61 1
 
1.6%
19.24 1
 
1.6%
402.6 1
 
1.6%
112.56 1
 
1.6%
42.81 1
 
1.6%
169.51 1
 
1.6%
148.93 1
 
1.6%
Other values (6) 6
 
9.4%
(Missing) 48
75.0%
ValueCountFrequency (%)
2.52 1
1.6%
5.61 1
1.6%
11.47 1
1.6%
19.24 1
1.6%
42.81 1
1.6%
112.56 1
1.6%
148.93 1
1.6%
169.51 1
1.6%
198.0 1
1.6%
210.0 1
1.6%
ValueCountFrequency (%)
3263.94 1
1.6%
887.52 1
1.6%
705.57 1
1.6%
616.58 1
1.6%
497.0 1
1.6%
402.6 1
1.6%
210.0 1
1.6%
198.0 1
1.6%
169.51 1
1.6%
148.93 1
1.6%

허가일
Date

Distinct: 61
Distinct (%): 95.3%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
Minimum: 2017-12-19 00:00:00
Maximum: 2021-08-27 00:00:00
Histogram with fixed size bins (bins=50)

착공일
Date

Distinct: 59
Distinct (%): 92.2%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
Minimum: 2020-07-20 00:00:00
Maximum: 2021-09-16 00:00:00
Histogram with fixed size bins (bins=50)

사용승인일
Date

MISSING 

Distinct: 31
Distinct (%): 91.2%
Missing: 30
Missing (%): 46.9%
Memory size: 644.0 B
Minimum: 2020-12-11 00:00:00
Maximum: 2021-09-17 00:00:00
Histogram with fixed size bins (bins=31)

주용도
Categorical

Distinct: 13
Distinct (%): 20.3%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
제2종근린생활시설: 20
단독주택: 13
제1종근린생활시설: 11
노유자시설: 3
관광휴게시설: 3
Other values (8): 14

Length

Max length: 10
Median length: 9
Mean length: 6.71875
Min length: 4

Unique

Unique: 3
Unique (%): 4.7%

Sample

1st row: 단독주택
2nd row: 제1종근린생활시설
3rd row: 단독주택
4th row: 단독주택
5th row: 노유자시설

Common Values

Value | Count | Frequency (%)
제2종근린생활시설 | 20 | 31.2%
단독주택 | 13 | 20.3%
제1종근린생활시설 | 11 | 17.2%
노유자시설 | 3 | 4.7%
관광휴게시설 | 3 | 4.7%
공동주택 | 3 | 4.7%
교육연구시설 | 2 | 3.1%
의료시설 | 2 | 3.1%
종교시설 | 2 | 3.1%
업무시설 | 2 | 3.1%
Other values (3) | 3 | 4.7%

Length

Histogram of lengths of the category

부속용도
Text

MISSING 

Distinct: 35
Distinct (%): 68.6%
Missing: 13
Missing (%): 20.3%
Memory size: 644.0 B

Length

Max length: 31
Median length: 17
Mean length: 7.9411765
Min length: 2

Characters and Unicode

Total characters: 405
Distinct characters: 75
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 27
Unique (%): 52.9%

Sample

1st row: 휴게음식점
2nd row: 단독주택
3rd row: 노인복지시설
4th row: 다중주택 및 사무소
5th row: 제1종근린생활시설
ValueCountFrequency (%)
사무소 11
 
16.9%
다중주택 6
 
9.2%
4
 
6.2%
휴게음식점 4
 
6.2%
단독주택 3
 
4.6%
소매점 2
 
3.1%
공공업무시설 2
 
3.1%
노인복지시설 2
 
3.1%
근린생활시설 2
 
3.1%
의료시설,근린생활시설 1
 
1.5%
Other values (28) 28
43.1%

Most occurring characters

ValueCountFrequency (%)
26
 
6.4%
23
 
5.7%
20
 
4.9%
18
 
4.4%
17
 
4.2%
16
 
4.0%
, 15
 
3.7%
15
 
3.7%
14
 
3.5%
13
 
3.2%
Other values (65) 228
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 363
89.6%
Other Punctuation 17
 
4.2%
Space Separator 14
 
3.5%
Open Punctuation 4
 
1.0%
Close Punctuation 4
 
1.0%
Decimal Number 3
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
7.2%
23
 
6.3%
20
 
5.5%
18
 
5.0%
17
 
4.7%
16
 
4.4%
15
 
4.1%
13
 
3.6%
12
 
3.3%
10
 
2.8%
Other values (59) 193
53.2%
Other Punctuation
ValueCountFrequency (%)
, 15
88.2%
/ 2
 
11.8%
Space Separator
ValueCountFrequency (%)
14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Decimal Number
ValueCountFrequency (%)
1 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 363
89.6%
Common 42
 
10.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
7.2%
23
 
6.3%
20
 
5.5%
18
 
5.0%
17
 
4.7%
16
 
4.4%
15
 
4.1%
13
 
3.6%
12
 
3.3%
10
 
2.8%
Other values (59) 193
53.2%
Common
ValueCountFrequency (%)
, 15
35.7%
14
33.3%
( 4
 
9.5%
) 4
 
9.5%
1 3
 
7.1%
/ 2
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 363
89.6%
ASCII 42
 
10.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
7.2%
23
 
6.3%
20
 
5.5%
18
 
5.0%
17
 
4.7%
16
 
4.4%
15
 
4.1%
13
 
3.6%
12
 
3.3%
10
 
2.8%
Other values (59) 193
53.2%
ASCII
ValueCountFrequency (%)
, 15
35.7%
14
33.3%
( 4
 
9.5%
) 4
 
9.5%
1 3
 
7.1%
/ 2
 
4.8%

주구조
Categorical

Distinct: 6
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
철근콘크리트구조: 45
일반철골구조: 12
일반목구조: 3
<NA>: 2
석구조: 1

Length

Max length: 8
Median length: 8
Mean length: 7.21875
Min length: 3

Unique

Unique: 2
Unique (%): 3.1%

Sample

1st row: 일반목구조
2nd row: 철근콘크리트구조
3rd row: 일반목구조
4th row: 철근콘크리트구조
5th row: 철근콘크리트구조

Common Values

Value | Count | Frequency (%)
철근콘크리트구조 | 45 | 70.3%
일반철골구조 | 12 | 18.8%
일반목구조 | 3 | 4.7%
<NA> | 2 | 3.1%
석구조 | 1 | 1.6%
기타구조 | 1 | 1.6%

Length

Histogram of lengths of the category


기타구조
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 64
Missing (%): 100.0%
Memory size: 708.0 B

지상층수
Real number (ℝ)

Distinct: 10
Distinct (%): 15.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.484375
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 708.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 3
Q3: 4
95-th percentile: 6.85
Maximum: 12
Range: 11
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.1005833
Coefficient of variation (CV): 0.602858
Kurtosis: 4.5807164
Mean: 3.484375
Median Absolute Deviation (MAD): 1
Skewness: 1.8204248
Sum: 223
Variance: 4.4124504
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2 18
28.1%
3 15
23.4%
4 10
15.6%
5 8
12.5%
1 6
 
9.4%
6 3
 
4.7%
10 1
 
1.6%
7 1
 
1.6%
12 1
 
1.6%
9 1
 
1.6%
ValueCountFrequency (%)
1 6
 
9.4%
2 18
28.1%
3 15
23.4%
4 10
15.6%
5 8
12.5%
6 3
 
4.7%
7 1
 
1.6%
9 1
 
1.6%
10 1
 
1.6%
12 1
 
1.6%
ValueCountFrequency (%)
12 1
 
1.6%
10 1
 
1.6%
9 1
 
1.6%
7 1
 
1.6%
6 3
 
4.7%
5 8
12.5%
4 10
15.6%
3 15
23.4%
2 18
28.1%
1 6
 
9.4%

지하층수
Categorical

Distinct: 6
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
0: 29
1: 17
<NA>: 11
2: 4
3: 2

Length

Max length: 4
Median length: 1
Mean length: 1.515625
Min length: 1

Unique

Unique: 1
Unique (%): 1.6%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 1

Common Values

Value | Count | Frequency (%)
0 | 29 | 45.3%
1 | 17 | 26.6%
<NA> | 11 | 17.2%
2 | 4 | 6.2%
3 | 2 | 3.1%
5 | 1 | 1.6%

Length

Histogram of lengths of the category


세대수
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
<NA>: 58
1: 2
3: 1
25: 1
20: 1

Length

Max length: 4
Median length: 4
Mean length: 3.75
Min length: 1

Unique

Unique: 4
Unique (%): 6.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 58 | 90.6%
1 | 2 | 3.1%
3 | 1 | 1.6%
25 | 1 | 1.6%
20 | 1 | 1.6%
8 | 1 | 1.6%

Length

Histogram of lengths of the category


호수
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
<NA>: 53
1: 4
11: 2
19: 2
12: 2

Length

Max length: 4
Median length: 4
Mean length: 3.59375
Min length: 1

Unique

Unique: 1
Unique (%): 1.6%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 53 | 82.8%
1 | 4 | 6.2%
11 | 2 | 3.1%
19 | 2 | 3.1%
12 | 2 | 3.1%
13 | 1 | 1.6%

Length

Histogram of lengths of the category


가구수
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B
<NA>: 55
1: 8
7: 1

Length

Max length: 4
Median length: 4
Mean length: 3.578125
Min length: 1

Unique

Unique: 1
Unique (%): 1.6%

Sample

1st row: 1
2nd row: <NA>
3rd row: 1
4th row: 1
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 55 | 85.9%
1 | 8 | 12.5%
7 | 1 | 1.6%

Length

Histogram of lengths of the category


Sample

index | 건축구분 | 허가번호 | 대지위치 | 위도 | 경도 | 대지면적 | 건축면적 | 연면적 | 증축연면적 | 허가일 | 착공일 | 사용승인일 | 주용도 | 부속용도 | 주구조 | 기타구조 | 지상층수 | 지하층수 | 세대수 | 호수 | 가구수
0 | 신축 | 2021-건축과-신축허가-15 | 인천광역시 연수구 동춘동 동춘1도시개발사업구역 26블럭 13로트 | 37.404676 | 126.680942 | 325.6 | 147.77 | 268.52 | <NA> | 2021-08-26 | 2021-09-13 | <NA> | 단독주택 | <NA> | 일반목구조 | <NA> | 2 | 0 | <NA> | <NA> | 1
1 | 신축 | 2021-건축과-신축허가-14 | 인천광역시 연수구 동춘동 810-1 | 37.415434 | 126.656319 | 478.7 | 201.2 | 869.66 | <NA> | 2021-08-23 | 2021-09-16 | <NA> | 제1종근린생활시설 | 휴게음식점 | 철근콘크리트구조 | <NA> | 5 | 0 | <NA> | <NA> | <NA>
2 | 신축 | 2021-건축과-신축허가-13 | 인천광역시 연수구 동춘동 동춘1도시개발사업구역 25블럭 14루트 | 37.404676 | 126.680942 | 250.0 | 122.08 | 167.4 | <NA> | 2021-06-28 | 2021-08-20 | <NA> | 단독주택 | <NA> | 일반목구조 | <NA> | 2 | 0 | <NA> | <NA> | 1
3 | 신축 | 2021-건축과-신축허가-12 | 인천광역시 연수구 옥련동 573-3 | 37.41839 | 126.65571 | 816.2 | 166.77 | 323.58 | <NA> | 2021-06-22 | 2021-08-06 | <NA> | 단독주택 | 단독주택 | 철근콘크리트구조 | <NA> | 3 | 0 | <NA> | <NA> | 1
4 | 증축 | 2021-건축과-증축허가-2 | 인천광역시 연수구 옥련동 405-21 | 37.422491 | 126.654956 | 552.0 | 238.74 | 823.88 | 42.81 | 2021-06-14 | 2021-06-29 | <NA> | 노유자시설 | 노인복지시설 | 철근콘크리트구조 | <NA> | 3 | 1 | <NA> | <NA> | <NA>
5 | 신축 | 2021-건축과-신축허가-10 | 인천광역시 연수구 연수동 515-7 | 37.426416 | 126.683544 | 242.5 | 143.7 | 559.99 | <NA> | 2021-06-10 | 2021-06-24 | <NA> | 단독주택 | 다중주택 및 사무소 | 철근콘크리트구조 | <NA> | 5 | 0 | <NA> | 11 | <NA>
6 | 대수선 | 2021-건축과-대수선허가-3 | 인천광역시 연수구 옥련동 307-1 | 37.427405 | 126.657699 | 1298.0 | 774.12 | 7038.83 | <NA> | 2021-06-04 | 2021-09-09 | <NA> | 제2종근린생활시설 | 제1종근린생활시설 | 철근콘크리트구조 | <NA> | 6 | 3 | <NA> | 1 | <NA>
7 | 신축 | 2021-건축과-신축허가-8 | 인천광역시 연수구 동춘동 1113-7 | 37.414967 | 126.672559 | 887.0 | 300.28 | 702.12 | <NA> | 2021-05-21 | 2021-07-14 | <NA> | 제2종근린생활시설 | 휴게음식점, 사무소 | 철근콘크리트구조 | <NA> | 3 | 0 | <NA> | <NA> | <NA>
8 | 신축 | 2021-건축과-신축허가-7 | 인천광역시 연수구 옥련동 204-1 외1필지 | 37.427186 | 126.644834 | 1303.0 | 648.27 | 4968.77 | <NA> | 2021-05-10 | 2021-05-13 | <NA> | 제2종근린생활시설 | <NA> | 철근콘크리트구조 | <NA> | 6 | 2 | <NA> | <NA> | <NA>
9 | 증축 | 2021-건축과-증축허가-1 | 인천광역시 연수구 동춘동 936 | 37.408315 | 126.671248 | 612.6 | 397.56 | 5152.11 | 169.51 | 2021-05-04 | 2021-05-31 | <NA> | 제1종근린생활시설 | 업무시설,교육연구시설,근린생활시설,다세대주택 | 철근콘크리트구조 | <NA> | 10 | 3 | 3 | <NA> | <NA>
index | 건축구분 | 허가번호 | 대지위치 | 위도 | 경도 | 대지면적 | 건축면적 | 연면적 | 증축연면적 | 허가일 | 착공일 | 사용승인일 | 주용도 | 부속용도 | 주구조 | 기타구조 | 지상층수 | 지하층수 | 세대수 | 호수 | 가구수
54 | 신축 | 2018-건축과-신축허가-19 | 인천광역시 연수구 연수동 563-5 | 37.42069 | 126.675526 | 216.0 | 128.7 | 397.72 | <NA> | 2018-05-28 | 2021-03-03 | 2021-08-02 | 단독주택 | 다중주택및 근린생활시설 | 철근콘크리트구조 | <NA> | 4 | 0 | <NA> | 11 | <NA>
55 | 신축 | 2018-건축과-신축허가-6 | 인천광역시 연수구 연수동 498-7 | 37.426516 | 126.681183 | 209.8 | 124.32 | 394.53 | <NA> | 2018-03-06 | 2021-04-05 | <NA> | 단독주택 | 다중주택 | 철근콘크리트구조 | <NA> | 4 | 0 | <NA> | 13 | <NA>
56 | 증축 | 2017-건축과-증축허가-10 | 인천광역시 연수구 동춘동 698-5 | 37.395903 | 126.663434 | 761.0 | 134.3 | 660.0 | 402.6 | 2017-12-19 | 2021-03-09 | 2021-04-07 | 제2종근린생활시설 | 사무소 | 철근콘크리트구조 | <NA> | 2 | 1 | <NA> | <NA> | <NA>
57 | 신축 | 2021-건축과-신축신고-3 | 인천광역시 연수구 선학동 139-8 | 37.431732 | 126.699358 | 228.26 | 81.24 | 74.54 | <NA> | 2021-08-27 | 2021-09-14 | <NA> | 제2종근린생활시설 | 사무소 | 일반철골구조 | <NA> | 2 | 0 | <NA> | <NA> | <NA>
58 | 신축 | 2021-건축과-신축신고-1 | 인천광역시 연수구 선학동 137-5 | 37.432189 | 126.699168 | 298.0 | 24.0 | 24.0 | <NA> | 2021-04-06 | 2021-04-11 | 2021-09-13 | 제2종근린생활시설 | 사무소 | 기타구조 | <NA> | 1 | 0 | <NA> | 1 | <NA>
59 | 증축 | 2021-건축과-증축신고-2 | 인천광역시 연수구 옥련동 194-6 | 37.423527 | 126.644311 | 1591.0 | 541.76 | 2587.52 | 19.24 | 2021-02-08 | 2021-02-09 | 2021-03-30 | 제2종근린생활시설 | 자동차관련시설,노유자시설 | 일반철골구조 | <NA> | 3 | 0 | <NA> | <NA> | <NA>
60 | 증축 | 2021-건축과-증축신고-1 | 인천광역시 연수구 청학동 3-21 | 37.433093 | 126.659393 | 134.0 | 79.14 | 138.99 | 5.61 | 2021-01-27 | 2021-02-02 | 2021-02-09 | 단독주택 | 단독주택 | 철근콘크리트구조 | <NA> | 2 | <NA> | <NA> | <NA> | 1
61 | 증축 | 2020-건축과-증축신고-5 | 인천광역시 연수구 선학동 367 | 37.423469 | 126.694918 | 274.0 | 157.95 | 547.78 | 11.47 | 2020-11-09 | 2020-12-08 | 2021-02-05 | 제2종근린생활시설 | 소매점,다가구주택 | 철근콘크리트구조 | <NA> | 4 | <NA> | <NA> | <NA> | 7
62 | 증축 | 2020-건축과-증축신고-3 | 인천광역시 연수구 옥련동 258-2 | 37.427196 | 126.651659 | 581.0 | 232.73 | 472.95 | 2.52 | 2020-08-27 | 2020-09-03 | 2020-12-30 | 제2종근린생활시설 | 사무소 | 일반철골구조 | <NA> | 3 | <NA> | <NA> | <NA> | <NA>
63 | 신축 | 2020-건축과-신축신고-2 | 인천광역시 연수구 선학동 137-4 | 37.432078 | 126.699676 | 1094.6 | 77.0 | 77.0 | <NA> | 2020-07-23 | 2020-09-15 | 2021-03-08 | 제2종근린생활시설 | 사무소 | 일반철골구조 | <NA> | 1 | 0 | <NA> | <NA> | <NA>