Overview

Dataset statistics

Number of variables15
Number of observations10000
Missing cells5339
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 MiB
Average record size in memory137.0 B

Variable types

Numeric8
Categorical4
Text3

Dataset

Description허가신고번호,허가신고양식,팀명,개발위치지역코드,개발위치산,개발위치번지,개발위치호,경도도,경도분,경도초,위도도,위도분,위도초,기준년도,구명
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-22149/S/1/datasetView.do

Alerts

구명 is highly overall correlated with 개발위치지역코드 and 1 other fieldsHigh correlation
팀명 is highly overall correlated with 개발위치지역코드 and 1 other fieldsHigh correlation
허가신고번호 is highly overall correlated with 허가신고양식High correlation
개발위치지역코드 is highly overall correlated with 팀명 and 1 other fieldsHigh correlation
경도도 is highly overall correlated with 경도분 and 4 other fieldsHigh correlation
경도분 is highly overall correlated with 경도도 and 4 other fieldsHigh correlation
경도초 is highly overall correlated with 경도도 and 4 other fieldsHigh correlation
위도도 is highly overall correlated with 경도도 and 4 other fieldsHigh correlation
위도분 is highly overall correlated with 경도도 and 4 other fieldsHigh correlation
위도초 is highly overall correlated with 경도도 and 4 other fieldsHigh correlation
허가신고양식 is highly overall correlated with 허가신고번호High correlation
허가신고양식 is highly imbalanced (74.5%)Imbalance
개발위치지역코드 has 260 (2.6%) missing valuesMissing
개발위치번지 has 278 (2.8%) missing valuesMissing
개발위치호 has 4801 (48.0%) missing valuesMissing
경도도 has 7042 (70.4%) zerosZeros
경도분 has 7199 (72.0%) zerosZeros
경도초 has 7146 (71.5%) zerosZeros
위도도 has 7043 (70.4%) zerosZeros
위도분 has 7084 (70.8%) zerosZeros
위도초 has 7144 (71.4%) zerosZeros

Reproduction

Analysis started2024-05-17 22:07:43.874224
Analysis finished2024-05-17 22:08:14.603401
Duration30.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

허가신고번호
Real number (ℝ)

HIGH CORRELATION 

Distinct4191
Distinct (%)41.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0766617 × 109
Minimum1.9010012 × 108
Maximum3.2002 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:14.808724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.9010012 × 108
5-th percentile1.1901001 × 109
Q12.1901002 × 109
median2.1901008 × 109
Q32.2 × 109
95-th percentile2.2008 × 109
Maximum3.2002 × 109
Range3.0100999 × 109
Interquartile range (IQR)9899812

Descriptive statistics

Standard deviation3.2867212 × 108
Coefficient of variation (CV)0.15826945
Kurtosis4.5217884
Mean2.0766617 × 109
Median Absolute Deviation (MAD)8799200.5
Skewness-2.4172782
Sum2.0766617 × 1013
Variance1.0802536 × 1017
MonotonicityNot monotonic
2024-05-18T07:08:15.276401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2190100226 13
 
0.1%
2190100102 13
 
0.1%
2201000002 12
 
0.1%
2200600003 12
 
0.1%
2200100001 11
 
0.1%
2200000014 11
 
0.1%
2200300003 11
 
0.1%
2200500006 11
 
0.1%
2200100005 11
 
0.1%
2190100151 11
 
0.1%
Other values (4181) 9884
98.8%
ValueCountFrequency (%)
190100123 1
< 0.1%
198400002 1
< 0.1%
198400003 1
< 0.1%
198400005 1
< 0.1%
198400007 1
< 0.1%
198400008 1
< 0.1%
198400012 1
< 0.1%
198400020 1
< 0.1%
198600002 1
< 0.1%
198900003 1
< 0.1%
ValueCountFrequency (%)
3200200013 1
< 0.1%
3200200009 1
< 0.1%
3200200005 1
< 0.1%
3200200002 1
< 0.1%
3200200001 2
< 0.1%
3200000002 1
< 0.1%
2201800010 1
< 0.1%
2201800008 1
< 0.1%
2201800003 2
< 0.1%
2201800001 1
< 0.1%

허가신고양식
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신고시설
8678 
허가시설
1093 
경미시설
 
216
유출지하수
 
7
기타시설
 
5

Length

Max length5
Median length4
Mean length4.0007
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row신고시설
2nd row신고시설
3rd row신고시설
4th row신고시설
5th row신고시설

Common Values

ValueCountFrequency (%)
신고시설 8678
86.8%
허가시설 1093
 
10.9%
경미시설 216
 
2.2%
유출지하수 7
 
0.1%
기타시설 5
 
0.1%
온천시설 1
 
< 0.1%

Length

2024-05-18T07:08:15.797556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T07:08:16.386963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고시설 8678
86.8%
허가시설 1093
 
10.9%
경미시설 216
 
2.2%
유출지하수 7
 
0.1%
기타시설 5
 
< 0.1%
온천시설 1
 
< 0.1%

팀명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울특별시 서초구
1120 
서울특별시 강남구
924 
서울특별시 강서구
753 
서울특별시 강동구
646 
서울특별시 노원구
607 
Other values (20)
5950 

Length

Max length10
Median length9
Mean length9.0903
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 금천구
2nd row서울특별시 노원구
3rd row서울특별시 노원구
4th row서울특별시 강서구
5th row서울특별시 강서구

Common Values

ValueCountFrequency (%)
서울특별시 서초구 1120
 
11.2%
서울특별시 강남구 924
 
9.2%
서울특별시 강서구 753
 
7.5%
서울특별시 강동구 646
 
6.5%
서울특별시 노원구 607
 
6.1%
서울특별시 송파구 587
 
5.9%
서울특별시 구로구 516
 
5.2%
서울특별시 도봉구 457
 
4.6%
서울특별시 동대문구 415
 
4.2%
서울특별시 영등포구 412
 
4.1%
Other values (15) 3563
35.6%

Length

2024-05-18T07:08:17.034465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 10000
50.0%
서초구 1120
 
5.6%
강남구 924
 
4.6%
강서구 753
 
3.8%
강동구 646
 
3.2%
노원구 607
 
3.0%
송파구 587
 
2.9%
구로구 516
 
2.6%
도봉구 457
 
2.3%
동대문구 415
 
2.1%
Other values (16) 3975
 
19.9%

개발위치지역코드
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct394
Distinct (%)4.0%
Missing260
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean1.1488352 × 109
Minimum1.1110101 × 109
Maximum1.174011 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:17.677343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1110101 × 109
5-th percentile1.1170127 × 109
Q11.1320108 × 109
median1.1500113 × 109
Q31.1650109 × 109
95-th percentile1.1740105 × 109
Maximum1.174011 × 109
Range63000900
Interquartile range (IQR)33000100

Descriptive statistics

Standard deviation18305331
Coefficient of variation (CV)0.015933818
Kurtosis-1.1342068
Mean1.1488352 × 109
Median Absolute Deviation (MAD)15001000
Skewness-0.33285225
Sum1.1189655 × 1013
Variance3.3508513 × 1014
MonotonicityNot monotonic
2024-05-18T07:08:18.303574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1135010500 297
 
3.0%
1130510300 236
 
2.4%
1132010800 232
 
2.3%
1165010300 219
 
2.2%
1165010900 218
 
2.2%
1171010800 213
 
2.1%
1174011000 205
 
2.1%
1168011200 191
 
1.9%
1165010800 177
 
1.8%
1153010200 176
 
1.8%
Other values (384) 7576
75.8%
(Missing) 260
 
2.6%
ValueCountFrequency (%)
1111010100 4
< 0.1%
1111010500 2
 
< 0.1%
1111010600 3
< 0.1%
1111010700 2
 
< 0.1%
1111010800 7
0.1%
1111011000 2
 
< 0.1%
1111011100 1
 
< 0.1%
1111011400 3
< 0.1%
1111011600 2
 
< 0.1%
1111011700 1
 
< 0.1%
ValueCountFrequency (%)
1174011000 205
2.1%
1174010900 66
 
0.7%
1174010800 41
 
0.4%
1174010700 73
 
0.7%
1174010600 48
 
0.5%
1174010500 66
 
0.7%
1174010300 60
 
0.6%
1174010200 53
 
0.5%
1174010100 34
 
0.3%
1171011400 36
 
0.4%

개발위치산
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5111 
1
4701 
2
 
188

Length

Max length4
Median length4
Mean length2.5333
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5111
51.1%
1 4701
47.0%
2 188
 
1.9%

Length

2024-05-18T07:08:19.048079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T07:08:19.495430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5111
51.1%
1 4701
47.0%
2 188
 
1.9%

개발위치번지
Text

MISSING 

Distinct3342
Distinct (%)34.4%
Missing278
Missing (%)2.8%
Memory size156.2 KiB
2024-05-18T07:08:20.418094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.4025921
Min length1

Characters and Unicode

Total characters33080
Distinct characters46
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2177 ?
Unique (%)22.4%

Sample

1st row906
2nd row966-10
3rd row1205-430
4th row908-20
5th row산51-10
ValueCountFrequency (%)
1 210
 
2.1%
8 48
 
0.5%
2 44
 
0.5%
12 37
 
0.4%
14 34
 
0.3%
618 32
 
0.3%
6 31
 
0.3%
13 30
 
0.3%
304 28
 
0.3%
9 28
 
0.3%
Other values (3321) 9251
94.7%
2024-05-18T07:08:22.196002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5337
16.1%
2 3791
11.5%
3 3502
10.6%
4 3154
9.5%
5 2824
8.5%
6 2679
8.1%
7 2391
7.2%
- 2349
7.1%
0 2238
6.8%
8 2230
6.7%
Other values (36) 2585
7.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30369
91.8%
Dash Punctuation 2349
 
7.1%
Other Letter 263
 
0.8%
Space Separator 55
 
0.2%
Other Punctuation 29
 
0.1%
Uppercase Letter 7
 
< 0.1%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
113
43.0%
41
 
15.6%
33
 
12.5%
22
 
8.4%
10
 
3.8%
8
 
3.0%
6
 
2.3%
5
 
1.9%
2
 
0.8%
2
 
0.8%
Other values (16) 21
 
8.0%
Decimal Number
ValueCountFrequency (%)
1 5337
17.6%
2 3791
12.5%
3 3502
11.5%
4 3154
10.4%
5 2824
9.3%
6 2679
8.8%
7 2391
7.9%
0 2238
7.4%
8 2230
7.3%
9 2223
7.3%
Other Punctuation
ValueCountFrequency (%)
, 11
37.9%
/ 11
37.9%
. 7
24.1%
Uppercase Letter
ValueCountFrequency (%)
B 3
42.9%
L 3
42.9%
W 1
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 2349
100.0%
Space Separator
ValueCountFrequency (%)
55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32810
99.2%
Hangul 263
 
0.8%
Latin 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
113
43.0%
41
 
15.6%
33
 
12.5%
22
 
8.4%
10
 
3.8%
8
 
3.0%
6
 
2.3%
5
 
1.9%
2
 
0.8%
2
 
0.8%
Other values (16) 21
 
8.0%
Common
ValueCountFrequency (%)
1 5337
16.3%
2 3791
11.6%
3 3502
10.7%
4 3154
9.6%
5 2824
8.6%
6 2679
8.2%
7 2391
7.3%
- 2349
7.2%
0 2238
6.8%
8 2230
6.8%
Other values (7) 2315
7.1%
Latin
ValueCountFrequency (%)
B 3
42.9%
L 3
42.9%
W 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32817
99.2%
Hangul 263
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5337
16.3%
2 3791
11.6%
3 3502
10.7%
4 3154
9.6%
5 2824
8.6%
6 2679
8.2%
7 2391
7.3%
- 2349
7.2%
0 2238
6.8%
8 2230
6.8%
Other values (10) 2322
7.1%
Hangul
ValueCountFrequency (%)
113
43.0%
41
 
15.6%
33
 
12.5%
22
 
8.4%
10
 
3.8%
8
 
3.0%
6
 
2.3%
5
 
1.9%
2
 
0.8%
2
 
0.8%
Other values (16) 21
 
8.0%

개발위치호
Text

MISSING 

Distinct451
Distinct (%)8.7%
Missing4801
Missing (%)48.0%
Memory size156.2 KiB
2024-05-18T07:08:23.003398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length1
Mean length1.6164647
Min length1

Characters and Unicode

Total characters8404
Distinct characters14
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique237 ?
Unique (%)4.6%

Sample

1st row12
2nd row2
3rd row2
4th row5
5th row1
ValueCountFrequency (%)
1 860
 
16.5%
2 473
 
9.1%
3 351
 
6.8%
4 292
 
5.6%
5 255
 
4.9%
6 202
 
3.9%
7 162
 
3.1%
10 145
 
2.8%
8 145
 
2.8%
9 121
 
2.3%
Other values (398) 2193
42.2%
2024-05-18T07:08:24.389493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2349
28.0%
2 1309
15.6%
3 888
 
10.6%
4 741
 
8.8%
5 609
 
7.2%
6 560
 
6.7%
7 495
 
5.9%
8 399
 
4.7%
9 386
 
4.6%
0 382
 
4.5%
Other values (4) 286
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8118
96.6%
Space Separator 282
 
3.4%
Other Punctuation 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2349
28.9%
2 1309
16.1%
3 888
 
10.9%
4 741
 
9.1%
5 609
 
7.5%
6 560
 
6.9%
7 495
 
6.1%
8 399
 
4.9%
9 386
 
4.8%
0 382
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
. 1
33.3%
Space Separator
ValueCountFrequency (%)
282
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8404
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2349
28.0%
2 1309
15.6%
3 888
 
10.6%
4 741
 
8.8%
5 609
 
7.2%
6 560
 
6.7%
7 495
 
5.9%
8 399
 
4.7%
9 386
 
4.6%
0 382
 
4.5%
Other values (4) 286
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2349
28.0%
2 1309
15.6%
3 888
 
10.6%
4 741
 
8.8%
5 609
 
7.2%
6 560
 
6.7%
7 495
 
5.9%
8 399
 
4.7%
9 386
 
4.6%
0 382
 
4.5%
Other values (4) 286
 
3.4%

경도도
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.425
Minimum0
Maximum204
Zeros7042
Zeros (%)70.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:24.780451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3126
95-th percentile127
Maximum204
Range204
Interquartile range (IQR)126

Descriptive statistics

Standard deviation57.776959
Coefficient of variation (CV)1.5438065
Kurtosis-1.1927284
Mean37.425
Median Absolute Deviation (MAD)0
Skewness0.89717637
Sum374250
Variance3338.177
MonotonicityNot monotonic
2024-05-18T07:08:25.256960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0 7042
70.4%
127 1739
 
17.4%
126 1198
 
12.0%
125 9
 
0.1%
128 7
 
0.1%
16 1
 
< 0.1%
37 1
 
< 0.1%
129 1
 
< 0.1%
42 1
 
< 0.1%
204 1
 
< 0.1%
ValueCountFrequency (%)
0 7042
70.4%
16 1
 
< 0.1%
37 1
 
< 0.1%
42 1
 
< 0.1%
125 9
 
0.1%
126 1198
 
12.0%
127 1739
 
17.4%
128 7
 
0.1%
129 1
 
< 0.1%
204 1
 
< 0.1%
ValueCountFrequency (%)
204 1
 
< 0.1%
129 1
 
< 0.1%
128 7
 
0.1%
127 1739
 
17.4%
126 1198
 
12.0%
125 9
 
0.1%
42 1
 
< 0.1%
37 1
 
< 0.1%
16 1
 
< 0.1%
0 7042
70.4%

경도분
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct58
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3114
Minimum0
Maximum64
Zeros7199
Zeros (%)72.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:25.871539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile55
Maximum64
Range64
Interquartile range (IQR)2

Descriptive statistics

Standard deviation17.385156
Coefficient of variation (CV)2.3778149
Kurtosis3.2079007
Mean7.3114
Median Absolute Deviation (MAD)0
Skewness2.2462993
Sum73114
Variance302.24365
MonotonicityNot monotonic
2024-05-18T07:08:26.256892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7199
72.0%
3 318
 
3.2%
1 227
 
2.3%
4 199
 
2.0%
2 198
 
2.0%
50 138
 
1.4%
55 129
 
1.3%
6 128
 
1.3%
5 119
 
1.2%
49 113
 
1.1%
Other values (48) 1232
 
12.3%
ValueCountFrequency (%)
0 7199
72.0%
1 227
 
2.3%
2 198
 
2.0%
3 318
 
3.2%
4 199
 
2.0%
5 119
 
1.2%
6 128
 
1.3%
7 103
 
1.0%
8 61
 
0.6%
9 73
 
0.7%
ValueCountFrequency (%)
64 1
 
< 0.1%
60 4
 
< 0.1%
59 84
0.8%
58 101
1.0%
57 98
1.0%
56 96
1.0%
55 129
1.3%
54 106
1.1%
53 91
0.9%
52 51
 
0.5%

경도초
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct821
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.3126581
Minimum0
Maximum99.6
Zeros7146
Zeros (%)71.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:26.931759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q38
95-th percentile49
Maximum99.6
Range99.6
Interquartile range (IQR)8

Descriptive statistics

Standard deviation16.019996
Coefficient of variation (CV)1.9271809
Kurtosis2.2760046
Mean8.3126581
Median Absolute Deviation (MAD)0
Skewness1.8581633
Sum83126.581
Variance256.64028
MonotonicityNot monotonic
2024-05-18T07:08:27.381909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 7146
71.5%
30.0 73
 
0.7%
10.0 73
 
0.7%
20.0 58
 
0.6%
9.0 54
 
0.5%
15.0 48
 
0.5%
57.0 45
 
0.4%
8.0 44
 
0.4%
21.0 41
 
0.4%
31.0 40
 
0.4%
Other values (811) 2378
 
23.8%
ValueCountFrequency (%)
0.0 7146
71.5%
0.38 1
 
< 0.1%
0.4 1
 
< 0.1%
0.45 1
 
< 0.1%
0.47 1
 
< 0.1%
0.5 1
 
< 0.1%
0.53 1
 
< 0.1%
0.61 1
 
< 0.1%
0.64 1
 
< 0.1%
0.65 1
 
< 0.1%
ValueCountFrequency (%)
99.6 1
< 0.1%
99.0 1
< 0.1%
97.84 1
< 0.1%
93.0 1
< 0.1%
69.0 1
< 0.1%
61.0 1
< 0.1%
59.88 1
< 0.1%
59.8 1
< 0.1%
59.65 1
< 0.1%
59.58 1
< 0.1%

위도도
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.9348
Minimum0
Maximum44
Zeros7043
Zeros (%)70.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:27.860393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q337
95-th percentile37
Maximum44
Range44
Interquartile range (IQR)37

Descriptive statistics

Standard deviation16.879113
Coefficient of variation (CV)1.5436142
Kurtosis-1.1967369
Mean10.9348
Median Absolute Deviation (MAD)0
Skewness0.89612556
Sum109348
Variance284.90444
MonotonicityNot monotonic
2024-05-18T07:08:28.230687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0 7043
70.4%
37 2925
29.2%
38 9
 
0.1%
36 6
 
0.1%
34 6
 
0.1%
33 5
 
0.1%
35 4
 
< 0.1%
12 1
 
< 0.1%
44 1
 
< 0.1%
ValueCountFrequency (%)
0 7043
70.4%
12 1
 
< 0.1%
33 5
 
0.1%
34 6
 
0.1%
35 4
 
< 0.1%
36 6
 
0.1%
37 2925
29.2%
38 9
 
0.1%
44 1
 
< 0.1%
ValueCountFrequency (%)
44 1
 
< 0.1%
38 9
 
0.1%
37 2925
29.2%
36 6
 
0.1%
35 4
 
< 0.1%
34 6
 
0.1%
33 5
 
0.1%
12 1
 
< 0.1%
0 7043
70.4%

위도분
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct55
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.2767
Minimum0
Maximum72
Zeros7084
Zeros (%)70.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:28.752084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q327
95-th percentile36
Maximum72
Range72
Interquartile range (IQR)27

Descriptive statistics

Standard deviation14.726677
Coefficient of variation (CV)1.587491
Kurtosis-0.71215687
Mean9.2767
Median Absolute Deviation (MAD)0
Skewness1.0372748
Sum92767
Variance216.87502
MonotonicityNot monotonic
2024-05-18T07:08:29.438971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7084
70.8%
33 340
 
3.4%
32 278
 
2.8%
27 273
 
2.7%
34 253
 
2.5%
30 250
 
2.5%
28 247
 
2.5%
31 243
 
2.4%
29 223
 
2.2%
35 129
 
1.3%
Other values (45) 680
 
6.8%
ValueCountFrequency (%)
0 7084
70.8%
2 2
 
< 0.1%
6 1
 
< 0.1%
8 6
 
0.1%
9 3
 
< 0.1%
10 2
 
< 0.1%
11 6
 
0.1%
12 5
 
0.1%
13 1
 
< 0.1%
14 4
 
< 0.1%
ValueCountFrequency (%)
72 1
 
< 0.1%
60 4
< 0.1%
59 2
 
< 0.1%
58 2
 
< 0.1%
57 3
 
< 0.1%
56 1
 
< 0.1%
55 1
 
< 0.1%
53 2
 
< 0.1%
52 8
0.1%
50 3
 
< 0.1%

위도초
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct805
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.3781339
Minimum0
Maximum81
Zeros7144
Zeros (%)71.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T07:08:29.932942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q38
95-th percentile49
Maximum81
Range81
Interquartile range (IQR)8

Descriptive statistics

Standard deviation16.12269
Coefficient of variation (CV)1.9243772
Kurtosis2.0504211
Mean8.3781339
Median Absolute Deviation (MAD)0
Skewness1.8328855
Sum83781.339
Variance259.94114
MonotonicityNot monotonic
2024-05-18T07:08:30.488403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 7144
71.4%
30.0 73
 
0.7%
15.0 70
 
0.7%
10.0 64
 
0.6%
20.0 63
 
0.6%
50.0 50
 
0.5%
8.0 45
 
0.4%
25.0 44
 
0.4%
40.0 43
 
0.4%
5.0 42
 
0.4%
Other values (795) 2362
 
23.6%
ValueCountFrequency (%)
0.0 7144
71.4%
0.02 1
 
< 0.1%
0.19 1
 
< 0.1%
0.38 1
 
< 0.1%
0.4 1
 
< 0.1%
0.47 1
 
< 0.1%
0.5 1
 
< 0.1%
0.52 1
 
< 0.1%
0.6 2
 
< 0.1%
0.64 1
 
< 0.1%
ValueCountFrequency (%)
81.0 1
< 0.1%
79.0 1
< 0.1%
71.0 1
< 0.1%
63.8 1
< 0.1%
62.13 1
< 0.1%
61.72 1
< 0.1%
60.48 1
< 0.1%
59.95 1
< 0.1%
59.94 1
< 0.1%
59.93 1
< 0.1%
Distinct53
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-18T07:08:31.027468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters40000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st row2018
2nd row2017
3rd row2017
4th row2018
5th row2018
ValueCountFrequency (%)
2017 5651
56.5%
2018 3870
38.7%
2005 48
 
0.5%
2011 43
 
0.4%
84-0 37
 
0.4%
85-0 35
 
0.4%
2003 35
 
0.4%
86-0 32
 
0.3%
87-0 27
 
0.3%
89-0 24
 
0.2%
Other values (43) 198
 
2.0%
2024-05-18T07:08:32.320610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10049
25.1%
2 9695
24.2%
1 9686
24.2%
7 5692
14.2%
8 4161
10.4%
- 330
 
0.8%
5 105
 
0.3%
9 103
 
0.3%
3 63
 
0.2%
6 59
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39670
99.2%
Dash Punctuation 330
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10049
25.3%
2 9695
24.4%
1 9686
24.4%
7 5692
14.3%
8 4161
10.5%
5 105
 
0.3%
9 103
 
0.3%
3 63
 
0.2%
6 59
 
0.1%
4 57
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 330
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10049
25.1%
2 9695
24.2%
1 9686
24.2%
7 5692
14.2%
8 4161
10.4%
- 330
 
0.8%
5 105
 
0.3%
9 103
 
0.3%
3 63
 
0.2%
6 59
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10049
25.1%
2 9695
24.2%
1 9686
24.2%
7 5692
14.2%
8 4161
10.4%
- 330
 
0.8%
5 105
 
0.3%
9 103
 
0.3%
3 63
 
0.2%
6 59
 
0.1%

구명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서초구
1120 
강남구
924 
강서구
753 
강동구
646 
노원구
607 
Other values (20)
5950 

Length

Max length4
Median length3
Mean length3.0903
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금천구
2nd row노원구
3rd row노원구
4th row강서구
5th row강서구

Common Values

ValueCountFrequency (%)
서초구 1120
 
11.2%
강남구 924
 
9.2%
강서구 753
 
7.5%
강동구 646
 
6.5%
노원구 607
 
6.1%
송파구 587
 
5.9%
구로구 516
 
5.2%
도봉구 457
 
4.6%
동대문구 415
 
4.2%
영등포구 412
 
4.1%
Other values (15) 3563
35.6%

Length

2024-05-18T07:08:32.944937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서초구 1120
 
11.2%
강남구 924
 
9.2%
강서구 753
 
7.5%
강동구 646
 
6.5%
노원구 607
 
6.1%
송파구 587
 
5.9%
구로구 516
 
5.2%
도봉구 457
 
4.6%
동대문구 415
 
4.2%
영등포구 412
 
4.1%
Other values (15) 3563
35.6%

Interactions

2024-05-18T07:08:10.755359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:50.087015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:53.113271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:56.641249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:59.631839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:02.363353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:05.114378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:08.081732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:10.967324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:50.468546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:53.494357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:56.941251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:59.982224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:02.755363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:05.384854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:08.584857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:11.263347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:50.898283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:53.909996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:57.440327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:00.296268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:03.069616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:05.805297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:09.006683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:11.605498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:51.306138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:54.225911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:57.785550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:00.699634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:03.424822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:06.279033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:09.300805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:11.882559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:51.538268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:54.674203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:58.175331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:01.077101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:03.716550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:06.643183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:09.570802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:12.128211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:52.130688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:55.148555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:58.521232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:01.346978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:04.029030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:07.005141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:09.836267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:12.556959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:52.415064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:55.634040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:58.878994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:01.693860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:04.397270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:07.357523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:10.163167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:12.849638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:52.777833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:56.265944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:07:59.311947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:01.984360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:04.756488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:07.726485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T07:08:10.445005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T07:08:33.351967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
허가신고번호허가신고양식팀명개발위치지역코드개발위치산경도도경도분경도초위도도위도분위도초기준년도구명
허가신고번호1.0000.9070.3130.2270.0370.0730.1560.1030.0710.1430.0900.8060.313
허가신고양식0.9071.0000.4460.3420.0480.1230.1580.1210.1240.3060.1470.7630.446
팀명0.3130.4461.0001.0000.3340.2160.6450.1700.2210.5370.2070.7791.000
개발위치지역코드0.2270.3421.0001.0000.2190.1630.6320.1210.1650.5410.1670.6671.000
개발위치산0.0370.0480.3340.2191.0000.0000.0420.0110.0000.1320.0190.0000.334
경도도0.0730.1230.2160.1630.0001.0000.6960.6100.9910.8400.7860.0000.216
경도분0.1560.1580.6450.6320.0420.6961.0000.4800.7270.7350.5900.2510.645
경도초0.1030.1210.1700.1210.0110.6100.4801.0000.6100.6210.6020.1190.170
위도도0.0710.1240.2210.1650.0000.9910.7270.6101.0000.8650.7880.1710.221
위도분0.1430.3060.5370.5410.1320.8400.7350.6210.8651.0000.7370.1910.537
위도초0.0900.1470.2070.1670.0190.7860.5900.6020.7880.7371.0000.0000.207
기준년도0.8060.7630.7790.6670.0000.0000.2510.1190.1710.1910.0001.0000.779
구명0.3130.4461.0001.0000.3340.2160.6450.1700.2210.5370.2070.7791.000
2024-05-18T07:08:33.725707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구명개발위치산허가신고양식팀명
구명1.0000.2880.2181.000
개발위치산0.2881.0000.0340.288
허가신고양식0.2180.0341.0000.218
팀명1.0000.2880.2181.000
2024-05-18T07:08:34.056474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
허가신고번호개발위치지역코드경도도경도분경도초위도도위도분위도초허가신고양식팀명개발위치산구명
허가신고번호1.0000.0370.4150.3710.4000.4060.3970.3960.7910.1730.0250.173
개발위치지역코드0.0371.0000.0200.0050.0040.006-0.0630.0050.1870.9990.1680.999
경도도0.4150.0201.0000.8990.9390.9840.9560.9410.0830.0940.0000.094
경도분0.3710.0050.8991.0000.9220.9440.9340.9240.0840.2870.0320.287
경도초0.4000.0040.9390.9221.0000.9550.9450.9500.0670.0680.0080.068
위도도0.4060.0060.9840.9440.9551.0000.9700.9570.0840.0970.0000.097
위도분0.397-0.0630.9560.9340.9450.9701.0000.9430.1660.2200.1310.220
위도초0.3960.0050.9410.9240.9500.9570.9431.0000.0780.0740.0140.074
허가신고양식0.7910.1870.0830.0840.0670.0840.1660.0781.0000.2180.0340.218
팀명0.1730.9990.0940.2870.0680.0970.2200.0740.2181.0000.2881.000
개발위치산0.0250.1680.0000.0320.0080.0000.1310.0140.0340.2881.0000.288
구명0.1730.9990.0940.2870.0680.0970.2200.0740.2181.0000.2881.000

Missing values

2024-05-18T07:08:13.295155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T07:08:13.958457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-18T07:08:14.398695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

허가신고번호허가신고양식팀명개발위치지역코드개발위치산개발위치번지개발위치호경도도경도분경도초위도도위도분위도초기준년도구명
236932190100321신고시설서울특별시 금천구1154510200<NA>90612000.0000.02018금천구
72382190100118신고시설서울특별시 노원구1135010500<NA>966-10<NA>127325.0374015.02017노원구
223962190100128신고시설서울특별시 노원구1135010500<NA>1205-430<NA>000.0000.02017노원구
204192199400597신고시설서울특별시 강서구1150010300<NA>908-20<NA>000.0000.02018강서구
130182190100031신고시설서울특별시 강서구1150010700<NA>산51-10<NA>000.0000.02018강서구
158682190100879신고시설서울특별시 강동구117401090013342000.0000.02017강동구
89852190100409신고시설서울특별시 송파구1171011100<NA>126-4<NA>000.0000.02017송파구
158642190100263신고시설서울특별시 양천구<NA><NA><NA><NA>000.0000.091-0양천구
146382190100839신고시설서울특별시 강동구1174010500249<NA>000.0000.02017강동구
41871190100128신고시설서울특별시 강남구1168010500<NA>128-17<NA>000.0000.02017강남구
허가신고번호허가신고양식팀명개발위치지역코드개발위치산개발위치번지개발위치호경도도경도분경도초위도도위도분위도초기준년도구명
275042200400001신고시설서울특별시 동작구11590108001343-1<NA>1263041.0373032.02017동작구
61262190101647신고시설서울특별시 강남구1168010600<NA>997-4<NA>000.0000.02017강남구
236182190100171신고시설서울특별시 양천구<NA><NA><NA><NA>000.0000.084-1양천구
39902200900003신고시설서울특별시 강동구11740107001105<NA>1271528.0372742.02017강동구
233852200900004신고시설서울특별시 금천구1154510200190041265426.93372848.812018금천구
180112200900007신고시설서울특별시 강서구1150010700128011264920.1373243.02018강서구
79642190100298신고시설서울특별시 마포구1144012400148022000.0000.02017마포구
64492200100149경미시설서울특별시 서초구116501030015764127132.0372722.02017서초구
267372200300003신고시설서울특별시 노원구113501050011205<NA>12732.037411.02017노원구
75632200000338신고시설서울특별시 도봉구1132010600<NA>412-4<NA>000.0000.02017도봉구