Overview

Dataset statistics

Number of variables6
Number of observations108
Missing cells106
Missing cells (%)16.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory51.2 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description경기도_고양시_마을회관현황에 대한 데이터로 마을회관의 시설명, 위치, 연면적, 이용현황 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/3079134/fileData.do

Alerts

이용현황 is highly imbalanced (72.5%)Imbalance
비 고 has 106 (98.1%) missing valuesMissing
연번 has unique valuesUnique
시설명 has unique valuesUnique
위 치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:46:15.606987
Analysis finished2023-12-12 06:46:16.525574
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.5
Minimum1
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T15:46:16.614771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.35
Q127.75
median54.5
Q381.25
95-th percentile102.65
Maximum108
Range107
Interquartile range (IQR)53.5

Descriptive statistics

Standard deviation31.32092
Coefficient of variation (CV)0.57469577
Kurtosis-1.2
Mean54.5
Median Absolute Deviation (MAD)27
Skewness0
Sum5886
Variance981
MonotonicityStrictly increasing
2023-12-12T15:46:16.811697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
70 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
Other values (98) 98
90.7%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%

시설명
Text

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size996.0 B
2023-12-12T15:46:17.086389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length9
Mean length9.6574074
Min length8

Characters and Unicode

Total characters1043
Distinct characters55
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row관산14통 마을회관
2nd row관산19통 마을회관
3rd row관산20통 마을회관
4th row관산21통 마을회관
5th row관산30통 마을회관
ValueCountFrequency (%)
마을회관 101
48.3%
관산19통 1
 
0.5%
고봉5통마을회관 1
 
0.5%
고봉4통 1
 
0.5%
고봉3통마을회관 1
 
0.5%
고봉2통 1
 
0.5%
고봉1통마을회관 1
 
0.5%
장항1동3통 1
 
0.5%
풍산6통 1
 
0.5%
풍산2,14통 1
 
0.5%
Other values (99) 99
47.4%
2023-12-12T15:46:17.869404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
115
11.0%
109
 
10.5%
109
 
10.5%
108
 
10.4%
107
 
10.3%
105
 
10.1%
1 43
 
4.1%
25
 
2.4%
2 24
 
2.3%
4 17
 
1.6%
Other values (45) 281
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 767
73.5%
Decimal Number 160
 
15.3%
Space Separator 105
 
10.1%
Other Punctuation 10
 
1.0%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
15.0%
109
14.2%
109
14.2%
108
14.1%
107
14.0%
25
 
3.3%
15
 
2.0%
13
 
1.7%
11
 
1.4%
10
 
1.3%
Other values (32) 145
18.9%
Decimal Number
ValueCountFrequency (%)
1 43
26.9%
2 24
15.0%
4 17
 
10.6%
5 17
 
10.6%
6 15
 
9.4%
3 13
 
8.1%
7 10
 
6.2%
8 10
 
6.2%
9 6
 
3.8%
0 5
 
3.1%
Space Separator
ValueCountFrequency (%)
105
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 767
73.5%
Common 276
 
26.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
15.0%
109
14.2%
109
14.2%
108
14.1%
107
14.0%
25
 
3.3%
15
 
2.0%
13
 
1.7%
11
 
1.4%
10
 
1.3%
Other values (32) 145
18.9%
Common
ValueCountFrequency (%)
105
38.0%
1 43
15.6%
2 24
 
8.7%
4 17
 
6.2%
5 17
 
6.2%
6 15
 
5.4%
3 13
 
4.7%
, 10
 
3.6%
7 10
 
3.6%
8 10
 
3.6%
Other values (3) 12
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 767
73.5%
ASCII 276
 
26.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
115
15.0%
109
14.2%
109
14.2%
108
14.1%
107
14.0%
25
 
3.3%
15
 
2.0%
13
 
1.7%
11
 
1.4%
10
 
1.3%
Other values (32) 145
18.9%
ASCII
ValueCountFrequency (%)
105
38.0%
1 43
15.6%
2 24
 
8.7%
4 17
 
6.2%
5 17
 
6.2%
6 15
 
5.4%
3 13
 
4.7%
, 10
 
3.6%
7 10
 
3.6%
8 10
 
3.6%
Other values (3) 12
 
4.3%

위 치
Text

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size996.0 B
2023-12-12T15:46:18.209124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length21.490741
Min length16

Characters and Unicode

Total characters2321
Distinct characters108
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row경기도 고양시 덕양구 통일로966번길 56
2nd row경기도 고양시 덕양구 고골길11번길 19
3rd row경기도 고양시 덕양구 통일로1154번길 23-5
4th row경기도 고양시 덕양구 내유길83번길 3
5th row경기도 고양시 덕양구 유산길37번길 41
ValueCountFrequency (%)
고양시 108
23.7%
경기도 107
23.5%
덕양구 68
14.9%
일산동구 22
 
4.8%
일산서구 18
 
3.9%
경기동 1
 
0.2%
동헌로246 1
 
0.2%
13 1
 
0.2%
진밭로 1
 
0.2%
은마길224-7 1
 
0.2%
Other values (128) 128
28.1%
2023-12-12T15:46:18.758636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
351
 
15.1%
181
 
7.8%
114
 
4.9%
111
 
4.8%
109
 
4.7%
108
 
4.7%
108
 
4.7%
108
 
4.7%
93
 
4.0%
1 91
 
3.9%
Other values (98) 947
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1477
63.6%
Decimal Number 455
 
19.6%
Space Separator 351
 
15.1%
Dash Punctuation 38
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
181
12.3%
114
 
7.7%
111
 
7.5%
109
 
7.4%
108
 
7.3%
108
 
7.3%
108
 
7.3%
93
 
6.3%
73
 
4.9%
72
 
4.9%
Other values (86) 400
27.1%
Decimal Number
ValueCountFrequency (%)
1 91
20.0%
3 54
11.9%
2 52
11.4%
6 50
11.0%
4 49
10.8%
5 40
8.8%
9 32
 
7.0%
8 31
 
6.8%
7 31
 
6.8%
0 25
 
5.5%
Space Separator
ValueCountFrequency (%)
351
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1477
63.6%
Common 844
36.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
181
12.3%
114
 
7.7%
111
 
7.5%
109
 
7.4%
108
 
7.3%
108
 
7.3%
108
 
7.3%
93
 
6.3%
73
 
4.9%
72
 
4.9%
Other values (86) 400
27.1%
Common
ValueCountFrequency (%)
351
41.6%
1 91
 
10.8%
3 54
 
6.4%
2 52
 
6.2%
6 50
 
5.9%
4 49
 
5.8%
5 40
 
4.7%
- 38
 
4.5%
9 32
 
3.8%
8 31
 
3.7%
Other values (2) 56
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1477
63.6%
ASCII 844
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
351
41.6%
1 91
 
10.8%
3 54
 
6.4%
2 52
 
6.2%
6 50
 
5.9%
4 49
 
5.8%
5 40
 
4.7%
- 38
 
4.5%
9 32
 
3.8%
8 31
 
3.7%
Other values (2) 56
 
6.6%
Hangul
ValueCountFrequency (%)
181
12.3%
114
 
7.7%
111
 
7.5%
109
 
7.4%
108
 
7.3%
108
 
7.3%
108
 
7.3%
93
 
6.3%
73
 
4.9%
72
 
4.9%
Other values (86) 400
27.1%

연면적(제곱미터)
Real number (ℝ)

Distinct104
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.70139
Minimum21
Maximum558
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T15:46:18.911245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21
5-th percentile83.427
Q1139.6175
median198.84
Q3240.8175
95-th percentile301.16
Maximum558
Range537
Interquartile range (IQR)101.2

Descriptive statistics

Standard deviation78.124422
Coefficient of variation (CV)0.3971727
Kurtosis3.4311895
Mean196.70139
Median Absolute Deviation (MAD)53.1
Skewness0.86379071
Sum21243.75
Variance6103.4253
MonotonicityNot monotonic
2023-12-12T15:46:19.084569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 3
 
2.8%
203.0 2
 
1.9%
199.0 2
 
1.9%
207.4 1
 
0.9%
197.98 1
 
0.9%
95.04 1
 
0.9%
321.94 1
 
0.9%
260.88 1
 
0.9%
211.2 1
 
0.9%
389.94 1
 
0.9%
Other values (94) 94
87.0%
ValueCountFrequency (%)
21.0 1
0.9%
38.0 1
0.9%
67.41 1
0.9%
68.86 1
0.9%
69.56 1
0.9%
83.0 1
0.9%
84.22 1
0.9%
92.17 1
0.9%
95.04 1
0.9%
97.2 1
0.9%
ValueCountFrequency (%)
558.0 1
0.9%
389.94 1
0.9%
369.0 1
0.9%
321.94 1
0.9%
316.94 1
0.9%
302.0 1
0.9%
299.6 1
0.9%
296.0 1
0.9%
295.8 1
0.9%
292.74 1
0.9%

이용현황
Categorical

IMBALANCE 

Distinct5
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size996.0 B
마을회관, 경로당
97 
마을회관
 
6
경로당, 마을회관
 
3
마을회관 사무실
 
1
마을회관,경로당, 체력단련실
 
1

Length

Max length15
Median length9
Mean length8.7685185
Min length4

Unique

Unique2 ?
Unique (%)1.9%

Sample

1st row마을회관
2nd row마을회관, 경로당
3rd row마을회관, 경로당
4th row마을회관, 경로당
5th row마을회관, 경로당

Common Values

ValueCountFrequency (%)
마을회관, 경로당 97
89.8%
마을회관 6
 
5.6%
경로당, 마을회관 3
 
2.8%
마을회관 사무실 1
 
0.9%
마을회관,경로당, 체력단련실 1
 
0.9%

Length

2023-12-12T15:46:19.282763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:46:19.419195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
마을회관 107
51.0%
경로당 100
47.6%
사무실 1
 
0.5%
마을회관,경로당 1
 
0.5%
체력단련실 1
 
0.5%

비 고
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing106
Missing (%)98.1%
Memory size996.0 B
2023-12-12T15:46:19.577306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5
Min length3

Characters and Unicode

Total characters10
Distinct characters10
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row2개동
2nd row(한울노인정)
ValueCountFrequency (%)
2개동 1
50.0%
한울노인정 1
50.0%
2023-12-12T15:46:19.968479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1
10.0%
1
10.0%
1
10.0%
( 1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
) 1
10.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7
70.0%
Decimal Number 1
 
10.0%
Open Punctuation 1
 
10.0%
Close Punctuation 1
 
10.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7
70.0%
Common 3
30.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Common
ValueCountFrequency (%)
2 1
33.3%
( 1
33.3%
) 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7
70.0%
ASCII 3
30.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1
33.3%
( 1
33.3%
) 1
33.3%
Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Interactions

2023-12-12T15:46:16.097527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:15.893397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:16.198951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:15.992774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:46:20.076778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연면적(제곱미터)이용현황비 고
연번1.0000.0000.569NaN
연면적(제곱미터)0.0001.0000.2240.000
이용현황0.5690.2241.000NaN
비 고NaN0.000NaN1.000
2023-12-12T15:46:20.198161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연면적(제곱미터)이용현황
연번1.0000.0150.263
연면적(제곱미터)0.0151.0000.134
이용현황0.2630.1341.000

Missing values

2023-12-12T15:46:16.335304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:46:16.477104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설명위 치연면적(제곱미터)이용현황비 고
01관산14통 마을회관경기도 고양시 덕양구 통일로966번길 56292.74마을회관<NA>
12관산19통 마을회관경기도 고양시 덕양구 고골길11번길 19180.75마을회관, 경로당<NA>
23관산20통 마을회관경기도 고양시 덕양구 통일로1154번길 23-5165.4마을회관, 경로당<NA>
34관산21통 마을회관경기도 고양시 덕양구 내유길83번길 3197.85마을회관, 경로당<NA>
45관산30통 마을회관경기도 고양시 덕양구 유산길37번길 41214.0마을회관, 경로당<NA>
56관산24통 마을회관경기도 고양시 덕양구 고골길264번길3-8195.2마을회관, 경로당<NA>
67관산25통 마을회관경기도 고양시 덕양구 성령길5225.94마을회관, 경로당<NA>
78창릉1통 마을회관경기도 고양시 덕양구 고양대로1978번길12131.0마을회관, 경로당<NA>
89창릉2통 마을회관경기도 고양시 덕양구 고양대로1940번길62242.52마을회관, 경로당<NA>
910창릉5통 마을회관경기도 고양시 덕양구 화랑로332번길16-4207.45마을회관, 경로당<NA>
연번시설명위 치연면적(제곱미터)이용현황비 고
9899덕이4통 마을회관경기도 고양시 일산서구 송포백송길1584.22마을회관<NA>
99100덕이5통 마을회관경기도 고양시 일산서구 덕이로287128.8마을회관, 경로당<NA>
100101덕이6통 마을회관경기도 고양시 일산서구 구산로69번길 32197.64마을회관, 경로당<NA>
101102가좌2통 마을회관경기도 고양시 일산서구 송산로266번길19-116272.58마을회관, 경로당<NA>
102103가좌4통 마을회관경기도 고양시 일산서구 송포로425번길133156.4마을회관, 경로당<NA>
103104가좌5통 마을회관경기도 고양시 일산서구 덕산로 121번길 5187.7마을회관, 경로당<NA>
104105가좌6통 마을회관경기도 고양시 일산서구 덕산로48번길22198.96마을회관, 경로당<NA>
105106가좌7통 마을회관경기도 고양시 일산서구 곳산길 160198.95마을회관, 경로당<NA>
106107가좌8통 마을회관경기도 고양시 일산서구 이산포길 498-58276.29마을회관, 경로당<NA>
107108창릉4통 마을회관경기도 고양시 덕양구 서오릉로 504-49218.13마을회관<NA>