Overview

Dataset statistics

Number of variables6
Number of observations1289
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory61.8 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description강원특별자치도 내 시군별 마을회관에 대한 데이터로 시군명, 시설명, 마을명, 소재지 도로명주소 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15033674/fileData.do

Alerts

시도명 has constant value ""Constant
연번 is highly overall correlated with 시군구명High correlation
시군구명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:04:51.197987
Analysis finished2023-12-12 07:04:52.254582
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1289
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean645
Minimum1
Maximum1289
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.5 KiB
2023-12-12T16:04:52.318945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile65.4
Q1323
median645
Q3967
95-th percentile1224.6
Maximum1289
Range1288
Interquartile range (IQR)644

Descriptive statistics

Standard deviation372.24656
Coefficient of variation (CV)0.57712645
Kurtosis-1.2
Mean645
Median Absolute Deviation (MAD)322
Skewness0
Sum831405
Variance138567.5
MonotonicityStrictly increasing
2023-12-12T16:04:52.450584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
887 1
 
0.1%
865 1
 
0.1%
864 1
 
0.1%
863 1
 
0.1%
862 1
 
0.1%
861 1
 
0.1%
860 1
 
0.1%
859 1
 
0.1%
858 1
 
0.1%
Other values (1279) 1279
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1289 1
0.1%
1288 1
0.1%
1287 1
0.1%
1286 1
0.1%
1285 1
0.1%
1284 1
0.1%
1283 1
0.1%
1282 1
0.1%
1281 1
0.1%
1280 1
0.1%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
강원특별자치도
1289 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원특별자치도
2nd row강원특별자치도
3rd row강원특별자치도
4th row강원특별자치도
5th row강원특별자치도

Common Values

ValueCountFrequency (%)
강원특별자치도 1289
100.0%

Length

2023-12-12T16:04:52.566074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:04:52.642950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원특별자치도 1289
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
횡성군
167 
삼척시
161 
춘천시
157 
영월군
156 
강릉시
132 
Other values (11)
516 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row춘천시
2nd row춘천시
3rd row춘천시
4th row춘천시
5th row춘천시

Common Values

ValueCountFrequency (%)
횡성군 167
13.0%
삼척시 161
12.5%
춘천시 157
12.2%
영월군 156
12.1%
강릉시 132
10.2%
원주시 129
10.0%
양양군 114
8.8%
양구군 82
6.4%
홍천군 74
5.7%
화천군 50
 
3.9%
Other values (6) 67
5.2%

Length

2023-12-12T16:04:52.745801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
횡성군 167
13.0%
삼척시 161
12.5%
춘천시 157
12.2%
영월군 156
12.1%
강릉시 132
10.2%
원주시 129
10.0%
양양군 114
8.8%
양구군 82
6.4%
홍천군 74
5.7%
화천군 50
 
3.9%
Other values (6) 67
5.2%
Distinct1228
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T16:04:52.992023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length8
Mean length8.1512801
Min length4

Characters and Unicode

Total characters10507
Distinct characters270
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1169 ?
Unique (%)90.7%

Sample

1st row청평1리마을회관
2nd row청평2리마을회관
3rd row부귀리마을회관
4th row추곡1리마을회관
5th row추곡2리마을회관
ValueCountFrequency (%)
종합복지회관 114
 
7.7%
마을회관 74
 
5.0%
학곡1리마을회관 3
 
0.2%
추동리마을회관 3
 
0.2%
리마을회관 3
 
0.2%
매지2리마을회관 2
 
0.1%
용석1리마을회관 2
 
0.1%
심포리마을회관 2
 
0.1%
강림1리마을회관 2
 
0.1%
학곡2리마을회관 2
 
0.1%
Other values (1227) 1280
86.1%
2023-12-12T16:04:53.346170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1291
12.3%
1287
12.2%
1227
 
11.7%
1199
 
11.4%
1170
 
11.1%
1 339
 
3.2%
2 321
 
3.1%
201
 
1.9%
144
 
1.4%
3 126
 
1.2%
Other values (260) 3202
30.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9339
88.9%
Decimal Number 912
 
8.7%
Space Separator 201
 
1.9%
Open Punctuation 26
 
0.2%
Close Punctuation 26
 
0.2%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1291
13.8%
1287
13.8%
1227
13.1%
1199
12.8%
1170
12.5%
144
 
1.5%
118
 
1.3%
117
 
1.3%
115
 
1.2%
85
 
0.9%
Other values (245) 2586
27.7%
Decimal Number
ValueCountFrequency (%)
1 339
37.2%
2 321
35.2%
3 126
 
13.8%
4 54
 
5.9%
5 24
 
2.6%
6 16
 
1.8%
7 10
 
1.1%
9 8
 
0.9%
8 7
 
0.8%
0 7
 
0.8%
Space Separator
ValueCountFrequency (%)
201
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9339
88.9%
Common 1168
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1291
13.8%
1287
13.8%
1227
13.1%
1199
12.8%
1170
12.5%
144
 
1.5%
118
 
1.3%
117
 
1.3%
115
 
1.2%
85
 
0.9%
Other values (245) 2586
27.7%
Common
ValueCountFrequency (%)
1 339
29.0%
2 321
27.5%
201
17.2%
3 126
 
10.8%
4 54
 
4.6%
( 26
 
2.2%
) 26
 
2.2%
5 24
 
2.1%
6 16
 
1.4%
7 10
 
0.9%
Other values (5) 25
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9339
88.9%
ASCII 1168
 
11.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1291
13.8%
1287
13.8%
1227
13.1%
1199
12.8%
1170
12.5%
144
 
1.5%
118
 
1.3%
117
 
1.3%
115
 
1.2%
85
 
0.9%
Other values (245) 2586
27.7%
ASCII
ValueCountFrequency (%)
1 339
29.0%
2 321
27.5%
201
17.2%
3 126
 
10.8%
4 54
 
4.6%
( 26
 
2.2%
) 26
 
2.2%
5 24
 
2.1%
6 16
 
1.4%
7 10
 
0.9%
Other values (5) 25
 
2.1%
Distinct1222
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T16:04:53.728122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length3.9511249
Min length2

Characters and Unicode

Total characters5093
Distinct characters262
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1156 ?
Unique (%)89.7%

Sample

1st row청평1리
2nd row청평2리
3rd row부귀리
4th row추곡1리
5th row추곡2리
ValueCountFrequency (%)
학곡1리 3
 
0.2%
반곡리 3
 
0.2%
두산2리 2
 
0.2%
상2리 2
 
0.2%
용석4리 2
 
0.2%
신촌리 2
 
0.2%
창촌3리 2
 
0.2%
교항리 2
 
0.2%
두산3리 2
 
0.2%
두산1리 2
 
0.2%
Other values (1211) 1271
98.3%
2023-12-12T16:04:54.201158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1217
23.9%
1 337
 
6.6%
2 318
 
6.2%
127
 
2.5%
3 126
 
2.5%
96
 
1.9%
90
 
1.8%
78
 
1.5%
76
 
1.5%
74
 
1.5%
Other values (252) 2554
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4079
80.1%
Decimal Number 907
 
17.8%
Space Separator 96
 
1.9%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1217
29.8%
127
 
3.1%
90
 
2.2%
78
 
1.9%
76
 
1.9%
74
 
1.8%
67
 
1.6%
57
 
1.4%
53
 
1.3%
48
 
1.2%
Other values (237) 2192
53.7%
Decimal Number
ValueCountFrequency (%)
1 337
37.2%
2 318
35.1%
3 126
 
13.9%
4 54
 
6.0%
5 24
 
2.6%
6 16
 
1.8%
7 10
 
1.1%
9 8
 
0.9%
0 7
 
0.8%
8 7
 
0.8%
Space Separator
ValueCountFrequency (%)
96
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4079
80.1%
Common 1014
 
19.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1217
29.8%
127
 
3.1%
90
 
2.2%
78
 
1.9%
76
 
1.9%
74
 
1.8%
67
 
1.6%
57
 
1.4%
53
 
1.3%
48
 
1.2%
Other values (237) 2192
53.7%
Common
ValueCountFrequency (%)
1 337
33.2%
2 318
31.4%
3 126
 
12.4%
96
 
9.5%
4 54
 
5.3%
5 24
 
2.4%
6 16
 
1.6%
7 10
 
1.0%
9 8
 
0.8%
0 7
 
0.7%
Other values (5) 18
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4079
80.1%
ASCII 1014
 
19.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1217
29.8%
127
 
3.1%
90
 
2.2%
78
 
1.9%
76
 
1.9%
74
 
1.8%
67
 
1.6%
57
 
1.4%
53
 
1.3%
48
 
1.2%
Other values (237) 2192
53.7%
ASCII
ValueCountFrequency (%)
1 337
33.2%
2 318
31.4%
3 126
 
12.4%
96
 
9.5%
4 54
 
5.3%
5 24
 
2.4%
6 16
 
1.6%
7 10
 
1.0%
9 8
 
0.8%
0 7
 
0.7%
Other values (5) 18
 
1.8%
Distinct1285
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T16:04:54.685612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length30
Mean length23.076804
Min length16

Characters and Unicode

Total characters29746
Distinct characters346
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1281 ?
Unique (%)99.4%

Sample

1st row강원특별자치도 춘천시 북산면 오봉산길 674
2nd row강원특별자치도 춘천시 북산면 삼막길 565-26
3rd row강원특별자치도 춘천시 춘천시 북산면 삼막길 59
4th row강원특별자치도 춘천시 북산면 북산로 33-5
5th row강원특별자치도 춘천시 북산면 중추곡길 59
ValueCountFrequency (%)
강원특별자치도 1289
 
21.1%
횡성군 167
 
2.7%
삼척시 161
 
2.6%
춘천시 158
 
2.6%
영월군 156
 
2.6%
원주시 129
 
2.1%
양양군 114
 
1.9%
양구군 82
 
1.3%
홍천군 74
 
1.2%
화천군 50
 
0.8%
Other values (1883) 3733
61.1%
2023-12-12T16:04:55.337447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4841
 
16.3%
1511
 
5.1%
1404
 
4.7%
1340
 
4.5%
1307
 
4.4%
1298
 
4.4%
1290
 
4.3%
1289
 
4.3%
1 846
 
2.8%
824
 
2.8%
Other values (336) 13796
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20436
68.7%
Space Separator 4841
 
16.3%
Decimal Number 4023
 
13.5%
Dash Punctuation 400
 
1.3%
Close Punctuation 23
 
0.1%
Open Punctuation 23
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1511
 
7.4%
1404
 
6.9%
1340
 
6.6%
1307
 
6.4%
1298
 
6.4%
1290
 
6.3%
1289
 
6.3%
824
 
4.0%
780
 
3.8%
677
 
3.3%
Other values (322) 8716
42.7%
Decimal Number
ValueCountFrequency (%)
1 846
21.0%
2 594
14.8%
3 421
10.5%
4 386
9.6%
5 341
8.5%
6 318
 
7.9%
9 289
 
7.2%
7 283
 
7.0%
8 276
 
6.9%
0 269
 
6.7%
Space Separator
ValueCountFrequency (%)
4841
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 400
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20436
68.7%
Common 9310
31.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1511
 
7.4%
1404
 
6.9%
1340
 
6.6%
1307
 
6.4%
1298
 
6.4%
1290
 
6.3%
1289
 
6.3%
824
 
4.0%
780
 
3.8%
677
 
3.3%
Other values (322) 8716
42.7%
Common
ValueCountFrequency (%)
4841
52.0%
1 846
 
9.1%
2 594
 
6.4%
3 421
 
4.5%
- 400
 
4.3%
4 386
 
4.1%
5 341
 
3.7%
6 318
 
3.4%
9 289
 
3.1%
7 283
 
3.0%
Other values (4) 591
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20436
68.7%
ASCII 9310
31.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4841
52.0%
1 846
 
9.1%
2 594
 
6.4%
3 421
 
4.5%
- 400
 
4.3%
4 386
 
4.1%
5 341
 
3.7%
6 318
 
3.4%
9 289
 
3.1%
7 283
 
3.0%
Other values (4) 591
 
6.3%
Hangul
ValueCountFrequency (%)
1511
 
7.4%
1404
 
6.9%
1340
 
6.6%
1307
 
6.4%
1298
 
6.4%
1290
 
6.3%
1289
 
6.3%
824
 
4.0%
780
 
3.8%
677
 
3.3%
Other values (322) 8716
42.7%

Interactions

2023-12-12T16:04:51.744871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:04:55.445345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구명
연번1.0000.952
시군구명0.9521.000
2023-12-12T16:04:55.548981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구명
연번1.0000.790
시군구명0.7901.000

Missing values

2023-12-12T16:04:52.126157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:04:52.216373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도명시군구명시설명마을명소재지도로명주소
01강원특별자치도춘천시청평1리마을회관청평1리강원특별자치도 춘천시 북산면 오봉산길 674
12강원특별자치도춘천시청평2리마을회관청평2리강원특별자치도 춘천시 북산면 삼막길 565-26
23강원특별자치도춘천시부귀리마을회관부귀리강원특별자치도 춘천시 춘천시 북산면 삼막길 59
34강원특별자치도춘천시추곡1리마을회관추곡1리강원특별자치도 춘천시 북산면 북산로 33-5
45강원특별자치도춘천시추곡2리마을회관추곡2리강원특별자치도 춘천시 북산면 중추곡길 59
56강원특별자치도춘천시오항1리마을회관오항1리강원특별자치도 춘천시 북산면 북산로 491
67강원특별자치도춘천시오항2리마을회관오항2리강원특별자치도 춘천시 북산면 장재골길 23
78강원특별자치도춘천시물로2리마을회관물로2리강원특별자치도 춘천시 북산면 물로길 381-5
89강원특별자치도춘천시조교1리마을회관조교1리강원특별자치도 춘천시 북산면 원동조교로 361
910강원특별자치도춘천시대곡리마을회관대곡리강원특별자치도 춘천시 북산면 더운샘길 43
연번시도명시군구명시설명마을명소재지도로명주소
12791280강원특별자치도양양군적은리 종합복지회관적은리강원특별자치도 양양군 강현면 안골로 296-8
12801281강원특별자치도양양군방축리 종합복지회관방축리강원특별자치도 양양군 강현면 안골로 245
12811282강원특별자치도양양군광석리 종합복지회관광석리강원특별자치도 양양군 강현면 안골로 137
12821283강원특별자치도양양군답리 종합복지회관답리강원특별자치도 양양군 강현면 안골로 56-2
12831284강원특별자치도양양군주청리 종합복지회관주청리강원특별자치도 양양군 강현면 동해대로 3066-11
12841285강원특별자치도양양군전진1리 종합복지회관전진1리강원특별자치도 양양군 강현면 해맞이길 8-14
12851286강원특별자치도양양군전진2리 종합복지회관전진2리강원특별자치도 양양군 강현면 뒷나루2길 23
12861287강원특별자치도양양군용호리 종합복지회관용호리강원특별자치도 양양군 강현면 용호길 74
12871288강원특별자치도양양군정암1리 종합복지회관정암1리강원특별자치도 양양군 강현면 정암1길 40
12881289강원특별자치도양양군정암2리 종합복지회관정암2리강원특별자치도 양양군 강현면 진미로12번길 11