Overview

Dataset statistics

Number of variables: 7
Number of observations: 208
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.9 KiB
Average record size in memory: 58.6 B
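
As a quick sanity check, the average record size should equal the total in-memory size divided by the number of observations. The sketch below assumes the reported 11.9 KiB is a rounded figure and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics in this report.
total_size_kib = 11.9      # rounded value as displayed
n_observations = 208
n_variables = 7

# Convert KiB to bytes, then divide by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

# Total number of cells, the denominator behind "Missing cells (%)".
n_cells = n_observations * n_variables

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"total cells: {n_cells}")
```

The result agrees with the reported 58.6 B to one decimal place.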

Variable types

Numeric: 2
Categorical: 2
Text: 3

Dataset

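Description: Information on the eup, myeon, and dong offices in the cities and counties of Chungcheongnam-do, published as open data with the fields province (시도), city/county/district (시군구), postal code (우편번호), address (주소), and phone number (전화번호).
URL: https://www.data.go.kr/data/15032211/fileData.do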

Alerts

시도 has constant value "충청남도" (Constant)
연번 is highly overall correlated with 시군구 (High correlation)
우편번호 is highly overall correlated with 시군구 (High correlation)
시군구 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 has unique values (Unique)
읍면동 has unique values (Unique)
전화번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 12:00:21.130986
Analysis finished: 2023-12-12 12:00:22.202128
Duration: 1.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
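
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the exact invocation used here: the file paths are placeholders, the CSV encoding is an assumption, and the settings stored in config.json are not reproduced.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so that defining the function needs no third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Profiling report")
    report.to_file(html_path)


# Hypothetical usage; both paths are placeholders:
# build_report("offices.csv", "report.html")
```

Running it requires pandas and ydata-profiling to be installed; results may differ from this report if the library version or configuration differs.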

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 208
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 104.5
Minimum: 1
Maximum: 208
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 11.35
Q1: 52.75
Median: 104.5
Q3: 156.25
95th percentile: 197.65
Maximum: 208
Range: 207
Interquartile range (IQR): 103.5
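
The quantiles above are consistent with linear interpolation between neighbouring ranks. The sketch below assumes 연번 holds the integers 1 through 208, which follows from the minimum, maximum, distinct count, and sum reported for this variable; the `quantile` helper is illustrative and not the library's own code.

```python
import math


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 209))  # assumed contents of 연번: 1, 2, ..., 208

p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1                          # interquartile range
value_range = values[-1] - values[0]   # maximum minus minimum
```

Under that assumption, the computed values match the figures listed in this section.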

Descriptive statistics

Standard deviation: 60.188592
Coefficient of variation (CV): 0.57596739
Kurtosis: -1.2
Mean: 104.5
Median Absolute Deviation (MAD): 52
Skewness: 0
Sum: 21736
Variance: 3622.6667
Monotonicity: Strictly increasing
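
Most of these descriptive statistics can be checked with the Python standard library. The sketch assumes 연번 is the sequence 1 through 208 and uses the sample (n - 1) definitions of variance and standard deviation, which is what the reported figures are consistent with; skewness and kurtosis are left out.

```python
import statistics

values = list(range(1, 209))  # assumed contents of 연번: 1, 2, ..., 208

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)       # sample standard deviation
cv = std_dev / mean                      # coefficient of variation
med = statistics.median(values)
# Median absolute deviation: the median of |x - median|.
mad = statistics.median(abs(x - med) for x in values)
total = sum(values)
# Strictly increasing: every value exceeds its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))
```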
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
1 | 1 | 0.5%
106 | 1 | 0.5%
134 | 1 | 0.5%
135 | 1 | 0.5%
136 | 1 | 0.5%
137 | 1 | 0.5%
138 | 1 | 0.5%
139 | 1 | 0.5%
140 | 1 | 0.5%
141 | 1 | 0.5%
Other values (198) | 198 | 95.2%

Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Value | Count | Frequency (%)
208 | 1 | 0.5%
207 | 1 | 0.5%
206 | 1 | 0.5%
205 | 1 | 0.5%
204 | 1 | 0.5%
203 | 1 | 0.5%
202 | 1 | 0.5%
201 | 1 | 0.5%
200 | 1 | 0.5%
199 | 1 | 0.5%

시도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

충청남도: 208

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충청남도
2nd row: 충청남도
3rd row: 충청남도
4th row: 충청남도
5th row: 충청남도

Common Values

Value | Count | Frequency (%)
충청남도 | 208 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
충청남도 | 208 | 100.0%

시군구
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 7.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

천안시: 31
아산시: 17
공주시: 16
보령시: 16
부여군: 16
Other values (10): 112

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 천안시
2nd row: 천안시
3rd row: 천안시
4th row: 천안시
5th row: 천안시

Common Values

Value | Count | Frequency (%)
천안시 | 31 | 14.9%
아산시 | 17 | 8.2%
공주시 | 16 | 7.7%
보령시 | 16 | 7.7%
부여군 | 16 | 7.7%
서산시 | 15 | 7.2%
논산시 | 15 | 7.2%
당진시 | 14 | 6.7%
서천군 | 13 | 6.2%
예산군 | 12 | 5.8%
Other values (5) | 43 | 20.7%

Length

Histogram of lengths of the category

읍면동
Text

UNIQUE 

Distinct: 208
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 11
Median length: 9
Mean length: 8.5528846
Min length: 5

Characters and Unicode

Total characters: 1779
Distinct characters: 151
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
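
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this report; the helper name `category_counts` is illustrative. The two-letter codes map to the names used in the tables: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Pd` is Dash Punctuation, and `Po` is Other Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (for example 'Lo' or 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample row of 읍면동, as listed in this report.
counts = category_counts("목천읍행정복지센터")
print(counts)
```

Applied to every value in a column and summed, counts of this kind give the per-category totals shown in the tables.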

Unique

Unique: 208
Unique (%): 100.0%

Sample

1st row: 목천읍행정복지센터
2nd row: 풍세면행정복지센터
3rd row: 광덕면행정복지센터
4th row: 북면행정복지센터
5th row: 성남면행정복지센터

Value | Count | Frequency (%)
행정복지센터 | 8 | 3.7%
목천읍행정복지센터 | 1 | 0.5%
구룡면행정복지센터 | 1 | 0.5%
군북면행정복지센터 | 1 | 0.5%
남일면행정복지센터 | 1 | 0.5%
남이면행정복지센터 | 1 | 0.5%
진산면행정복지센터 | 1 | 0.5%
복수면행정복지센터 | 1 | 0.5%
추부면행정복지센터 | 1 | 0.5%
부여읍행정복지센터 | 1 | 0.5%
Other values (199) | 199 | 92.1%

Most occurring characters

Value | Count | Frequency (%)
 | 171 | 9.6%
 | 169 | 9.5%
 | 169 | 9.5%
 | 168 | 9.4%
 | 166 | 9.3%
 | 165 | 9.3%
 | 138 | 7.8%
 | 52 | 2.9%
 | 41 | 2.3%
 | 41 | 2.3%
Other values (141) | 499 | 28.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1744 | 98.0%
Decimal Number | 27 | 1.5%
Space Separator | 8 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 171 | 9.8%
 | 169 | 9.7%
 | 169 | 9.7%
 | 168 | 9.6%
 | 166 | 9.5%
 | 165 | 9.5%
 | 138 | 7.9%
 | 52 | 3.0%
 | 41 | 2.4%
 | 41 | 2.4%
Other values (134) | 464 | 26.6%

Decimal Number
Value | Count | Frequency (%)
2 | 9 | 33.3%
1 | 9 | 33.3%
3 | 4 | 14.8%
4 | 2 | 7.4%
5 | 2 | 7.4%
6 | 1 | 3.7%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1744 | 98.0%
Common | 35 | 2.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 171 | 9.8%
 | 169 | 9.7%
 | 169 | 9.7%
 | 168 | 9.6%
 | 166 | 9.5%
 | 165 | 9.5%
 | 138 | 7.9%
 | 52 | 3.0%
 | 41 | 2.4%
 | 41 | 2.4%
Other values (134) | 464 | 26.6%

Common
Value | Count | Frequency (%)
2 | 9 | 25.7%
1 | 9 | 25.7%
(space) | 8 | 22.9%
3 | 4 | 11.4%
4 | 2 | 5.7%
5 | 2 | 5.7%
6 | 1 | 2.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1744 | 98.0%
ASCII | 35 | 2.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 171 | 9.8%
 | 169 | 9.7%
 | 169 | 9.7%
 | 168 | 9.6%
 | 166 | 9.5%
 | 165 | 9.5%
 | 138 | 7.9%
 | 52 | 3.0%
 | 41 | 2.4%
 | 41 | 2.4%
Other values (134) | 464 | 26.6%

ASCII
Value | Count | Frequency (%)
2 | 9 | 25.7%
1 | 9 | 25.7%
(space) | 8 | 22.9%
3 | 4 | 11.4%
4 | 2 | 5.7%
5 | 2 | 5.7%
6 | 1 | 2.9%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 206
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32405.096
Minimum: 31015
Maximum: 35718
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 31015
5th percentile: 31138.75
Q1: 31750.75
Median: 32445
Q3: 33169
95th percentile: 33613.95
Maximum: 35718
Range: 4703
Interquartile range (IQR): 1418.25

Descriptive statistics

Standard deviation: 836.80948
Coefficient of variation (CV): 0.025823391
Kurtosis: -0.25125082
Mean: 32405.096
Median Absolute Deviation (MAD): 708
Skewness: 0.15970296
Sum: 6740260
Variance: 700250.11
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
32954 | 2 | 1.0%
31125 | 2 | 1.0%
31231 | 1 | 0.5%
33200 | 1 | 0.5%
32751 | 1 | 0.5%
32704 | 1 | 0.5%
32703 | 1 | 0.5%
32712 | 1 | 0.5%
33167 | 1 | 0.5%
33123 | 1 | 0.5%
Other values (196) | 196 | 94.2%

Value | Count | Frequency (%)
31015 | 1 | 0.5%
31037 | 1 | 0.5%
31045 | 1 | 0.5%
31055 | 1 | 0.5%
31080 | 1 | 0.5%
31100 | 1 | 0.5%
31125 | 2 | 1.0%
31128 | 1 | 0.5%
31131 | 1 | 0.5%
31137 | 1 | 0.5%

Value | Count | Frequency (%)
35718 | 1 | 0.5%
33673 | 1 | 0.5%
33654 | 1 | 0.5%
33643 | 1 | 0.5%
33630 | 1 | 0.5%
33628 | 1 | 0.5%
33624 | 1 | 0.5%
33622 | 1 | 0.5%
33620 | 1 | 0.5%
33617 | 1 | 0.5%

주소
Text

Distinct: 207
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 27
Median length: 25
Mean length: 20.403846
Min length: 14

Characters and Unicode

Total characters: 4244
Distinct characters: 211
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 206
Unique (%): 99.0%
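
The gap between the distinct count (207) and the unique count (206) is consistent with one address appearing twice: distinct counts every different value once, while unique counts only values that occur exactly once. A minimal sketch of that distinction on toy data; the helper name `distinct_and_unique` is illustrative, not the library's API.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy data: "b" appears twice, so it is distinct but not unique.
distinct, unique = distinct_and_unique(["a", "b", "b", "c"])
print(distinct, unique)
```

With 208 rows, 206 values seen once plus one value seen twice gives 207 distinct values, matching the figures for 주소.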

Sample

1st row: 충청남도 천안시 동남구 목천읍 서리1길 41-7
2nd row: 충청남도 천안시 동남구 풍세면 상정1길 3
3rd row: 충청남도 천안시 동남구 광덕면 신흥리3길 33
4th row: 충청남도 천안시 동남구 북면 위례성로 724
5th row: 충청남도 천안시 동남구 성남면 신사대화로 149

Value | Count | Frequency (%)
충청남도 | 208 | 20.3%
천안시 | 31 | 3.0%
동남구 | 17 | 1.7%
부여군 | 16 | 1.6%
아산시 | 16 | 1.6%
공주시 | 16 | 1.6%
보령시 | 16 | 1.6%
서북구 | 15 | 1.5%
논산시 | 15 | 1.5%
서산시 | 15 | 1.5%
Other values (518) | 661 | 64.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 824 | 19.4%
 | 240 | 5.7%
 | 227 | 5.3%
 | 215 | 5.1%
 | 214 | 5.0%
 | 151 | 3.6%
 | 144 | 3.4%
 | 135 | 3.2%
1 | 122 | 2.9%
 | 111 | 2.6%
Other values (201) | 1861 | 43.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2780 | 65.5%
Space Separator | 824 | 19.4%
Decimal Number | 612 | 14.4%
Dash Punctuation | 27 | 0.6%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 240 | 8.6%
 | 227 | 8.2%
 | 215 | 7.7%
 | 214 | 7.7%
 | 151 | 5.4%
 | 144 | 5.2%
 | 135 | 4.9%
 | 111 | 4.0%
 | 81 | 2.9%
 | 73 | 2.6%
Other values (188) | 1189 | 42.8%

Decimal Number
Value | Count | Frequency (%)
1 | 122 | 19.9%
3 | 74 | 12.1%
2 | 72 | 11.8%
4 | 62 | 10.1%
5 | 58 | 9.5%
7 | 56 | 9.2%
9 | 49 | 8.0%
6 | 47 | 7.7%
8 | 41 | 6.7%
0 | 31 | 5.1%

Space Separator
Value | Count | Frequency (%)
(space) | 824 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 27 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2780 | 65.5%
Common | 1464 | 34.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 240 | 8.6%
 | 227 | 8.2%
 | 215 | 7.7%
 | 214 | 7.7%
 | 151 | 5.4%
 | 144 | 5.2%
 | 135 | 4.9%
 | 111 | 4.0%
 | 81 | 2.9%
 | 73 | 2.6%
Other values (188) | 1189 | 42.8%

Common
Value | Count | Frequency (%)
(space) | 824 | 56.3%
1 | 122 | 8.3%
3 | 74 | 5.1%
2 | 72 | 4.9%
4 | 62 | 4.2%
5 | 58 | 4.0%
7 | 56 | 3.8%
9 | 49 | 3.3%
6 | 47 | 3.2%
8 | 41 | 2.8%
Other values (3) | 59 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2780 | 65.5%
ASCII | 1464 | 34.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 824 | 56.3%
1 | 122 | 8.3%
3 | 74 | 5.1%
2 | 72 | 4.9%
4 | 62 | 4.2%
5 | 58 | 4.0%
7 | 56 | 3.8%
9 | 49 | 3.3%
6 | 47 | 3.2%
8 | 41 | 2.8%
Other values (3) | 59 | 4.0%

Hangul
Value | Count | Frequency (%)
 | 240 | 8.6%
 | 227 | 8.2%
 | 215 | 7.7%
 | 214 | 7.7%
 | 151 | 5.4%
 | 144 | 5.2%
 | 135 | 4.9%
 | 111 | 4.0%
 | 81 | 2.9%
 | 73 | 2.6%
Other values (188) | 1189 | 42.8%

전화번호
Text

UNIQUE 

Distinct: 208
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 2496
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 208
Unique (%): 100.0%

Sample

1st row: 041-521-4651
2nd row: 041-521-4681
3rd row: 041-521-4701
4th row: 041-521-4729
5th row: 041-521-4749

Value | Count | Frequency (%)
041-521-4651 | 1 | 0.5%
041-521-4681 | 1 | 0.5%
041-830-6453 | 1 | 0.5%
041-750-8453 | 1 | 0.5%
041-750-8506 | 1 | 0.5%
041-750-8553 | 1 | 0.5%
041-750-8605 | 1 | 0.5%
041-750-8653 | 1 | 0.5%
041-750-3107 | 1 | 0.5%
041-830-6302 | 1 | 0.5%
Other values (198) | 198 | 95.2%

Most occurring characters

Value | Count | Frequency (%)
- | 416 | 16.7%
0 | 398 | 15.9%
4 | 365 | 14.6%
1 | 305 | 12.2%
3 | 213 | 8.5%
6 | 193 | 7.7%
5 | 152 | 6.1%
8 | 137 | 5.5%
2 | 117 | 4.7%
7 | 103 | 4.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2080 | 83.3%
Dash Punctuation | 416 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 398 | 19.1%
4 | 365 | 17.5%
1 | 305 | 14.7%
3 | 213 | 10.2%
6 | 193 | 9.3%
5 | 152 | 7.3%
8 | 137 | 6.6%
2 | 117 | 5.6%
7 | 103 | 5.0%
9 | 97 | 4.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 416 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2496 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 416 | 16.7%
0 | 398 | 15.9%
4 | 365 | 14.6%
1 | 305 | 12.2%
3 | 213 | 8.5%
6 | 193 | 7.7%
5 | 152 | 6.1%
8 | 137 | 5.5%
2 | 117 | 4.7%
7 | 103 | 4.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2496 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 416 | 16.7%
0 | 398 | 15.9%
4 | 365 | 14.6%
1 | 305 | 12.2%
3 | 213 | 8.5%
6 | 193 | 7.7%
5 | 152 | 6.1%
8 | 137 | 5.5%
2 | 117 | 4.7%
7 | 103 | 4.1%

Interactions

Correlations

Variable | 연번 | 시군구 | 우편번호
연번 | 1.000 | 0.979 | 0.803
시군구 | 0.979 | 1.000 | 0.957
우편번호 | 0.803 | 0.957 | 1.000

Variable | 연번 | 우편번호 | 시군구
연번 | 1.000 | 0.440 | 0.839
우편번호 | 0.440 | 1.000 | 0.826
시군구 | 0.839 | 0.826 | 1.000
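
The report does not say which coefficient each matrix uses. Purely as an illustration of how two numeric columns such as 연번 and 우편번호 can be compared, the sketch below computes Pearson and Spearman (rank) correlation with the standard library. The helper names are illustrative, and the input is only the ten leading rows shown in the Sample section, far too few to reproduce the full-column figures above.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of ties that starts at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# 연번 and 우편번호 from the first ten sample rows of this report.
serial = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
postal = [31231, 31217, 31221, 31238, 31245, 31251, 31254, 31258, 31131, 31128]
rho = spearman(serial, postal)
r = pearson(serial, postal)
```

Categorical pairs such as 시군구 against a numeric column need a different association measure, which this sketch does not cover.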

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Index | 연번 | 시도 | 시군구 | 읍면동 | 우편번호 | 주소 | 전화번호
0 | 1 | 충청남도 | 천안시 | 목천읍행정복지센터 | 31231 | 충청남도 천안시 동남구 목천읍 서리1길 41-7 | 041-521-4651
1 | 2 | 충청남도 | 천안시 | 풍세면행정복지센터 | 31217 | 충청남도 천안시 동남구 풍세면 상정1길 3 | 041-521-4681
2 | 3 | 충청남도 | 천안시 | 광덕면행정복지센터 | 31221 | 충청남도 천안시 동남구 광덕면 신흥리3길 33 | 041-521-4701
3 | 4 | 충청남도 | 천안시 | 북면행정복지센터 | 31238 | 충청남도 천안시 동남구 북면 위례성로 724 | 041-521-4729
4 | 5 | 충청남도 | 천안시 | 성남면행정복지센터 | 31245 | 충청남도 천안시 동남구 성남면 신사대화로 149 | 041-521-4749
5 | 6 | 충청남도 | 천안시 | 수신면행정복지센터 | 31251 | 충청남도 천안시 동남구 수신면 수신로 431 | 041-521-4766
6 | 7 | 충청남도 | 천안시 | 병천면행정복지센터 | 31254 | 충청남도 천안시 동남구 병천면 병천2로 57 | 041-521-4781
7 | 8 | 충청남도 | 천안시 | 동면행정복지센터 | 31258 | 충청남도 천안시 동남구 동면 동산1길 15 | 041-521-4809
8 | 9 | 충청남도 | 천안시 | 중앙동행정복지센터 | 31131 | 충청남도 천안시 동남구 원성천1길 17 | 041-521-4821
9 | 10 | 충청남도 | 천안시 | 문성동행정복지센터 | 31128 | 충청남도 천안시 동남구 문화로 15 | 041-521-4840

Index | 연번 | 시도 | 시군구 | 읍면동 | 우편번호 | 주소 | 전화번호
198 | 199 | 충청남도 | 예산군 | 신암면행정복지센터 | 32418 | 충청남도 예산군 신암면 종경길 70 | 041-339-8844
199 | 200 | 충청남도 | 예산군 | 오가면행정복지센터 | 32425 | 충청남도 예산군 오가면 오가중앙로 86-12 | 041-339-8882
200 | 201 | 충청남도 | 태안군 | 태안읍행정복지센터 | 32141 | 충청남도 태안군 태안읍 백화로 54 | 041-670-5502
201 | 202 | 충청남도 | 태안군 | 안면읍행정복지센터 | 32164 | 충청남도 태안군 안면읍 장터로 149 | 041-670-5560
202 | 203 | 충청남도 | 태안군 | 고남면사무소 | 32172 | 충청남도 태안군 고남면 안면대로 4254-12 | 041-670-5593
203 | 204 | 충청남도 | 태안군 | 남면사무소 | 32154 | 충청남도 태안군 남면 달산포로 311 | 041-670-5135
204 | 205 | 충청남도 | 태안군 | 근흥면사무소 | 32129 | 충청남도 태안군 근흥면 근흥로 724 | 041-670-5656
205 | 206 | 충청남도 | 태안군 | 소원면행정복지센터 | 32120 | 충청남도 태안군 소원면 소근로 26-11 | 041-670-5685
206 | 207 | 충청남도 | 태안군 | 원북면사무소 | 32109 | 충청남도 태안군 원북면 상리길 11 | 041-670-5713
207 | 208 | 충청남도 | 태안군 | 이원면사무소 | 32103 | 충청남도 태안군 이원면 분지길 14 | 041-670-5745