Overview

Dataset statistics

Number of variables5
Number of observations159
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory41.8 B

Variable types

Categorical1
Text3
Numeric1

Dataset

Description충청북도에 소재하고 있는 보건진료소에 대한 현황 정보를 제공합니다. (시군, 진료소명, 전화번화, 우편번호, 주소)
Author충청북도
URLhttps://www.data.go.kr/data/15071084/fileData.do

Alerts

우편번호 is highly overall correlated with 시군High correlation
시군 is highly overall correlated with 우편번호High correlation
주소 has unique valuesUnique

Reproduction

Analysis started2024-03-14 18:59:29.568238
Analysis finished2024-03-14 18:59:30.671813
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
충청북도 청주시
25 
충청북도 음성군
19 
충청북도 영동군
17 
충청북도 괴산군
17 
충청북도 충주시
16 
Other values (6)
65 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row충청북도 청주시
2nd row충청북도 청주시
3rd row충청북도 청주시
4th row충청북도 청주시
5th row충청북도 청주시

Common Values

ValueCountFrequency (%)
충청북도 청주시 25
15.7%
충청북도 음성군 19
11.9%
충청북도 영동군 17
10.7%
충청북도 괴산군 17
10.7%
충청북도 충주시 16
10.1%
충청북도 옥천군 16
10.1%
충청북도 보은군 15
9.4%
충청북도 단양군 15
9.4%
충청북도 제천시 11
6.9%
충청북도 진천군 7
 
4.4%

Length

2024-03-15T03:59:30.885541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
충청북도 159
50.0%
청주시 25
 
7.9%
음성군 19
 
6.0%
영동군 17
 
5.3%
괴산군 17
 
5.3%
충주시 16
 
5.0%
옥천군 16
 
5.0%
보은군 15
 
4.7%
단양군 15
 
4.7%
제천시 11
 
3.5%
Other values (2) 8
 
2.5%
Distinct156
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-15T03:59:32.472560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length2.0188679
Min length1

Characters and Unicode

Total characters321
Distinct characters127
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique153 ?
Unique (%)96.2%

Sample

1st row갈산
2nd row금관
3rd row운교
4th row용곡
5th row기암
ValueCountFrequency (%)
대전 2
 
1.3%
신대 2
 
1.3%
신월 2
 
1.3%
의풍 1
 
0.6%
용산 1
 
0.6%
기호 1
 
0.6%
사석 1
 
0.6%
운곡 1
 
0.6%
갈산 1
 
0.6%
신기 1
 
0.6%
Other values (146) 146
91.8%
2024-03-15T03:59:35.125276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
4.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (117) 238
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 321
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (117) 238
74.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 321
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (117) 238
74.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 321
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
4.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (117) 238
74.1%
Distinct158
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-15T03:59:36.591477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1908
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique157 ?
Unique (%)98.7%

Sample

1st row043-201-3682
2nd row043-201-3683
3rd row043-201-3686
4th row043-201-3685
5th row043-201-3684
ValueCountFrequency (%)
043-542-9515 2
 
1.3%
043-539-7497 1
 
0.6%
043-833-7184 1
 
0.6%
043-832-3716 1
 
0.6%
043-539-7492 1
 
0.6%
043-539-7493 1
 
0.6%
043-838-3001 1
 
0.6%
043-539-7495 1
 
0.6%
043-539-7496 1
 
0.6%
043-835-3897 1
 
0.6%
Other values (148) 148
93.1%
2024-03-15T03:59:38.340089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 318
16.7%
3 286
15.0%
0 279
14.6%
4 266
13.9%
2 171
9.0%
8 126
 
6.6%
7 115
 
6.0%
1 112
 
5.9%
5 89
 
4.7%
9 81
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1590
83.3%
Dash Punctuation 318
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 286
18.0%
0 279
17.5%
4 266
16.7%
2 171
10.8%
8 126
7.9%
7 115
7.2%
1 112
 
7.0%
5 89
 
5.6%
9 81
 
5.1%
6 65
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 318
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1908
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 318
16.7%
3 286
15.0%
0 279
14.6%
4 266
13.9%
2 171
9.0%
8 126
 
6.6%
7 115
 
6.0%
1 112
 
5.9%
5 89
 
4.7%
9 81
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1908
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 318
16.7%
3 286
15.0%
0 279
14.6%
4 266
13.9%
2 171
9.0%
8 126
 
6.6%
7 115
 
6.0%
1 112
 
5.9%
5 89
 
4.7%
9 81
 
4.2%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct158
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28089.962
Minimum27000
Maximum29198
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-03-15T03:59:38.805187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum27000
5-th percentile27021.9
Q127492.5
median28045
Q328920
95-th percentile29126.6
Maximum29198
Range2198
Interquartile range (IQR)1427.5

Descriptive statistics

Standard deviation718.51032
Coefficient of variation (CV)0.025578899
Kurtosis-1.2740764
Mean28089.962
Median Absolute Deviation (MAD)724
Skewness0.13299747
Sum4466304
Variance516257.07
MonotonicityNot monotonic
2024-03-15T03:59:39.270131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
28000 2
 
1.3%
28192 1
 
0.6%
28004 1
 
0.6%
27873 1
 
0.6%
27847 1
 
0.6%
27861 1
 
0.6%
27863 1
 
0.6%
27821 1
 
0.6%
27816 1
 
0.6%
28022 1
 
0.6%
Other values (148) 148
93.1%
ValueCountFrequency (%)
27000 1
0.6%
27002 1
0.6%
27008 1
0.6%
27015 1
0.6%
27016 1
0.6%
27017 1
0.6%
27019 1
0.6%
27021 1
0.6%
27022 1
0.6%
27023 1
0.6%
ValueCountFrequency (%)
29198 1
0.6%
29167 1
0.6%
29164 1
0.6%
29162 1
0.6%
29160 1
0.6%
29159 1
0.6%
29155 1
0.6%
29132 1
0.6%
29126 1
0.6%
29125 1
0.6%

주소
Text

UNIQUE 

Distinct159
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-15T03:59:40.862094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length28
Mean length22.075472
Min length18

Characters and Unicode

Total characters3510
Distinct characters204
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)100.0%

Sample

1st row충청북도 청주시 상당구 낭성면 갈산 무성길 5
2nd row충청북도 청주시 상당구 미원면 운암 계원로 615
3rd row충청북도 청주시 상당구 미원면 쌍이 운교로 585
4th row충청북도 청주시 상당구 미원면 미원 초정로 472
5th row충청북도 청주시 상당구 미원면 터기암길 6-42
ValueCountFrequency (%)
충청북도 159
 
19.1%
청주시 25
 
3.0%
음성군 19
 
2.3%
영동군 17
 
2.0%
괴산군 17
 
2.0%
옥천군 16
 
1.9%
충주시 16
 
1.9%
보은군 15
 
1.8%
단양군 15
 
1.8%
제천시 11
 
1.3%
Other values (389) 522
62.7%
2024-03-15T03:59:42.651930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
695
19.8%
205
 
5.8%
177
 
5.0%
169
 
4.8%
162
 
4.6%
136
 
3.9%
111
 
3.2%
1 100
 
2.8%
95
 
2.7%
89
 
2.5%
Other values (194) 1571
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2281
65.0%
Space Separator 695
 
19.8%
Decimal Number 493
 
14.0%
Dash Punctuation 37
 
1.1%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
205
 
9.0%
177
 
7.8%
169
 
7.4%
162
 
7.1%
136
 
6.0%
111
 
4.9%
95
 
4.2%
89
 
3.9%
58
 
2.5%
53
 
2.3%
Other values (180) 1026
45.0%
Decimal Number
ValueCountFrequency (%)
1 100
20.3%
2 79
16.0%
3 55
11.2%
5 48
9.7%
4 42
8.5%
7 42
8.5%
6 40
 
8.1%
9 33
 
6.7%
0 31
 
6.3%
8 23
 
4.7%
Space Separator
ValueCountFrequency (%)
695
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2281
65.0%
Common 1229
35.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
205
 
9.0%
177
 
7.8%
169
 
7.4%
162
 
7.1%
136
 
6.0%
111
 
4.9%
95
 
4.2%
89
 
3.9%
58
 
2.5%
53
 
2.3%
Other values (180) 1026
45.0%
Common
ValueCountFrequency (%)
695
56.6%
1 100
 
8.1%
2 79
 
6.4%
3 55
 
4.5%
5 48
 
3.9%
4 42
 
3.4%
7 42
 
3.4%
6 40
 
3.3%
- 37
 
3.0%
9 33
 
2.7%
Other values (4) 58
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2281
65.0%
ASCII 1229
35.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
695
56.6%
1 100
 
8.1%
2 79
 
6.4%
3 55
 
4.5%
5 48
 
3.9%
4 42
 
3.4%
7 42
 
3.4%
6 40
 
3.3%
- 37
 
3.0%
9 33
 
2.7%
Other values (4) 58
 
4.7%
Hangul
ValueCountFrequency (%)
205
 
9.0%
177
 
7.8%
169
 
7.4%
162
 
7.1%
136
 
6.0%
111
 
4.9%
95
 
4.2%
89
 
3.9%
58
 
2.5%
53
 
2.3%
Other values (180) 1026
45.0%

Interactions

2024-03-15T03:59:29.922232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T03:59:42.922843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군우편번호
시군1.0000.943
우편번호0.9431.000
2024-03-15T03:59:43.154060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호시군
우편번호1.0000.813
시군0.8131.000

Missing values

2024-03-15T03:59:30.234433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T03:59:30.550209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군진료소명전화번호우편번호주소
0충청북도 청주시갈산043-201-368228192충청북도 청주시 상당구 낭성면 갈산 무성길 5
1충청북도 청주시금관043-201-368328201충청북도 청주시 상당구 미원면 운암 계원로 615
2충청북도 청주시운교043-201-368628196충청북도 청주시 상당구 미원면 쌍이 운교로 585
3충청북도 청주시용곡043-201-368528195충청북도 청주시 상당구 미원면 미원 초정로 472
4충청북도 청주시기암043-201-368428198충청북도 청주시 상당구 미원면 터기암길 6-42
5충청북도 청주시행정043-201-368728203충청북도 청주시 상당구 가덕면 상장 인차로 295-7
6충청북도 청주시상야043-201-368828189충청북도 청주시 상당구 가덕면 상야길 30
7충청북도 청주시병암043-201-368928205충청북도 청주시 상당구 가덕면 병암3길 11-6
8충청북도 청주시소전043-201-369128210충청북도 청주시 상당구 문의면 소전길 1
9충청북도 청주시두모043-201-369028206충청북도 청주시 상당구 문의면 두모1길 13-20
시군진료소명전화번호우편번호주소
149충청북도 단양군대대043-422-847527023충청북도 단양군 가곡면 새밭로 469-17
150충청북도 단양군덕문곡043-423-620327016충청북도 단양군 어상천면 연곡심곡로 166
151충청북도 단양군별방043-422-878727017충청북도 단양군 영춘면 별방창원로 439-5
152충청북도 단양군보발043-422-818927022충청북도 단양군 가곡면 보발본동1길 12
153충청북도 단양군석교043-423-660727015충청북도 단양군 어상천면 석교1길 28-3
154충청북도 단양군여천043-422-821327008충청북도 단양군 가곡면 여천1길 20
155충청북도 단양군의풍043-422-686727019충청북도 단양군 영춘면 영단로1671-2
156충청북도 단양군장정043-422-124527029충청북도 단양군 대강면 장정길 11
157충청북도 단양군황정043-542-951527028충청북도 단양군 대강면 황정1길 36-6
158충청북도 단양군가평043-422-812227002충청북도 단양군 매포읍 가평10길 16