Overview

Dataset statistics

Number of variables8
Number of observations167
Missing cells9
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.7 KiB
Average record size in memory65.8 B

Variable types

Categorical2
Text3
DateTime2
Numeric1

Dataset

Description경기도 하남시_경로당현황에 대한 데이터로 시군명, 사업장명, 소재지주소, 전화번호, 인허가일자, 영업상태명, 입소정원 등의 항목을 제공합니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/3047282/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
영업상태명 is highly imbalanced (90.6%)Imbalance
전화번호 has 9 (5.4%) missing valuesMissing
사업장명 has unique valuesUnique
입소정원 has 2 (1.2%) zerosZeros

Reproduction

Analysis started2024-04-17 09:10:56.520664
Analysis finished2024-04-17 09:10:56.997317
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
경기도 하남시
167 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 하남시
2nd row경기도 하남시
3rd row경기도 하남시
4th row경기도 하남시
5th row경기도 하남시

Common Values

ValueCountFrequency (%)
경기도 하남시 167
100.0%

Length

2024-04-17T18:10:57.060214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:10:57.142317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 167
50.0%
하남시 167
50.0%

사업장명
Text

UNIQUE 

Distinct167
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-17T18:10:57.334191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length6.245509
Min length2

Characters and Unicode

Total characters1043
Distinct characters202
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)100.0%

Sample

1st row검단
2nd row동수
3rd row봉학
4th row산곡
5th row새능
ValueCountFrequency (%)
힐즈파크 2
 
1.2%
센트럴풍경채@(33단지 1
 
0.6%
센트럴자이@(21단지 1
 
0.6%
동원로얄듀크@(22단지 1
 
0.6%
더샵센트럴포레@(23단지 1
 
0.6%
브라운스톤@(24단지 1
 
0.6%
아란티움@(25단지 1
 
0.6%
미사강변26단지 1
 
0.6%
이편한세상@(27단지 1
 
0.6%
신안인스빌@(32단지 1
 
0.6%
Other values (160) 160
93.6%
2024-04-17T18:10:57.660485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
@ 91
 
8.7%
60
 
5.8%
54
 
5.2%
28
 
2.7%
26
 
2.5%
1 26
 
2.5%
23
 
2.2%
22
 
2.1%
2 20
 
1.9%
19
 
1.8%
Other values (192) 674
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 808
77.5%
Decimal Number 97
 
9.3%
Other Punctuation 93
 
8.9%
Close Punctuation 18
 
1.7%
Open Punctuation 18
 
1.7%
Space Separator 4
 
0.4%
Uppercase Letter 4
 
0.4%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
7.4%
54
 
6.7%
28
 
3.5%
26
 
3.2%
23
 
2.8%
22
 
2.7%
19
 
2.4%
17
 
2.1%
17
 
2.1%
17
 
2.1%
Other values (174) 525
65.0%
Decimal Number
ValueCountFrequency (%)
1 26
26.8%
2 20
20.6%
3 16
16.5%
4 6
 
6.2%
6 6
 
6.2%
7 6
 
6.2%
5 5
 
5.2%
8 5
 
5.2%
0 4
 
4.1%
9 3
 
3.1%
Other Punctuation
ValueCountFrequency (%)
@ 91
97.8%
. 2
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
C 3
75.0%
K 1
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Lowercase Letter
ValueCountFrequency (%)
s 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 808
77.5%
Common 230
 
22.1%
Latin 5
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
7.4%
54
 
6.7%
28
 
3.5%
26
 
3.2%
23
 
2.8%
22
 
2.7%
19
 
2.4%
17
 
2.1%
17
 
2.1%
17
 
2.1%
Other values (174) 525
65.0%
Common
ValueCountFrequency (%)
@ 91
39.6%
1 26
 
11.3%
2 20
 
8.7%
) 18
 
7.8%
( 18
 
7.8%
3 16
 
7.0%
4 6
 
2.6%
6 6
 
2.6%
7 6
 
2.6%
5 5
 
2.2%
Other values (5) 18
 
7.8%
Latin
ValueCountFrequency (%)
C 3
60.0%
s 1
 
20.0%
K 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 808
77.5%
ASCII 235
 
22.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
@ 91
38.7%
1 26
 
11.1%
2 20
 
8.5%
) 18
 
7.7%
( 18
 
7.7%
3 16
 
6.8%
4 6
 
2.6%
6 6
 
2.6%
7 6
 
2.6%
5 5
 
2.1%
Other values (8) 23
 
9.8%
Hangul
ValueCountFrequency (%)
60
 
7.4%
54
 
6.7%
28
 
3.5%
26
 
3.2%
23
 
2.8%
22
 
2.7%
19
 
2.4%
17
 
2.1%
17
 
2.1%
17
 
2.1%
Other values (174) 525
65.0%
Distinct159
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-17T18:10:57.925307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length17.532934
Min length14

Characters and Unicode

Total characters2928
Distinct characters82
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)90.4%

Sample

1st row경기도 하남시 하남대로302번길 7
2nd row경기도 하남시 산곡로 16
3rd row경기도 하남시 하남대로 564
4th row경기도 하남시 하남대로186번길 9
5th row경기도 하남시 산곡동로 94-13
ValueCountFrequency (%)
경기도 167
24.2%
하남시 167
24.2%
하남대로 9
 
1.3%
감일순환로 6
 
0.9%
30 6
 
0.9%
50 6
 
0.9%
미사강변서로 6
 
0.9%
미사강변동로 5
 
0.7%
미사강변한강로 5
 
0.7%
미사강변대로 5
 
0.7%
Other values (206) 307
44.6%
2024-04-17T18:10:58.585037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
523
17.9%
198
 
6.8%
194
 
6.6%
167
 
5.7%
167
 
5.7%
167
 
5.7%
167
 
5.7%
164
 
5.6%
1 122
 
4.2%
5 74
 
2.5%
Other values (72) 985
33.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1812
61.9%
Decimal Number 568
 
19.4%
Space Separator 523
 
17.9%
Dash Punctuation 25
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
198
10.9%
194
10.7%
167
 
9.2%
167
 
9.2%
167
 
9.2%
167
 
9.2%
164
 
9.1%
61
 
3.4%
55
 
3.0%
39
 
2.2%
Other values (60) 433
23.9%
Decimal Number
ValueCountFrequency (%)
1 122
21.5%
5 74
13.0%
0 69
12.1%
2 58
10.2%
6 48
 
8.5%
9 48
 
8.5%
7 41
 
7.2%
4 37
 
6.5%
8 36
 
6.3%
3 35
 
6.2%
Space Separator
ValueCountFrequency (%)
523
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1812
61.9%
Common 1116
38.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
198
10.9%
194
10.7%
167
 
9.2%
167
 
9.2%
167
 
9.2%
167
 
9.2%
164
 
9.1%
61
 
3.4%
55
 
3.0%
39
 
2.2%
Other values (60) 433
23.9%
Common
ValueCountFrequency (%)
523
46.9%
1 122
 
10.9%
5 74
 
6.6%
0 69
 
6.2%
2 58
 
5.2%
6 48
 
4.3%
9 48
 
4.3%
7 41
 
3.7%
4 37
 
3.3%
8 36
 
3.2%
Other values (2) 60
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1812
61.9%
ASCII 1116
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
523
46.9%
1 122
 
10.9%
5 74
 
6.6%
0 69
 
6.2%
2 58
 
5.2%
6 48
 
4.3%
9 48
 
4.3%
7 41
 
3.7%
4 37
 
3.3%
8 36
 
3.2%
Other values (2) 60
 
5.4%
Hangul
ValueCountFrequency (%)
198
10.9%
194
10.7%
167
 
9.2%
167
 
9.2%
167
 
9.2%
167
 
9.2%
164
 
9.1%
61
 
3.4%
55
 
3.0%
39
 
2.2%
Other values (60) 433
23.9%

전화번호
Text

MISSING 

Distinct157
Distinct (%)99.4%
Missing9
Missing (%)5.4%
Memory size1.4 KiB
2024-04-17T18:10:58.787322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.867089
Min length11

Characters and Unicode

Total characters1875
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique156 ?
Unique (%)98.7%

Sample

1st row031-793-3770
2nd row031-793-8185
3rd row031-792-5070
4th row031-793-8606
5th row031-793-0016
ValueCountFrequency (%)
070-8987-6042 2
 
1.3%
031-793-3770 1
 
0.6%
031-795-1752 1
 
0.6%
02-427-6845 1
 
0.6%
031-793-5688 1
 
0.6%
031-791-5233 1
 
0.6%
031-793-0618 1
 
0.6%
031-792-4445 1
 
0.6%
031-792-7458 1
 
0.6%
031-794-9646 1
 
0.6%
Other values (147) 147
93.0%
2024-04-17T18:10:59.109782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 316
16.9%
0 251
13.4%
3 228
12.2%
1 210
11.2%
7 208
11.1%
9 185
9.9%
4 121
 
6.5%
2 117
 
6.2%
5 89
 
4.7%
6 79
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1559
83.1%
Dash Punctuation 316
 
16.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 251
16.1%
3 228
14.6%
1 210
13.5%
7 208
13.3%
9 185
11.9%
4 121
7.8%
2 117
7.5%
5 89
 
5.7%
6 79
 
5.1%
8 71
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 316
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1875
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 316
16.9%
0 251
13.4%
3 228
12.2%
1 210
11.2%
7 208
11.1%
9 185
9.9%
4 121
 
6.5%
2 117
 
6.2%
5 89
 
4.7%
6 79
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1875
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 316
16.9%
0 251
13.4%
3 228
12.2%
1 210
11.2%
7 208
11.1%
9 185
9.9%
4 121
 
6.5%
2 117
 
6.2%
5 89
 
4.7%
6 79
 
4.2%
Distinct115
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum1989-07-12 00:00:00
Maximum2023-05-16 00:00:00
2024-04-17T18:10:59.240110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:10:59.356620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
운영중
165 
미운영
 
2

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운영중
2nd row운영중
3rd row운영중
4th row운영중
5th row운영중

Common Values

ValueCountFrequency (%)
운영중 165
98.8%
미운영 2
 
1.2%

Length

2024-04-17T18:10:59.464102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:10:59.545895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영중 165
98.8%
미운영 2
 
1.2%

입소정원
Real number (ℝ)

ZEROS 

Distinct47
Distinct (%)28.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.317365
Minimum0
Maximum170
Zeros2
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-04-17T18:10:59.639727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16.3
Q123
median30
Q336
95-th percentile56.5
Maximum170
Range170
Interquartile range (IQR)13

Descriptive statistics

Standard deviation17.334446
Coefficient of variation (CV)0.53638178
Kurtosis26.838057
Mean32.317365
Median Absolute Deviation (MAD)7
Skewness4.0496412
Sum5397
Variance300.48301
MonotonicityNot monotonic
2024-04-17T18:10:59.751696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
30 14
 
8.4%
20 10
 
6.0%
23 9
 
5.4%
28 8
 
4.8%
25 8
 
4.8%
22 8
 
4.8%
32 7
 
4.2%
35 7
 
4.2%
24 7
 
4.2%
21 7
 
4.2%
Other values (37) 82
49.1%
ValueCountFrequency (%)
0 2
 
1.2%
10 1
 
0.6%
14 1
 
0.6%
15 4
 
2.4%
16 1
 
0.6%
17 1
 
0.6%
18 1
 
0.6%
19 1
 
0.6%
20 10
6.0%
21 7
4.2%
ValueCountFrequency (%)
170 1
0.6%
119 1
0.6%
75 1
0.6%
74 2
1.2%
61 1
0.6%
60 1
0.6%
58 2
1.2%
53 2
1.2%
52 1
0.6%
51 2
1.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum2023-08-11 00:00:00
Maximum2023-08-11 00:00:00
2024-04-17T18:10:59.847081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:10:59.921391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-17T18:10:56.753021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T18:10:59.975953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업상태명입소정원
영업상태명1.0000.400
입소정원0.4001.000
2024-04-17T18:11:00.053616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입소정원영업상태명
입소정원1.0000.394
영업상태명0.3941.000

Missing values

2024-04-17T18:10:56.851784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T18:10:56.954946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명사업장명소재지도로명주소전화번호인허가일자영업상태명입소정원데이터기준일자
0경기도 하남시검단경기도 하남시 하남대로302번길 7031-793-37701989-07-12운영중402023-08-11
1경기도 하남시동수경기도 하남시 산곡로 16031-793-81851989-07-12운영중482023-08-11
2경기도 하남시봉학경기도 하남시 하남대로 564031-792-50701999-08-17운영중292023-08-11
3경기도 하남시산곡경기도 하남시 하남대로186번길 9031-793-86061993-07-08운영중512023-08-11
4경기도 하남시새능경기도 하남시 산곡동로 94-13031-793-00161989-07-12운영중392023-08-11
5경기도 하남시양곡경기도 하남시 하산곡동 181-2031-796-30291989-07-12운영중372023-08-11
6경기도 하남시양곡부녀경기도 하남시 하산곡동 181-2031-792-49361992-05-28운영중252023-08-11
7경기도 하남시천현경기도 하남시 샘재로 91-14031-796-77011989-07-12운영중342023-08-11
8경기도 하남시창우6통경기도 하남시 검단산로 290-7031-795-48872004-04-07운영중202023-08-11
9경기도 하남시창우7통경기도 하남시 검단산로146번길 8031-792-55302005-02-16운영중302023-08-11
시군명사업장명소재지도로명주소전화번호인허가일자영업상태명입소정원데이터기준일자
157경기도 하남시감일스윗시티7단지@경기도 하남시 감일순환로 14002-474-10082022-06-13운영중242023-08-11
158경기도 하남시포웰시티푸르지오라포레@경기도 하남시 감일백제로 20<NA>2022-06-29운영중372023-08-11
159경기도 하남시감일3단지@경기도 하남시 감일순환로 170<NA>2022-06-29운영중272023-08-11
160경기도 하남시위례포레자이@경기도 하남시 위례대로 6길 95<NA>2022-06-29운영중282023-08-11
161경기도 하남시미사강변리버뷰자이@경기도 하남시 미사강변 한강로 10031-793-06912022-06-29운영중212023-08-11
162경기도 하남시힐스테이트센트럴위례경기도 하남시 위례대로6길 1502-431-86262022-11-28운영중282023-08-11
163경기도 하남시감일파크경기도 하남시 감일순환로 190070-8151-12002022-10-11운영중212023-08-11
164경기도 하남시미사역파라곤@경기도 하남시 미사강변동로 100<NA>2023-05-16운영중252023-08-11
165경기도 하남시위례아스트로@경기도 하남시 위례광장로 28502-6956-63062023-04-26운영중312023-08-11
166경기도 하남시위례중흥s클래스@경기도 하남시 위례대로6길 70<NA>2023-04-16운영중252023-08-11