Overview

Dataset statistics

Number of variables7
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory60.6 B

Variable types

Categorical1
Text3
Numeric2
DateTime1

Dataset

Description충청남도 내 청소년 수련시설 보유 현황을 수련관, 수련원, 유스호스텔, 야영장, 문화의집으로 분류하여 제공하고자합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15019508/fileData.do

Alerts

건물(제곱미터) is highly overall correlated with 수용인원High correlation
수용인원 is highly overall correlated with 건물(제곱미터)High correlation
시설명 has unique valuesUnique
소재지 has unique valuesUnique
개소일 has unique valuesUnique

Reproduction

Analysis started2024-03-14 22:56:01.980349
Analysis finished2024-03-14 22:56:04.445718
Duration2.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct5
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Memory size520.0 B
청소년 문화의 집
15 
청소년 수련원
14 
청소년 수련관
11 
유스호스텔
청소년 야영장

Length

Max length9
Median length7
Mean length7.3265306
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청소년 수련관
2nd row청소년 수련관
3rd row청소년 수련관
4th row청소년 수련관
5th row청소년 수련관

Common Values

ValueCountFrequency (%)
청소년 문화의 집 15
30.6%
청소년 수련원 14
28.6%
청소년 수련관 11
22.4%
유스호스텔 7
14.3%
청소년 야영장 2
 
4.1%

Length

2024-03-15T07:56:04.679826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T07:56:04.895115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청소년 42
39.6%
문화의 15
 
14.2%
15
 
14.2%
수련원 14
 
13.2%
수련관 11
 
10.4%
유스호스텔 7
 
6.6%
야영장 2
 
1.9%

시설명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
2024-03-15T07:56:05.988286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length10.673469
Min length8

Characters and Unicode

Total characters523
Distinct characters113
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row태조산 청소년수련관
2nd row천안시 청소년수련관
3rd row보령시청소년 수련관
4th row아산청소년 교육문화센터
5th row서산시청소년수련관
ValueCountFrequency (%)
청소년 10
 
8.9%
청소년수련원 10
 
8.9%
문화의집 10
 
8.9%
유스호스텔 7
 
6.2%
수련관 5
 
4.5%
청소년수련관 3
 
2.7%
문화센터 3
 
2.7%
수련원 2
 
1.8%
부여 2
 
1.8%
부여군 2
 
1.8%
Other values (55) 58
51.8%
2024-03-15T07:56:07.479602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
12.2%
42
 
8.0%
40
 
7.6%
39
 
7.5%
22
 
4.2%
21
 
4.0%
14
 
2.7%
14
 
2.7%
14
 
2.7%
13
 
2.5%
Other values (103) 240
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 450
86.0%
Space Separator 64
 
12.2%
Lowercase Letter 4
 
0.8%
Uppercase Letter 2
 
0.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
9.3%
40
 
8.9%
39
 
8.7%
22
 
4.9%
21
 
4.7%
14
 
3.1%
14
 
3.1%
14
 
3.1%
13
 
2.9%
12
 
2.7%
Other values (93) 219
48.7%
Lowercase Letter
ValueCountFrequency (%)
p 1
25.0%
e 1
25.0%
k 1
25.0%
a 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
50.0%
W 1
50.0%
Space Separator
ValueCountFrequency (%)
64
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 449
85.9%
Common 67
 
12.8%
Latin 6
 
1.1%
Han 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
9.4%
40
 
8.9%
39
 
8.7%
22
 
4.9%
21
 
4.7%
14
 
3.1%
14
 
3.1%
14
 
3.1%
13
 
2.9%
12
 
2.7%
Other values (92) 218
48.6%
Latin
ValueCountFrequency (%)
p 1
16.7%
U 1
16.7%
e 1
16.7%
k 1
16.7%
a 1
16.7%
W 1
16.7%
Common
ValueCountFrequency (%)
64
95.5%
( 1
 
1.5%
) 1
 
1.5%
- 1
 
1.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 449
85.9%
ASCII 73
 
14.0%
CJK 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
64
87.7%
( 1
 
1.4%
) 1
 
1.4%
- 1
 
1.4%
p 1
 
1.4%
U 1
 
1.4%
e 1
 
1.4%
k 1
 
1.4%
a 1
 
1.4%
W 1
 
1.4%
Hangul
ValueCountFrequency (%)
42
 
9.4%
40
 
8.9%
39
 
8.7%
22
 
4.9%
21
 
4.7%
14
 
3.1%
14
 
3.1%
14
 
3.1%
13
 
2.9%
12
 
2.7%
Other values (92) 218
48.6%
CJK
ValueCountFrequency (%)
1
100.0%

소재지
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
2024-03-15T07:56:08.720019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length16.061224
Min length9

Characters and Unicode

Total characters787
Distinct characters118
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row천안시 동남구 태조산길 261
2nd row천안시동남구 중앙로 111
3rd row보령시 성주면 성주산로 500
4th row아산시 시민로 500
5th row서산시 서령로 136
ValueCountFrequency (%)
태안군 6
 
3.1%
서산시 5
 
2.6%
천안시 4
 
2.1%
청양군 4
 
2.1%
공주시 4
 
2.1%
부여군 4
 
2.1%
동남구 3
 
1.6%
33 3
 
1.6%
아산시 3
 
1.6%
서천군 3
 
1.6%
Other values (136) 154
79.8%
2024-03-15T07:56:10.452516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
152
 
19.3%
1 35
 
4.4%
30
 
3.8%
27
 
3.4%
3 27
 
3.4%
26
 
3.3%
24
 
3.0%
24
 
3.0%
21
 
2.7%
4 19
 
2.4%
Other values (108) 402
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 455
57.8%
Decimal Number 167
 
21.2%
Space Separator 152
 
19.3%
Dash Punctuation 13
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
6.6%
27
 
5.9%
26
 
5.7%
24
 
5.3%
24
 
5.3%
21
 
4.6%
16
 
3.5%
16
 
3.5%
14
 
3.1%
13
 
2.9%
Other values (96) 244
53.6%
Decimal Number
ValueCountFrequency (%)
1 35
21.0%
3 27
16.2%
4 19
11.4%
2 18
10.8%
0 16
9.6%
5 15
9.0%
6 12
 
7.2%
7 10
 
6.0%
9 8
 
4.8%
8 7
 
4.2%
Space Separator
ValueCountFrequency (%)
152
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 455
57.8%
Common 332
42.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
6.6%
27
 
5.9%
26
 
5.7%
24
 
5.3%
24
 
5.3%
21
 
4.6%
16
 
3.5%
16
 
3.5%
14
 
3.1%
13
 
2.9%
Other values (96) 244
53.6%
Common
ValueCountFrequency (%)
152
45.8%
1 35
 
10.5%
3 27
 
8.1%
4 19
 
5.7%
2 18
 
5.4%
0 16
 
4.8%
5 15
 
4.5%
- 13
 
3.9%
6 12
 
3.6%
7 10
 
3.0%
Other values (2) 15
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 455
57.8%
ASCII 332
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
152
45.8%
1 35
 
10.5%
3 27
 
8.1%
4 19
 
5.7%
2 18
 
5.4%
0 16
 
4.8%
5 15
 
4.5%
- 13
 
3.9%
6 12
 
3.6%
7 10
 
3.0%
Other values (2) 15
 
4.5%
Hangul
ValueCountFrequency (%)
30
 
6.6%
27
 
5.9%
26
 
5.7%
24
 
5.3%
24
 
5.3%
21
 
4.6%
16
 
3.5%
16
 
3.5%
14
 
3.1%
13
 
2.9%
Other values (96) 244
53.6%
Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
2024-03-15T07:56:11.287485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.5510204
Min length3

Characters and Unicode

Total characters223
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)95.9%

Sample

1st row28734
2nd row4410
3rd row11315
4th row71952
5th row4014
ValueCountFrequency (%)
11245 2
 
4.1%
28734 1
 
2.0%
19109 1
 
2.0%
52491 1
 
2.0%
809 1
 
2.0%
6640 1
 
2.0%
11689 1
 
2.0%
42671 1
 
2.0%
14580 1
 
2.0%
24,566 1
 
2.0%
Other values (38) 38
77.6%
2024-03-15T07:56:12.374491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 41
18.4%
2 28
12.6%
4 26
11.7%
6 26
11.7%
9 22
9.9%
8 18
8.1%
5 17
7.6%
0 17
7.6%
3 15
 
6.7%
7 12
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 222
99.6%
Other Punctuation 1
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 41
18.5%
2 28
12.6%
4 26
11.7%
6 26
11.7%
9 22
9.9%
8 18
8.1%
5 17
7.7%
0 17
7.7%
3 15
 
6.8%
7 12
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 223
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 41
18.4%
2 28
12.6%
4 26
11.7%
6 26
11.7%
9 22
9.9%
8 18
8.1%
5 17
7.6%
0 17
7.6%
3 15
 
6.7%
7 12
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 223
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 41
18.4%
2 28
12.6%
4 26
11.7%
6 26
11.7%
9 22
9.9%
8 18
8.1%
5 17
7.6%
0 17
7.6%
3 15
 
6.7%
7 12
 
5.4%

건물(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4073.2653
Minimum135
Maximum35144
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size569.0 B
2024-03-15T07:56:12.870436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum135
5-th percentile514.6
Q11339
median2870
Q34750
95-th percentile8554.2
Maximum35144
Range35009
Interquartile range (IQR)3411

Descriptive statistics

Standard deviation5314.3037
Coefficient of variation (CV)1.304679
Kurtosis24.729322
Mean4073.2653
Median Absolute Deviation (MAD)1612
Skewness4.4350203
Sum199590
Variance28241824
MonotonicityNot monotonic
2024-03-15T07:56:13.194150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
7518 2
 
4.1%
461 1
 
2.0%
1451 1
 
2.0%
3399 1
 
2.0%
6408 1
 
2.0%
1294 1
 
2.0%
3720 1
 
2.0%
2886 1
 
2.0%
4963 1
 
2.0%
6251 1
 
2.0%
Other values (38) 38
77.6%
ValueCountFrequency (%)
135 1
2.0%
381 1
2.0%
461 1
2.0%
595 1
2.0%
669 1
2.0%
746 1
2.0%
843 1
2.0%
1053 1
2.0%
1121 1
2.0%
1177 1
2.0%
ValueCountFrequency (%)
35144 1
2.0%
13614 1
2.0%
8625 1
2.0%
8448 1
2.0%
7518 2
4.1%
7252 1
2.0%
7104 1
2.0%
6925 1
2.0%
6408 1
2.0%
6251 1
2.0%

수용인원
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean440.79592
Minimum50
Maximum3720
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size569.0 B
2024-03-15T07:56:13.591324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile68.8
Q1120
median300
Q3500
95-th percentile1043.2
Maximum3720
Range3670
Interquartile range (IQR)380

Descriptive statistics

Standard deviation567.4569
Coefficient of variation (CV)1.2873461
Kurtosis23.373018
Mean440.79592
Median Absolute Deviation (MAD)191
Skewness4.2640603
Sum21599
Variance322007.33
MonotonicityNot monotonic
2024-03-15T07:56:14.021614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
300 5
 
10.2%
100 4
 
8.2%
500 3
 
6.1%
101 2
 
4.1%
1000 2
 
4.1%
50 2
 
4.1%
150 2
 
4.1%
153 1
 
2.0%
400 1
 
2.0%
238 1
 
2.0%
Other values (26) 26
53.1%
ValueCountFrequency (%)
50 2
4.1%
68 1
 
2.0%
70 1
 
2.0%
100 4
8.2%
101 2
4.1%
109 1
 
2.0%
117 1
 
2.0%
120 1
 
2.0%
143 1
 
2.0%
150 2
4.1%
ValueCountFrequency (%)
3720 1
2.0%
1212 1
2.0%
1072 1
2.0%
1000 2
4.1%
885 1
2.0%
851 1
2.0%
800 1
2.0%
789 1
2.0%
787 1
2.0%
530 1
2.0%

개소일
Date

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
Minimum1994-08-17 00:00:00
Maximum2023-03-14 00:00:00
2024-03-15T07:56:14.448694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:56:14.863389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)

Interactions

2024-03-15T07:56:03.022931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:56:02.580311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:56:03.496300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:56:02.766376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T07:56:15.055187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
구분1.0001.0001.0000.8480.4980.3961.000
시설명1.0001.0001.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.0001.000
부지(제곱미터)0.8481.0001.0001.0001.0001.0001.000
건물(제곱미터)0.4981.0001.0001.0001.0000.9011.000
수용인원0.3961.0001.0001.0000.9011.0001.000
개소일1.0001.0001.0001.0001.0001.0001.000
2024-03-15T07:56:15.337351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건물(제곱미터)수용인원구분
건물(제곱미터)1.0000.8000.211
수용인원0.8001.0000.151
구분0.2110.1511.000

Missing values

2024-03-15T07:56:03.871330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T07:56:04.287429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
0청소년 수련관태조산 청소년수련관천안시 동남구 태조산길 2612873426363001994-08-17
1청소년 수련관천안시 청소년수련관천안시동남구 중앙로 111441044103502012-07-03
2청소년 수련관보령시청소년 수련관보령시 성주면 성주산로 5001131538031171998-06-23
3청소년 수련관아산청소년 교육문화센터아산시 시민로 5007195272527872010-06-15
4청소년 수련관서산시청소년수련관서산시 서령로 136401440143002013-03-11
5청소년 수련관논산시청소년 문화센터논산시 논산대로 4244958318210001999-10-20
6청소년 수련관금산다락원 청소년수련관금산군 금산읍 금산로 155939333136148002005-04-11
7청소년 수련관서천청소년 수련관서천군 장항읍 장항산단로 34번길 60-231236328703002014-07-28
8청소년 수련관홍성청소년 수련관홍성군 홍성읍 문화로33번길 33164127567892002-03-29
9청소년 수련관예산군청소년 수련관예산군 예산읍 벚꽃로 2148919229543602014-01-03
구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
39청소년 수련원청포대썬셋 수련원태안군 남면 안면대로 13081668269258512014-11-21
40청소년 야영장모두 휴(休) 청소년야영장청양군 대치면 청산로 690-1313624595682019-05-15
41청소년 야영장솔향기길청소년야영장태안군 이원면 내리 522-795761352202019-07-23
42유스호스텔천안 상록 유스호스텔천안시 동남구 수신면 수신로 5762538086258851999-12-23
43유스호스텔계룡산 갑사 유스호스텔공주시 계룡면 갑사로 458-4247738355301995-07-10
44유스호스텔공주 유스호스텔공주시 탄천면 삼거리1길 8-119639710410722004-04-09
45유스호스텔자연 부여 유스호스텔부여군 외산면 반교동로 6995018181802003-06-14
46유스호스텔부여군 유스호스텔부여군 부여읍 의열로 431144447502802015-08-28
47유스호스텔서천 유스호스텔서천군 장항읍 장항산단로 34번길 72-40626824653332014-06-18
48유스호스텔아가페 유스호스텔태안군 근흥면 용안길 1001124575185002007-10-12