Overview

Dataset statistics

Number of variables7
Number of observations48
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory61.8 B

Variable types

Categorical1
Text2
Numeric3
DateTime1

Dataset

Description충청남도 내 청소년 수련시설 보유 현황을 수련관, 수련원, 유스호스텔, 야영장, 문화의집으로 분류하여 제공하고자합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=404&beforeMenuCd=DOM_000000201001001000&publicdatapk=15019508

Alerts

건물(제곱미터) is highly overall correlated with 수용인원High correlation
수용인원 is highly overall correlated with 건물(제곱미터)High correlation
시설명 has unique valuesUnique
소재지 has unique valuesUnique
개소일 has unique valuesUnique

Reproduction

Analysis started2024-01-09 23:05:52.034890
Analysis finished2024-01-09 23:05:53.294335
Duration1.26 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct5
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Memory size516.0 B
청소년 문화의 집
15 
청소년 수련원
13 
청소년 수련관
11 
유스호스텔
청소년 야영장

Length

Max length9
Median length7
Mean length7.3333333
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청소년 수련관
2nd row청소년 수련관
3rd row청소년 수련관
4th row청소년 수련관
5th row청소년 수련관

Common Values

ValueCountFrequency (%)
청소년 문화의 집 15
31.2%
청소년 수련원 13
27.1%
청소년 수련관 11
22.9%
유스호스텔 7
14.6%
청소년 야영장 2
 
4.2%

Length

2024-01-10T08:05:53.363705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:05:53.464108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청소년 41
39.4%
문화의 15
 
14.4%
15
 
14.4%
수련원 13
 
12.5%
수련관 11
 
10.6%
유스호스텔 7
 
6.7%
야영장 2
 
1.9%

시설명
Text

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-10T08:05:53.679947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length10.604167
Min length8

Characters and Unicode

Total characters509
Distinct characters105
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row태조산 청소년수련관
2nd row천안시 청소년수련관
3rd row보령시청소년 수련관
4th row아산청소년 교육문화센터
5th row서산시청소년수련관
ValueCountFrequency (%)
청소년 10
 
9.1%
문화의집 10
 
9.1%
청소년수련원 10
 
9.1%
유스호스텔 7
 
6.4%
수련관 5
 
4.5%
청소년수련관 3
 
2.7%
문화센터 3
 
2.7%
부여 2
 
1.8%
수련원 2
 
1.8%
부여군 2
 
1.8%
Other values (53) 56
50.9%
2024-01-10T08:05:54.010408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
12.4%
41
 
8.1%
39
 
7.7%
38
 
7.5%
22
 
4.3%
21
 
4.1%
14
 
2.8%
14
 
2.8%
14
 
2.8%
13
 
2.6%
Other values (95) 230
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 444
87.2%
Space Separator 63
 
12.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
9.2%
39
 
8.8%
38
 
8.6%
22
 
5.0%
21
 
4.7%
14
 
3.2%
14
 
3.2%
14
 
3.2%
13
 
2.9%
13
 
2.9%
Other values (92) 215
48.4%
Space Separator
ValueCountFrequency (%)
63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 443
87.0%
Common 65
 
12.8%
Han 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
9.3%
39
 
8.8%
38
 
8.6%
22
 
5.0%
21
 
4.7%
14
 
3.2%
14
 
3.2%
14
 
3.2%
13
 
2.9%
13
 
2.9%
Other values (91) 214
48.3%
Common
ValueCountFrequency (%)
63
96.9%
( 1
 
1.5%
) 1
 
1.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 443
87.0%
ASCII 65
 
12.8%
CJK 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
63
96.9%
( 1
 
1.5%
) 1
 
1.5%
Hangul
ValueCountFrequency (%)
41
 
9.3%
39
 
8.8%
38
 
8.6%
22
 
5.0%
21
 
4.7%
14
 
3.2%
14
 
3.2%
14
 
3.2%
13
 
2.9%
13
 
2.9%
Other values (91) 214
48.3%
CJK
ValueCountFrequency (%)
1
100.0%

소재지
Text

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-10T08:05:54.304435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length16.041667
Min length9

Characters and Unicode

Total characters770
Distinct characters120
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row천안시 동남구 태조산길 261
2nd row천안시동남구 중앙로 111
3rd row보령시 성주면 성주산로 500
4th row아산시 시민로 500
5th row서산시 서령로 136
ValueCountFrequency (%)
태안군 6
 
3.2%
천안시 4
 
2.1%
서산시 4
 
2.1%
부여군 4
 
2.1%
공주시 4
 
2.1%
청양군 4
 
2.1%
서천군 3
 
1.6%
보령시 3
 
1.6%
33 3
 
1.6%
당진시 3
 
1.6%
Other values (134) 151
79.9%
2024-01-10T08:05:54.735868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
 
19.4%
1 33
 
4.3%
29
 
3.8%
3 27
 
3.5%
26
 
3.4%
24
 
3.1%
24
 
3.1%
23
 
3.0%
21
 
2.7%
4 18
 
2.3%
Other values (110) 396
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
58.1%
Decimal Number 161
 
20.9%
Space Separator 149
 
19.4%
Dash Punctuation 13
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
6.5%
26
 
5.8%
24
 
5.4%
24
 
5.4%
23
 
5.1%
21
 
4.7%
16
 
3.6%
15
 
3.4%
14
 
3.1%
12
 
2.7%
Other values (98) 243
54.4%
Decimal Number
ValueCountFrequency (%)
1 33
20.5%
3 27
16.8%
4 18
11.2%
2 17
10.6%
0 16
9.9%
6 13
 
8.1%
5 13
 
8.1%
7 10
 
6.2%
9 7
 
4.3%
8 7
 
4.3%
Space Separator
ValueCountFrequency (%)
149
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
58.1%
Common 323
41.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
6.5%
26
 
5.8%
24
 
5.4%
24
 
5.4%
23
 
5.1%
21
 
4.7%
16
 
3.6%
15
 
3.4%
14
 
3.1%
12
 
2.7%
Other values (98) 243
54.4%
Common
ValueCountFrequency (%)
149
46.1%
1 33
 
10.2%
3 27
 
8.4%
4 18
 
5.6%
2 17
 
5.3%
0 16
 
5.0%
- 13
 
4.0%
6 13
 
4.0%
5 13
 
4.0%
7 10
 
3.1%
Other values (2) 14
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
58.1%
ASCII 323
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
149
46.1%
1 33
 
10.2%
3 27
 
8.4%
4 18
 
5.6%
2 17
 
5.3%
0 16
 
5.0%
- 13
 
4.0%
6 13
 
4.0%
5 13
 
4.0%
7 10
 
3.1%
Other values (2) 14
 
4.3%
Hangul
ValueCountFrequency (%)
29
 
6.5%
26
 
5.8%
24
 
5.4%
24
 
5.4%
23
 
5.1%
21
 
4.7%
16
 
3.6%
15
 
3.4%
14
 
3.1%
12
 
2.7%
Other values (98) 243
54.4%

부지(제곱미터)
Real number (ℝ)

Distinct47
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24835.854
Minimum809
Maximum278521
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-01-10T08:05:54.863241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum809
5-th percentile1200.7
Q12470.75
median10597.5
Q321074.25
95-th percentile85746.6
Maximum278521
Range277712
Interquartile range (IQR)18603.5

Descriptive statistics

Standard deviation46579.735
Coefficient of variation (CV)1.8755037
Kurtosis19.088685
Mean24835.854
Median Absolute Deviation (MAD)8591
Skewness3.9807152
Sum1192121
Variance2.1696718 × 109
MonotonicityNot monotonic
2024-01-10T08:05:54.982313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
11245 2
 
4.2%
4410 1
 
2.1%
809 1
 
2.1%
6640 1
 
2.1%
11689 1
 
2.1%
42671 1
 
2.1%
14580 1
 
2.1%
13798 1
 
2.1%
26726 1
 
2.1%
19109 1
 
2.1%
Other values (37) 37
77.1%
ValueCountFrequency (%)
809 1
2.1%
1159 1
2.1%
1186 1
2.1%
1228 1
2.1%
1503 1
2.1%
1548 1
2.1%
1626 1
2.1%
1641 1
2.1%
1822 1
2.1%
1897 1
2.1%
ValueCountFrequency (%)
278521 1
2.1%
136961 1
2.1%
89192 1
2.1%
79348 1
2.1%
71952 1
2.1%
68000 1
2.1%
52491 1
2.1%
42671 1
2.1%
39333 1
2.1%
28734 1
2.1%

건물(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4080.625
Minimum135
Maximum35144
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-01-10T08:05:55.097719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum135
5-th percentile507.9
Q11327.75
median2813
Q34803.25
95-th percentile8563.05
Maximum35144
Range35009
Interquartile range (IQR)3475.5

Descriptive statistics

Standard deviation5370.2889
Coefficient of variation (CV)1.3160457
Kurtosis24.194976
Mean4080.625
Median Absolute Deviation (MAD)1616.5
Skewness4.3888086
Sum195870
Variance28840003
MonotonicityNot monotonic
2024-01-10T08:05:55.217378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
7518 2
 
4.2%
4410 1
 
2.1%
35144 1
 
2.1%
1451 1
 
2.1%
3399 1
 
2.1%
6408 1
 
2.1%
1294 1
 
2.1%
2886 1
 
2.1%
4963 1
 
2.1%
6251 1
 
2.1%
Other values (37) 37
77.1%
ValueCountFrequency (%)
135 1
2.1%
381 1
2.1%
461 1
2.1%
595 1
2.1%
669 1
2.1%
746 1
2.1%
843 1
2.1%
1053 1
2.1%
1121 1
2.1%
1177 1
2.1%
ValueCountFrequency (%)
35144 1
2.1%
13614 1
2.1%
8625 1
2.1%
8448 1
2.1%
7518 2
4.2%
7252 1
2.1%
7104 1
2.1%
6925 1
2.1%
6408 1
2.1%
6251 1
2.1%

수용인원
Real number (ℝ)

HIGH CORRELATION 

Distinct35
Distinct (%)72.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean372.47917
Minimum50
Maximum1212
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-01-10T08:05:55.339966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile68.7
Q1119.25
median300
Q3500
95-th percentile1000
Maximum1212
Range1162
Interquartile range (IQR)380.75

Descriptive statistics

Standard deviation308.70863
Coefficient of variation (CV)0.82879436
Kurtosis0.41949649
Mean372.47917
Median Absolute Deviation (MAD)187
Skewness1.1653244
Sum17879
Variance95301.021
MonotonicityNot monotonic
2024-01-10T08:05:55.458395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
300 5
 
10.4%
100 4
 
8.3%
500 3
 
6.2%
1000 2
 
4.2%
50 2
 
4.2%
150 2
 
4.2%
101 2
 
4.2%
68 1
 
2.1%
400 1
 
2.1%
238 1
 
2.1%
Other values (25) 25
52.1%
ValueCountFrequency (%)
50 2
4.2%
68 1
 
2.1%
70 1
 
2.1%
100 4
8.3%
101 2
4.2%
109 1
 
2.1%
117 1
 
2.1%
120 1
 
2.1%
143 1
 
2.1%
150 2
4.2%
ValueCountFrequency (%)
1212 1
 
2.1%
1072 1
 
2.1%
1000 2
4.2%
885 1
 
2.1%
851 1
 
2.1%
800 1
 
2.1%
789 1
 
2.1%
787 1
 
2.1%
530 1
 
2.1%
500 3
6.2%

개소일
Date

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
Minimum1994-08-17 00:00:00
Maximum2020-11-28 00:00:00
2024-01-10T08:05:55.577361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:55.720820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)

Interactions

2024-01-10T08:05:52.886236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.361792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.616285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.964128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.434423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.712528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:53.045801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.518934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:05:52.806437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T08:05:55.805559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
구분1.0001.0001.0000.0000.4760.5941.000
시설명1.0001.0001.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.0001.000
부지(제곱미터)0.0001.0001.0001.0000.3730.0001.000
건물(제곱미터)0.4761.0001.0000.3731.0000.7781.000
수용인원0.5941.0001.0000.0000.7781.0001.000
개소일1.0001.0001.0001.0001.0001.0001.000
2024-01-10T08:05:55.905819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부지(제곱미터)건물(제곱미터)수용인원구분
부지(제곱미터)1.0000.4930.3110.000
건물(제곱미터)0.4931.0000.8140.200
수용인원0.3110.8141.0000.373
구분0.0000.2000.3731.000

Missing values

2024-01-10T08:05:53.147421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T08:05:53.251536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
0청소년 수련관태조산 청소년수련관천안시 동남구 태조산길 2612873426363001994-08-17
1청소년 수련관천안시 청소년수련관천안시동남구 중앙로 111441044103502012-07-03
2청소년 수련관보령시청소년 수련관보령시 성주면 성주산로 5001131538031171998-06-23
3청소년 수련관아산청소년 교육문화센터아산시 시민로 5007195272527872010-06-15
4청소년 수련관서산시청소년수련관서산시 서령로 136401440143002013-03-11
5청소년 수련관논산시청소년 문화센터논산시 논산대로 4244958318210001999-10-20
6청소년 수련관금산다락원 청소년수련관금산군 금성면 적우실길 639333136148002005-04-11
7청소년 수련관서천청소년 수련관서천군 장항읍 장항산단로 34번길 60-231236328703002014-07-28
8청소년 수련관홍성청소년 수련관홍성군 홍성읍 문화로33번길 33164127567892002-03-29
9청소년 수련관예산군청소년 수련관예산군 예산읍 벚꽃로 2148919229543602014-01-03
구분시설명소재지부지(제곱미터)건물(제곱미터)수용인원개소일
38청소년 수련원청포대썬셋 수련원태안군 남면 안면대로 13081668269258512014-11-21
39청소년 야영장모두 휴(休) 청소년야영장청양군 대치면 청산로 690-1313624595682019-05-15
40청소년 야영장솔향기길청소년야영장태안군 이원면 내리 522-795761352202019-07-23
41유스호스텔천안 상록 유스호스텔천안시 동남구 수신면 수신로 5762538086258851999-12-23
42유스호스텔계룡산 갑사 유스호스텔공주시 계룡면 갑사로 458-4247738355301995-07-10
43유스호스텔공주 유스호스텔공주시 탄천면 삼거리1길 8-119639710410722004-04-09
44유스호스텔자연 부여 유스호스텔부여군 외산면 반교동로 6995018181802003-06-14
45유스호스텔부여군 유스호스텔부여군 부여읍 의열로 431144447502802015-08-28
46유스호스텔서천 유스호스텔서천군 장항읍 장항산단로 34번길 72-40626824653332014-06-18
47유스호스텔아가페 유스호스텔태안군 근흥면 용안길 1001124575185002007-10-12