Overview

Dataset statistics

Number of variables: 5
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.4 KiB
Average record size in memory: 46.4 B

Variable types

Text: 1
Numeric: 2
Categorical: 1
DateTime: 1

Dataset

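Description: Building status of the Gyeonggi-do Youth Training Center (경기도청소년수련원 건축현황)
Author: Gyeonggi-do Youth Training Center (경기도청소년수련원)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=38F0TPZ0N6ZW8ZDES1XU31687154&infSeq=1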

Alerts

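데이터기준일자 (data reference date) has a constant value [Constant]
연면적 (total floor area) is highly overall correlated with 건축층수 (number of floors) [High correlation]
건축년도 (construction year) is highly overall correlated with 건축층수 (number of floors) [High correlation]
건축층수 (number of floors) is highly overall correlated with 연면적 (total floor area) and 1 other field [High correlation]
건물명 (building name) has unique values [Unique]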

Reproduction

Analysis started: 2023-12-10 23:04:49.989831
Analysis finished: 2023-12-10 23:04:50.633656
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

건물명 (building name)
Text

UNIQUE

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 26
Median length: 21
Mean length: 19.266667
Min length: 14

Characters and Unicode

Total characters: 578
Distinct characters: 72
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
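
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories for one 건물명 value using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its figures. The two-letter codes correspond to the category names used in this report: `Lo` is "Other Letter", `Ps` is "Open Punctuation", and `Pe` is "Close Punctuation".

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample row of the 건물명 column in this report.
sample = "관리동(경기도청소년야영장)"
counts = category_counts(sample)

# Hangul syllables fall under 'Lo'; the parentheses under 'Ps' and 'Pe'.
print(dict(counts))
```

The standard library exposes general categories but not scripts or blocks, so the script and block breakdowns in this report would need an additional data source.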

Unique

Unique: 30
Unique (%): 100.0%

Sample

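1st row: 관리동(경기도청소년야영장) [Administration building (Gyeonggi-do Youth Campsite)]
2nd row: 종합지원본부(경기도청소년야영장) [General support headquarters (Gyeonggi-do Youth Campsite)]
3rd row: 실내체육관(경기도청소년야영장) [Indoor gymnasium (Gyeonggi-do Youth Campsite)]
4th row: 수영장(경기도청소년야영장) [Swimming pool (Gyeonggi-do Youth Campsite)]
5th row: 1대피동(경기도청소년야영장) [Shelter building 1 (Gyeonggi-do Youth Campsite)]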
Value | Count | Frequency (%)
(not recovered) | 8 | 14.5%
취사 | 7 | 12.7%
샤워,화장실(경기도청소년야영장 | 5 | 9.1%
7영지 | 2 | 3.6%
샤워실(경기도청소년야영장 | 2 | 3.6%
화장실(경기도청소년야영장 | 2 | 3.6%
4영지 | 2 | 3.6%
체험동(도자기)(경기도청소년수련원 | 1 | 1.8%
가마동(도자기)(경기도청소년수련원 | 1 | 1.8%
도자기강의동(경기도청소년수련원 | 1 | 1.8%
Other values (24) | 24 | 43.6%
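
The word values above lack their closing parenthesis, which is consistent with splitting each name on whitespace and then stripping punctuation from both ends of every token. The sketch below reproduces that behaviour on three names taken from this dataset. It is an assumption about the tokenization, not a confirmed description of the tool's internals, and `word_counts` is a hypothetical helper.

```python
import string
from collections import Counter

# Characters removed from both ends of each token (assumed behaviour).
STRIP_CHARS = string.punctuation + string.whitespace


def word_counts(values):
    """Split values on whitespace, strip edge punctuation, count tokens."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            word = token.strip(STRIP_CHARS)
            if word:  # skip tokens that were pure punctuation
                counts[word] += 1
    return counts


names = [
    "1영지 취사 및 샤워,화장실(경기도청소년야영장)",
    "2영지 취사 및 샤워,화장실(경기도청소년야영장)",
    "체험동(도자기)(경기도청소년수련원)",
]
counts = word_counts(names)
for word, count in counts.most_common():
    print(word, count)
```

Only the trailing `)` is stripped because the inner punctuation is not at the edge of the token, which matches entries such as 체험동(도자기)(경기도청소년수련원 in the table.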

Most occurring characters

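Value | Count | Frequency (%)
) | 39 | 6.7%
( | 39 | 6.7%
(not recovered) | 33 | 5.7%
(not recovered) | 33 | 5.7%
(not recovered) | 31 | 5.4%
(not recovered) | 30 | 5.2%
(not recovered) | 30 | 5.2%
(not recovered) | 30 | 5.2%
(not recovered) | 30 | 5.2%
(not recovered) | 27 | 4.7%
Other values (62) | 256 | 44.3%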

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 456 | 78.9%
Close Punctuation | 39 | 6.7%
Open Punctuation | 39 | 6.7%
Space Separator | 25 | 4.3%
Decimal Number | 12 | 2.1%
Other Punctuation | 5 | 0.9%
Uppercase Letter | 2 | 0.3%

Most frequent character per category

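Other Letter
Value | Count | Frequency (%)
(not recovered) | 33 | 7.2%
(not recovered) | 33 | 7.2%
(not recovered) | 31 | 6.8%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 27 | 5.9%
(not recovered) | 18 | 3.9%
(not recovered) | 15 | 3.3%
Other values (49) | 179 | 39.3%

Decimal Number
Value | Count | Frequency (%)
7 | 2 | 16.7%
4 | 2 | 16.7%
2 | 2 | 16.7%
3 | 2 | 16.7%
1 | 2 | 16.7%
6 | 1 | 8.3%
5 | 1 | 8.3%

Uppercase Letter
Value | Count | Frequency (%)
H | 1 | 50.0%
G | 1 | 50.0%

Close Punctuation
Value | Count | Frequency (%)
) | 39 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 39 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 25 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 5 | 100.0%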

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 456 | 78.9%
Common | 120 | 20.8%
Latin | 2 | 0.3%

Most frequent character per script

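Hangul
Value | Count | Frequency (%)
(not recovered) | 33 | 7.2%
(not recovered) | 33 | 7.2%
(not recovered) | 31 | 6.8%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 27 | 5.9%
(not recovered) | 18 | 3.9%
(not recovered) | 15 | 3.3%
Other values (49) | 179 | 39.3%

Common
Value | Count | Frequency (%)
) | 39 | 32.5%
( | 39 | 32.5%
(space) | 25 | 20.8%
, | 5 | 4.2%
7 | 2 | 1.7%
4 | 2 | 1.7%
2 | 2 | 1.7%
3 | 2 | 1.7%
1 | 2 | 1.7%
6 | 1 | 0.8%

Latin
Value | Count | Frequency (%)
H | 1 | 50.0%
G | 1 | 50.0%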

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 456 | 78.9%
ASCII | 122 | 21.1%

Most frequent character per block

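ASCII
Value | Count | Frequency (%)
) | 39 | 32.0%
( | 39 | 32.0%
(space) | 25 | 20.5%
, | 5 | 4.1%
7 | 2 | 1.6%
4 | 2 | 1.6%
2 | 2 | 1.6%
3 | 2 | 1.6%
1 | 2 | 1.6%
H | 1 | 0.8%
Other values (3) | 3 | 2.5%

Hangul
Value | Count | Frequency (%)
(not recovered) | 33 | 7.2%
(not recovered) | 33 | 7.2%
(not recovered) | 31 | 6.8%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 30 | 6.6%
(not recovered) | 27 | 5.9%
(not recovered) | 18 | 3.9%
(not recovered) | 15 | 3.3%
Other values (49) | 179 | 39.3%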

연면적 (total floor area)
Real number (ℝ)

HIGH CORRELATION

Distinct: 25
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 477.00033
Minimum: 42.7
Maximum: 2724.48
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 42.7
5-th percentile: 54
Q1: 93.96
Median: 137.38
Q3: 577.9425
95-th percentile: 2067.048
Maximum: 2724.48
Range: 2681.78
Interquartile range (IQR): 483.9825

Descriptive statistics

Standard deviation: 703.03856
Coefficient of variation (CV): 1.4738743
Kurtosis: 5.4270576
Mean: 477.00033
Median Absolute Deviation (MAD): 76.845
Skewness: 2.3571071
Sum: 14310.01
Variance: 494263.21
Monotonicity: Not monotonic
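
Several of the 연면적 figures are simple functions of one another, which gives a quick consistency check on the report. The sketch below recomputes the mean, coefficient of variation, range, IQR, and variance from the other reported values. All inputs are copied from this section, the variable names are illustrative, and small differences are expected because the reported inputs are rounded.

```python
# Reported values for 연면적, copied from this section.
n = 30                      # number of observations
total = 14310.01            # Sum
mean = 477.00033            # Mean
std = 703.03856             # Standard deviation
minimum, maximum = 42.7, 2724.48
q1, q3 = 93.96, 577.9425

derived = {
    "mean": total / n,             # Sum / number of observations
    "cv": std / mean,              # coefficient of variation
    "range": maximum - minimum,    # Maximum - Minimum
    "iqr": q3 - q1,                # Q3 - Q1
    "variance": std ** 2,          # square of the standard deviation
}

for name, value in derived.items():
    print(f"{name}: {value:.6f}")
```

A CV well above 1 together with a skewness of about 2.36 indicates a strongly right-skewed distribution, dominated by a few large buildings.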
Histogram with fixed size bins (bins=25)
Common values
Value | Count | Frequency (%)
95.48 | 3 | 10.0%
209.82 | 2 | 6.7%
93.96 | 2 | 6.7%
54.0 | 2 | 6.7%
515.58 | 1 | 3.3%
56.7 | 1 | 3.3%
249.72 | 1 | 3.3%
989.0 | 1 | 3.3%
67.48 | 1 | 3.3%
105.6 | 1 | 3.3%
Other values (15) | 15 | 50.0%

Minimum 10 values
Value | Count | Frequency (%)
42.7 | 1 | 3.3%
54.0 | 2 | 6.7%
56.7 | 1 | 3.3%
64.37 | 1 | 3.3%
67.48 | 1 | 3.3%
76.8 | 1 | 3.3%
93.96 | 2 | 6.7%
95.48 | 3 | 10.0%
102.96 | 1 | 3.3%
105.6 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
2724.48 | 1 | 3.3%
2685.78 | 1 | 3.3%
1310.82 | 1 | 3.3%
1080.22 | 1 | 3.3%
989.0 | 1 | 3.3%
978.32 | 1 | 3.3%
760.29 | 1 | 3.3%
598.73 | 1 | 3.3%
515.58 | 1 | 3.3%
425.25 | 1 | 3.3%

건축년도 (construction year)
Real number (ℝ)

HIGH CORRELATION

Distinct: 7
Distinct (%): 23.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2002.8
Minimum: 1996
Maximum: 2019
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1996
5-th percentile: 1997.35
Q1: 2001
Median: 2001
Q3: 2001
95-th percentile: 2011
Maximum: 2019
Range: 23
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 4.9854962
Coefficient of variation (CV): 0.0024892631
Kurtosis: 2.9774272
Mean: 2002.8
Median Absolute Deviation (MAD): 0
Skewness: 1.7086348
Sum: 60084
Variance: 24.855172
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
Common values
Value | Count | Frequency (%)
2001 | 20 | 66.7%
2011 | 4 | 13.3%
1996 | 2 | 6.7%
2019 | 1 | 3.3%
1999 | 1 | 3.3%
2007 | 1 | 3.3%
2003 | 1 | 3.3%

Minimum values
Value | Count | Frequency (%)
1996 | 2 | 6.7%
1999 | 1 | 3.3%
2001 | 20 | 66.7%
2003 | 1 | 3.3%
2007 | 1 | 3.3%
2011 | 4 | 13.3%
2019 | 1 | 3.3%

Maximum values
Value | Count | Frequency (%)
2019 | 1 | 3.3%
2011 | 4 | 13.3%
2007 | 1 | 3.3%
2003 | 1 | 3.3%
2001 | 20 | 66.7%
1999 | 1 | 3.3%
1996 | 2 | 6.7%
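
Because 건축년도 has only seven distinct values, its full frequency table is available and the summary statistics can be recomputed from it. The sketch below expands the counts into 30 observations and derives the sum, mean, sample variance, standard deviation, median, and two percentiles. The `percentile` helper is a hypothetical linear-interpolation implementation, which is assumed (not confirmed) to match the method behind the reported quantiles.

```python
import statistics

# Frequency table for 건축년도, copied from this section.
year_counts = {1996: 2, 1999: 1, 2001: 20, 2003: 1, 2007: 1, 2011: 4, 2019: 1}

# Expand the counts into a sorted list of individual observations.
years = sorted(y for y, c in year_counts.items() for _ in range(c))


def percentile(sorted_values, p):
    """Percentile by linear interpolation between closest ranks (p in 0..100)."""
    pos = (len(sorted_values) - 1) * p / 100
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


summary = {
    "n": len(years),
    "sum": sum(years),
    "mean": statistics.mean(years),
    "variance": statistics.variance(years),  # sample variance (n - 1)
    "std": statistics.stdev(years),          # sample standard deviation
    "median": statistics.median(years),
    "p5": percentile(years, 5),
    "p95": percentile(years, 95),
}

for name, value in summary.items():
    print(name, value)
```

With 20 of 30 buildings dated 2001, Q1, the median, and Q3 all coincide, which is why the IQR and MAD are both 0.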

건축층수 (number of floors)
Categorical

HIGH CORRELATION

Distinct: 9
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 9
Median length: 9
Mean length: 8.5333333
Min length: 8

Unique

Unique: 4
Unique (%): 13.3%

Sample

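1st row: 지하0층,지상3층 [0 basement floors, 3 above-ground floors]
2nd row: 지하0층,지상3층 [0 basement floors, 3 above-ground floors]
3rd row: 지하0층,지상2층 [0 basement floors, 2 above-ground floors]
4th row: 지하0층,지상1층 [0 basement floors, 1 above-ground floor]
5th row: 지하0층,지상1층 [0 basement floors, 1 above-ground floor]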

Common Values

Value | Count | Frequency (%)
지하0층,지상1층 | 13 | 43.3%
지하0, 지상1 | 7 | 23.3%
지하0층,지상3층 | 2 | 6.7%
지하1, 지상3 | 2 | 6.7%
지하1, 지상2 | 2 | 6.7%
지하0층,지상2층 | 1 | 3.3%
지하1, 지상4 | 1 | 3.3%
지하0, 지상2 | 1 | 3.3%
지하1, 지상1 | 1 | 3.3%
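
The common values mix two spellings of the same information, for example 지하0층,지상1층 and 지하0, 지상1 (지하 is basement, 지상 is above ground, 층 is floor). The sketch below shows one possible way to normalize both spellings into a pair of integers before analysis. The regular expression and the `parse_floors` helper are assumptions made for illustration and cover only the formats that appear in this table.

```python
import re
from collections import Counter

# Matches both "지하0층,지상1층" and "지하0, 지상1" (optional 층, optional space).
FLOOR_PATTERN = re.compile(r"지하(\d+)층?,\s*지상(\d+)층?")


def parse_floors(label):
    """Return (basement_floors, above_ground_floors) for a 건축층수 label."""
    match = FLOOR_PATTERN.fullmatch(label.strip())
    if match is None:
        raise ValueError(f"unrecognized floor label: {label!r}")
    return int(match.group(1)), int(match.group(2))


# Raw value counts, copied from the table above.
raw_counts = {
    "지하0층,지상1층": 13,
    "지하0, 지상1": 7,
    "지하0층,지상3층": 2,
    "지하1, 지상3": 2,
    "지하1, 지상2": 2,
    "지하0층,지상2층": 1,
    "지하1, 지상4": 1,
    "지하0, 지상2": 1,
    "지하1, 지상1": 1,
}

normalized = Counter()
for label, count in raw_counts.items():
    normalized[parse_floors(label)] += count

for floors, count in normalized.most_common():
    print(floors, count)
```

After normalization the 9 raw categories collapse to 7, and the single-storey, no-basement case accounts for 20 of the 30 buildings.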

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
지하0층,지상1층 | 13 | 29.5%
지하0 | 8 | 18.2%
지상1 | 8 | 18.2%
지하1 | 6 | 13.6%
지상2 | 3 | 6.8%
지하0층,지상3층 | 2 | 4.5%
지상3 | 2 | 4.5%
지하0층,지상2층 | 1 | 2.3%
지상4 | 1 | 2.3%

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2022-12-15 00:00:00
Maximum: 2022-12-15 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 건물명 | 연면적 | 건축년도 | 건축층수
건물명 | 1.000 | 1.000 | 1.000 | 1.000
연면적 | 1.000 | 1.000 | 0.835 | 0.855
건축년도 | 1.000 | 0.835 | 1.000 | 0.898
건축층수 | 1.000 | 0.855 | 0.898 | 1.000

Variable | 연면적 | 건축년도 | 건축층수
연면적 | 1.000 | -0.094 | 0.585
건축년도 | -0.094 | 1.000 | 0.661
건축층수 | 0.585 | 0.661 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

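First rows
Index | 건물명 (building name) | 연면적 (total floor area) | 건축년도 (construction year) | 건축층수 (number of floors) | 데이터기준일자 (data reference date)
0 | 관리동(경기도청소년야영장) | 515.58 | 1996 | 지하0층,지상3층 | 2022-12-15
1 | 종합지원본부(경기도청소년야영장) | 598.73 | 2019 | 지하0층,지상3층 | 2022-12-15
2 | 실내체육관(경기도청소년야영장) | 1310.82 | 2001 | 지하0층,지상2층 | 2022-12-15
3 | 수영장(경기도청소년야영장) | 102.96 | 2001 | 지하0층,지상1층 | 2022-12-15
4 | 1대피동(경기도청소년야영장) | 209.82 | 2001 | 지하0층,지상1층 | 2022-12-15
5 | 2대피동(경기도청소년야영장) | 209.82 | 2001 | 지하0층,지상1층 | 2022-12-15
6 | 3대피동(경기도청소년야영장) | 76.8 | 2001 | 지하0층,지상1층 | 2022-12-15
7 | 1영지 취사 및 샤워,화장실(경기도청소년야영장) | 95.48 | 2001 | 지하0층,지상1층 | 2022-12-15
8 | 2영지 취사 및 샤워,화장실(경기도청소년야영장) | 139.76 | 2001 | 지하0층,지상1층 | 2022-12-15
9 | 3영지 취사 및 샤워,화장실(경기도청소년야영장) | 135.0 | 2001 | 지하0층,지상1층 | 2022-12-15

Last rows
Index | 건물명 (building name) | 연면적 (total floor area) | 건축년도 (construction year) | 건축층수 (number of floors) | 데이터기준일자 (data reference date)
20 | 가마동(도자기)(경기도청소년수련원) | 56.7 | 2011 | 지하0, 지상1 | 2022-12-15
21 | 도자기강의동(경기도청소년수련원) | 425.25 | 2011 | 지하0, 지상1 | 2022-12-15
22 | 야외화장실(운동장)(경기도청소년수련원) | 64.37 | 2011 | 지하0, 지상1 | 2022-12-15
23 | 체험동(도자기)(경기도청소년수련원) | 198.45 | 2011 | 지하0, 지상1 | 2022-12-15
24 | 관리동(본관)(경기도청소년수련원) | 978.32 | 2001 | 지하1, 지상2 | 2022-12-15
25 | 야영장화장실(경기도청소년수련원) | 42.7 | 2001 | 지하0, 지상1 | 2022-12-15
26 | 옥외화장실 및 샤워장(경기도청소년수련원) | 105.6 | 2001 | 지하0, 지상1 | 2022-12-15
27 | 전망대(팔효정)(경기도청소년수련원) | 67.48 | 2001 | 지하0, 지상2 | 2022-12-15
28 | 체육관동(경기도청소년수련원) | 989.0 | 2001 | 지하0, 지상1 | 2022-12-15
29 | 갯벌체험장동(경기도청소년수련원) | 249.72 | 2003 | 지하1, 지상1 | 2022-12-15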