Overview

Dataset statistics

Number of variables10
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory87.4 B

Variable types

Text3
Categorical1
DateTime2
Numeric3
Boolean1

Dataset

Description강원도내 문화시설현황으로 등록공연장의 면적, 객석수, 구동무대기계, 등
Author강원도
URLhttps://www.data.go.kr/data/3073012/fileData.do

Alerts

공연장면적(㎡) is highly overall correlated with 객석수 and 1 other fieldsHigh correlation
객석수 is highly overall correlated with 공연장면적(㎡)High correlation
구동무대기계수 is highly overall correlated with 공연장면적(㎡)High correlation
시설구분 is highly imbalanced (53.1%)Imbalance
시설명 has unique valuesUnique
공연장면적(㎡) has unique valuesUnique
구동무대기계수 has 10 (33.3%) zerosZeros

Reproduction

Analysis started2023-12-12 14:18:07.753882
Analysis finished2023-12-12 14:18:09.653301
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct16
Distinct (%)53.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T23:18:09.775647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters90
Distinct characters30
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)30.0%

Sample

1st row강릉시
2nd row강릉시
3rd row고성군
4th row고성군
5th row고성군
ValueCountFrequency (%)
원주시 5
16.7%
춘천시 5
16.7%
고성군 3
10.0%
강릉시 2
 
6.7%
동해시 2
 
6.7%
정선군 2
 
6.7%
홍천군 2
 
6.7%
삼척시 1
 
3.3%
속초시 1
 
3.3%
양구군 1
 
3.3%
Other values (6) 6
20.0%
2023-12-12T23:18:10.068115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
18.9%
13
14.4%
8
 
8.9%
5
 
5.6%
5
 
5.6%
5
 
5.6%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
Other values (20) 26
28.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
18.9%
13
14.4%
8
 
8.9%
5
 
5.6%
5
 
5.6%
5
 
5.6%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
Other values (20) 26
28.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
18.9%
13
14.4%
8
 
8.9%
5
 
5.6%
5
 
5.6%
5
 
5.6%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
Other values (20) 26
28.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
18.9%
13
14.4%
8
 
8.9%
5
 
5.6%
5
 
5.6%
5
 
5.6%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
Other values (20) 26
28.9%

시설명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T23:18:10.316108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.0666667
Min length4

Characters and Unicode

Total characters242
Distinct characters80
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row강릉문화예술회관
2nd row강릉단오문화관
3rd row고성문화의집 문화관람실
4th row고성군문화복지센터 대공연장
5th row고성군문화복지센터 소공연장
ValueCountFrequency (%)
고성군문화복지센터 2
 
5.4%
강릉문화예술회관 1
 
2.7%
꿈의나라 1
 
2.7%
하늘내린센터 1
 
2.7%
정선문화예술회관 1
 
2.7%
카사시네마 1
 
2.7%
춘천문화예술회관 1
 
2.7%
강원국악예술회관 1
 
2.7%
물의나라 1
 
2.7%
강원대학교 1
 
2.7%
Other values (26) 26
70.3%
2023-12-12T23:18:10.761735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
8.7%
20
 
8.3%
17
 
7.0%
14
 
5.8%
14
 
5.8%
13
 
5.4%
7
 
2.9%
7
 
2.9%
5
 
2.1%
5
 
2.1%
Other values (70) 119
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 235
97.1%
Space Separator 7
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
8.9%
20
 
8.5%
17
 
7.2%
14
 
6.0%
14
 
6.0%
13
 
5.5%
7
 
3.0%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (69) 115
48.9%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 235
97.1%
Common 7
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
8.9%
20
 
8.5%
17
 
7.2%
14
 
6.0%
14
 
6.0%
13
 
5.5%
7
 
3.0%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (69) 115
48.9%
Common
ValueCountFrequency (%)
7
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 235
97.1%
ASCII 7
 
2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
 
8.9%
20
 
8.5%
17
 
7.2%
14
 
6.0%
14
 
6.0%
13
 
5.5%
7
 
3.0%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (69) 115
48.9%
ASCII
ValueCountFrequency (%)
7
100.0%

시설구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
공공
27 
민간

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공
2nd row공공
3rd row공공
4th row공공
5th row공공

Common Values

ValueCountFrequency (%)
공공 27
90.0%
민간 3
 
10.0%

Length

2023-12-12T23:18:10.924006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:18:11.046827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공 27
90.0%
민간 3
 
10.0%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum1983-03-12 00:00:00
Maximum2009-08-06 00:00:00
2023-12-12T23:18:11.189469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:11.351298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
Distinct27
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum1993-10-08 00:00:00
Maximum2009-09-03 00:00:00
2023-12-12T23:18:11.477407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:11.598373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)

공연장면적(㎡)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2574.5183
Minimum116
Maximum9782.83
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-12T23:18:11.752485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum116
5-th percentile241.208
Q1506.14
median1026.57
Q34038.75
95-th percentile8356
Maximum9782.83
Range9666.83
Interquartile range (IQR)3532.61

Descriptive statistics

Standard deviation2798.4866
Coefficient of variation (CV)1.0869943
Kurtosis0.61034478
Mean2574.5183
Median Absolute Deviation (MAD)779.29
Skewness1.2601349
Sum77235.55
Variance7831527.4
MonotonicityNot monotonic
2023-12-12T23:18:11.915014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
8986.0 1
 
3.3%
6413.0 1
 
3.3%
809.0 1
 
3.3%
571.0 1
 
3.3%
694.0 1
 
3.3%
4196.0 1
 
3.3%
1421.0 1
 
3.3%
429.04 1
 
3.3%
5597.0 1
 
3.3%
3037.0 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
116.0 1
3.3%
186.56 1
3.3%
308.0 1
3.3%
429.04 1
3.3%
448.0 1
3.3%
454.0 1
3.3%
491.0 1
3.3%
505.0 1
3.3%
509.56 1
3.3%
571.0 1
3.3%
ValueCountFrequency (%)
9782.83 1
3.3%
8986.0 1
3.3%
7586.0 1
3.3%
6413.0 1
3.3%
5597.0 1
3.3%
5247.0 1
3.3%
5015.0 1
3.3%
4196.0 1
3.3%
3567.0 1
3.3%
3037.0 1
3.3%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T23:18:12.121814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.0666667
Min length2

Characters and Unicode

Total characters92
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row363
2nd row161
3rd row63
4th row137.2
5th row26.4
ValueCountFrequency (%)
63 2
 
6.7%
363 1
 
3.3%
야외 1
 
3.3%
261 1
 
3.3%
728 1
 
3.3%
617 1
 
3.3%
53.54 1
 
3.3%
618 1
 
3.3%
194 1
 
3.3%
113.48 1
 
3.3%
Other values (19) 19
63.3%
2023-12-12T23:18:12.492491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 13
14.1%
6 12
13.0%
3 11
12.0%
2 10
10.9%
7 8
8.7%
4 8
8.7%
8 7
7.6%
9 6
6.5%
5 5
 
5.4%
. 5
 
5.4%
Other values (3) 7
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 85
92.4%
Other Punctuation 5
 
5.4%
Other Letter 2
 
2.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 13
15.3%
6 12
14.1%
3 11
12.9%
2 10
11.8%
7 8
9.4%
4 8
9.4%
8 7
8.2%
9 6
7.1%
5 5
 
5.9%
0 5
 
5.9%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 90
97.8%
Hangul 2
 
2.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 13
14.4%
6 12
13.3%
3 11
12.2%
2 10
11.1%
7 8
8.9%
4 8
8.9%
8 7
7.8%
9 6
6.7%
5 5
 
5.6%
. 5
 
5.6%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90
97.8%
Hangul 2
 
2.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 13
14.4%
6 12
13.3%
3 11
12.2%
2 10
11.1%
7 8
8.9%
4 8
8.9%
8 7
7.8%
9 6
6.7%
5 5
 
5.6%
. 5
 
5.6%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

객석수
Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean696.4
Minimum120
Maximum4291
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-12T23:18:12.667064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum120
5-th percentile141.6
Q1354.5
median490.5
Q3722
95-th percentile1673.25
Maximum4291
Range4171
Interquartile range (IQR)367.5

Descriptive statistics

Standard deviation777.26104
Coefficient of variation (CV)1.1161129
Kurtosis16.356021
Mean696.4
Median Absolute Deviation (MAD)190
Skewness3.7297226
Sum20892
Variance604134.73
MonotonicityNot monotonic
2023-12-12T23:18:12.792236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
120 2
 
6.7%
438 1
 
3.3%
989 1
 
3.3%
504 1
 
3.3%
346 1
 
3.3%
506 1
 
3.3%
454 1
 
3.3%
796 1
 
3.3%
300 1
 
3.3%
1815 1
 
3.3%
Other values (19) 19
63.3%
ValueCountFrequency (%)
120 2
6.7%
168 1
3.3%
231 1
3.3%
288 1
3.3%
300 1
3.3%
301 1
3.3%
346 1
3.3%
380 1
3.3%
400 1
3.3%
424 1
3.3%
ValueCountFrequency (%)
4291 1
3.3%
1815 1
3.3%
1500 1
3.3%
1018 1
3.3%
989 1
3.3%
948 1
3.3%
796 1
3.3%
734 1
3.3%
686 1
3.3%
660 1
3.3%

구동무대기계수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct17
Distinct (%)56.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.266667
Minimum0
Maximum67
Zeros10
Zeros (%)33.3%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-12T23:18:12.942523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median11.5
Q333
95-th percentile62
Maximum67
Range67
Interquartile range (IQR)33

Descriptive statistics

Standard deviation22.269406
Coefficient of variation (CV)1.0988194
Kurtosis-0.49172348
Mean20.266667
Median Absolute Deviation (MAD)11.5
Skewness0.88195665
Sum608
Variance495.92644
MonotonicityNot monotonic
2023-12-12T23:18:13.092829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 10
33.3%
62 2
 
6.7%
33 2
 
6.7%
30 2
 
6.7%
57 2
 
6.7%
34 1
 
3.3%
24 1
 
3.3%
4 1
 
3.3%
9 1
 
3.3%
10 1
 
3.3%
Other values (7) 7
23.3%
ValueCountFrequency (%)
0 10
33.3%
4 1
 
3.3%
6 1
 
3.3%
8 1
 
3.3%
9 1
 
3.3%
10 1
 
3.3%
13 1
 
3.3%
14 1
 
3.3%
19 1
 
3.3%
24 1
 
3.3%
ValueCountFrequency (%)
67 1
3.3%
62 2
6.7%
57 2
6.7%
36 1
3.3%
34 1
3.3%
33 2
6.7%
30 2
6.7%
24 1
3.3%
19 1
3.3%
14 1
3.3%
Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
15 
False
15 
ValueCountFrequency (%)
True 15
50.0%
False 15
50.0%
2023-12-12T23:18:13.245390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T23:18:09.081260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.208338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.806846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:09.201461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.316358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.888726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:09.294453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.399783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:08.981781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:18:13.331378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구시설명시설구분개관일자공연장등록일공연장면적(㎡)무대면적(㎡)객석수구동무대기계수문예회관
시군구1.0001.0000.0001.0001.0000.3160.9710.0000.5680.000
시설명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
시설구분0.0001.0001.0001.0001.0000.0001.0000.0000.0000.201
개관일자1.0001.0001.0001.0001.0001.0000.9901.0001.0001.000
공연장등록일1.0001.0001.0001.0001.0000.9920.9820.9420.6520.000
공연장면적(㎡)0.3161.0000.0001.0000.9921.0001.0000.8930.3760.000
무대면적(㎡)0.9711.0001.0000.9900.9821.0001.0001.0001.0001.000
객석수0.0001.0000.0001.0000.9420.8931.0001.0000.4370.583
구동무대기계수0.5681.0000.0001.0000.6520.3761.0000.4371.0000.321
문예회관0.0001.0000.2011.0000.0000.0001.0000.5830.3211.000
2023-12-12T23:18:13.502906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분문예회관
시설구분1.0000.124
문예회관0.1241.000
2023-12-12T23:18:13.589541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공연장면적(㎡)객석수구동무대기계수시설구분문예회관
공연장면적(㎡)1.0000.7910.5150.0000.000
객석수0.7911.0000.4820.0000.385
구동무대기계수0.5150.4821.0000.0000.194
시설구분0.0000.0000.0001.0000.124
문예회관0.0000.3850.1940.1241.000

Missing values

2023-12-12T23:18:09.431891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:18:09.598041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구시설명시설구분개관일자공연장등록일공연장면적(㎡)무대면적(㎡)객석수구동무대기계수문예회관
0강릉시강릉문화예술회관공공1992-03-072005-10-188986.036343834Y
1강릉시강릉단오문화관공공2004-01-192007-04-181195.016148519N
2고성군고성문화의집 문화관람실공공2002-12-012002-12-01308.0632880N
3고성군고성군문화복지센터 대공연장공공2005-12-082007-01-04509.56137.23010Y
4고성군고성군문화복지센터 소공연장공공2005-12-082007-01-04186.5626.41680Y
5동해시동해문화예술회관공공1995-03-022002-12-185247.031056733Y
6동해시동해야외공연장공공2006-07-052006-07-052538.027715000N
7삼척시삼척문화예술회관공공1994-06-042003-01-023567.052894830Y
8속초시속초문화예술회관공공1990-10-102003-08-192987.026473430Y
9양구군양구문화복지센터공연장공공2000-12-012003-02-06491.01263808N
시군구시설명시설구분개관일자공연장등록일공연장면적(㎡)무대면적(㎡)객석수구동무대기계수문예회관
20춘천시춘천문화예술회관공공1993-06-212002-12-129782.83828101862Y
21춘천시강원국악예술회관공공2000-09-182002-12-12858.14113.481209N
22춘천시물의나라 꿈의나라공공2000-12-272002-12-233037.01944974N
23춘천시강원대학교 백령문화관공공1995-03-062003-02-125597.0618181533N
24춘천시봄내극장공공2000-09-062003-08-23429.0453.543000N
25태백시태백문화예술회관공공2003-07-252003-07-251421.061779657Y
26평창군평창문화예술회관공공1999-10-051999-10-054196.07284540Y
27홍천군홍천문화예술회관공공1995-03-232002-12-26694.026150624Y
28홍천군홍천문화원공공1983-03-122002-12-26571.0633460N
29화천군화천문화예술회관공공1997-01-312003-02-04809.01475040Y