Overview

Dataset statistics

Number of variables: 8
Number of observations: 49
Missing cells: 1
Missing cells (%): 0.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.3 KiB
Average record size in memory: 69.7 B
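
The missing-cell percentage can be checked from the variable and observation counts. The following is a minimal sketch in plain Python; the variable names are illustrative and are not part of the report.

```python
# Figures taken from the dataset statistics of this report.
n_variables = 8
n_observations = 49
missing_cells = 1

# Total number of cells is rows x columns.
total_cells = n_variables * n_observations

# Share of missing cells, expressed as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

With 392 cells in total, a single missing cell rounds to the 0.3% shown in the report.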

Variable types

Categorical: 5
Numeric: 3

Dataset

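Description: Data on the DTRO Culture Festival (디트로문화한마당) held by Daegu Transportation Corporation (대구교통공사) from 2010 to 2022, covering the event period, venue, event content, and number of events.
URL: https://www.data.go.kr/data/15044939/fileData.do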

Alerts

참여단체 is highly overall correlated with 행사내용 (High correlation)
개최기간 is highly overall correlated with 개최년도 and 2 other fields (High correlation)
행사내용 is highly overall correlated with 참여단체 (High correlation)
행사장소 is highly overall correlated with 개최년도 and 2 other fields (High correlation)
구 분 is highly overall correlated with 개최년도 and 2 other fields (High correlation)
개최년도 is highly overall correlated with 구 분 and 2 other fields (High correlation)
건수 has 1 (2.0%) missing values (Missing)
단체 has 1 (2.0%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 18:05:12.066752
Analysis finished: 2023-12-12 18:05:13.563161
Duration: 1.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
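
A report of this kind can in principle be regenerated from the source file. The sketch below assumes the CSV has been downloaded from the dataset URL to a local path and that `pandas` and `ydata-profiling` are installed. The `cp949` encoding, the report title, and the function name `build_report` are assumptions for illustration, not details taken from this report.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is cp949-encoded; switch to "utf-8" if it is not.
    df = pd.read_csv(csv_path, encoding="cp949")

    # Default settings only; the options of the original run are not known here.
    report = ProfileReport(df, title="DTRO Culture Festival")
    report.to_file(html_path)
```

A call such as `build_report("dtro_festival.csv")` would write `report.html` next to the script; the file name is hypothetical.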

Variables

구 분
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 34.7%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Most frequent values: 제1회, 제6회, 제3회, 제7회, 제5회; Other values (12): 29

Length

Max length: 4
Median length: 3
Mean length: 3.2653061
Min length: 3

Unique

Unique: 6
Unique (%): 12.2%

Sample

1st row: 제1회
2nd row: 제1회
3rd row: 제1회
4th row: 제1회
5th row: 제2회

Common Values

Value | Count | Frequency (%)
제1회 | 4 | 8.2%
제6회 | 4 | 8.2%
제3회 | 4 | 8.2%
제7회 | 4 | 8.2%
제5회 | 4 | 8.2%
제4회 | 4 | 8.2%
제9회 | 4 | 8.2%
제8회 | 4 | 8.2%
제2회 | 4 | 8.2%
제11회 | 4 | 8.2%
Other values (7) | 9 | 18.4%

Length

[Figure: histogram of lengths of the category]

개최년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 14
Distinct (%): 28.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2015.6327
Minimum: 2010
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 2010
5-th percentile: 2010
Q1: 2013
Median: 2016
Q3: 2019
95-th percentile: 2021.6
Maximum: 2023
Range: 13
Interquartile range (IQR): 6
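
The quantiles can be reproduced from the per-year counts listed for this variable. The sketch below uses only the Python standard library and assumes linear interpolation between order statistics (the `inclusive` method of `statistics.quantiles`). That choice matches the reported values here, but it is an assumption about how the profiler computes them.

```python
import statistics

# Year -> number of rows, as listed in the frequency tables for 개최년도.
year_counts = {year: 4 for year in range(2010, 2019)}
year_counts.update({2019: 7, 2020: 1, 2021: 2, 2022: 2, 2023: 1})

# Expand the counts back into one sorted value per observation.
years = sorted(y for y, n in year_counts.items() for _ in range(n))

# Quartiles with linear interpolation over the observed range.
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")
iqr = q3 - q1

# 5th and 95th percentiles are the first and last of the 19 cut points for n=20.
ventiles = statistics.quantiles(years, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

print(q1, median, q3, iqr, p5, p95)
```

The fractional 95th percentile (2021.6) arises because the cut point falls between a 2021 row and a 2022 row.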

Descriptive statistics

Standard deviation: 3.6153255
Coefficient of variation (CV): 0.001793643
Kurtosis: -0.97445627
Mean: 2015.6327
Median Absolute Deviation (MAD): 3
Skewness: 0.11207595
Sum: 98766
Variance: 13.070578
Monotonicity: Increasing
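
Several of these descriptive statistics follow directly from the per-year counts. The sketch below is an illustration using the Python standard library. It assumes the sample (n - 1) variance, which agrees with the reported figures, and defines the coefficient of variation as standard deviation divided by mean.

```python
import statistics

# Year -> number of rows, as listed in the frequency tables for 개최년도.
year_counts = {year: 4 for year in range(2010, 2019)}
year_counts.update({2019: 7, 2020: 1, 2021: 2, 2022: 2, 2023: 1})

# One value per observation.
years = sorted(y for y, n in year_counts.items() for _ in range(n))

total = sum(years)
mean = statistics.fmean(years)
variance = statistics.variance(years)  # sample variance, divides by n - 1
std_dev = statistics.stdev(years)      # square root of the sample variance
cv = std_dev / mean                    # coefficient of variation

print(total, round(mean, 4), round(variance, 6), round(std_dev, 7), cv)
```

Because the years are large relative to their spread, the coefficient of variation is very small and says little about this column.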
[Figure: histogram with fixed size bins (bins=14)]
Common values

Value | Count | Frequency (%)
2019 | 7 | 14.3%
2010 | 4 | 8.2%
2011 | 4 | 8.2%
2012 | 4 | 8.2%
2013 | 4 | 8.2%
2014 | 4 | 8.2%
2015 | 4 | 8.2%
2016 | 4 | 8.2%
2017 | 4 | 8.2%
2018 | 4 | 8.2%
Other values (4) | 6 | 12.2%

Minimum 10 values

Value | Count | Frequency (%)
2010 | 4 | 8.2%
2011 | 4 | 8.2%
2012 | 4 | 8.2%
2013 | 4 | 8.2%
2014 | 4 | 8.2%
2015 | 4 | 8.2%
2016 | 4 | 8.2%
2017 | 4 | 8.2%
2018 | 4 | 8.2%
2019 | 7 | 14.3%

Maximum 10 values

Value | Count | Frequency (%)
2023 | 1 | 2.0%
2022 | 2 | 4.1%
2021 | 2 | 4.1%
2020 | 1 | 2.0%
2019 | 7 | 14.3%
2018 | 4 | 8.2%
2017 | 4 | 8.2%
2016 | 4 | 8.2%
2015 | 4 | 8.2%
2014 | 4 | 8.2%

개최기간
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 34.7%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Most frequent values: 11-24~11-30, 10-23~10-29, 10-11~10-17, 11-02~11-06, 10-22~10-28; Other values (12): 29

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 6
Unique (%): 12.2%

Sample

1st row: 11-24~11-30
2nd row: 11-24~11-30
3rd row: 11-24~11-30
4th row: 11-24~11-30
5th row: 11-16~11-22

Common Values

Value | Count | Frequency (%)
11-24~11-30 | 4 | 8.2%
10-23~10-29 | 4 | 8.2%
10-11~10-17 | 4 | 8.2%
11-02~11-06 | 4 | 8.2%
10-22~10-28 | 4 | 8.2%
10-16~10-22 | 4 | 8.2%
10-31~11-01 | 4 | 8.2%
11-01~11-05 | 4 | 8.2%
11-16~11-22 | 4 | 8.2%
10-29~10-31 | 4 | 8.2%
Other values (7) | 9 | 18.4%

Length

[Figure: histogram of lengths of the category]

행사장소
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 20.4%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Most frequent values: 1-2호선 41개역 (24), 1-2호선 20개역_3호선열차내, 1-2호선 37개역_3호선열차내, 1-2호선 39개역_3호선 1개역, 1-2-3호선 23개역; Other values (5)

Length

Max length: 21
Median length: 18
Mean length: 12.714286
Min length: 9

Unique

Unique: 2
Unique (%): 4.1%

Sample

1st row: 1-2호선 41개역
2nd row: 1-2호선 41개역
3rd row: 1-2호선 41개역
4th row: 1-2호선 41개역
5th row: 1-2호선 41개역

Common Values

Value | Count | Frequency (%)
1-2호선 41개역 | 24 | 49.0%
1-2호선 20개역_3호선열차내 | 4 | 8.2%
1-2호선 37개역_3호선열차내 | 4 | 8.2%
1-2호선 39개역_3호선 1개역 | 4 | 8.2%
1-2-3호선 23개역 | 4 | 8.2%
환승역 3개역(반월당_청라언덕_명덕역) | 3 | 6.1%
공사 유튜브 채널 | 2 | 4.1%
1-2-3호선 주요역사 | 2 | 4.1%
1-2-3호선 8개역 | 1 | 2.0%
1-2-3호선 7개역 | 1 | 2.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

Words

Value | Count | Frequency (%)
1-2호선 | 36 | 34.6%
41개역 | 24 | 23.1%
1-2-3호선 | 8 | 7.7%
20개역_3호선열차내 | 4 | 3.8%
37개역_3호선열차내 | 4 | 3.8%
39개역_3호선 | 4 | 3.8%
1개역 | 4 | 3.8%
23개역 | 4 | 3.8%
환승역 | 3 | 2.9%
3개역(반월당_청라언덕_명덕역 | 3 | 2.9%
Other values (6) | 10 | 9.6%

행사내용
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Most frequent values: 전시 (13), 공연 (11), 체험 (11), 나눔(건강검진) (10), 온라인공연; Other values (2)

Length

Max length: 9
Median length: 2
Mean length: 3.5714286
Min length: 2

Unique

Unique: 2
Unique (%): 4.1%

Sample

1st row: 공연
2nd row: 전시
3rd row: 체험
4th row: 나눔(건강검진)
5th row: 공연

Common Values

Value | Count | Frequency (%)
전시 | 13 | 26.5%
공연 | 11 | 22.4%
체험 | 11 | 22.4%
나눔(건강검진) | 10 | 20.4%
온라인공연 | 2 | 4.1%
전시, 온라인공연 | 1 | 2.0%
전지, 체험 | 1 | 2.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

Words

Value | Count | Frequency (%)
전시 | 14 | 27.5%
체험 | 12 | 23.5%
공연 | 11 | 21.6%
나눔(건강검진 | 10 | 19.6%
온라인공연 | 3 | 5.9%
전지 | 1 | 2.0%
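
The word-level counts differ from the value-level counts because multi-word values such as "전시, 온라인공연" contribute to more than one word. The sketch below reproduces the word counts from the common values of 행사내용. It assumes values are split on whitespace and that leading and trailing ASCII punctuation is stripped from each token. This is an assumption about the profiler's tokenization, chosen because it matches the table, including the truncated "나눔(건강검진".

```python
import string
from collections import Counter

# Value -> count, taken from the common values of 행사내용.
value_counts = {
    "전시": 13,
    "공연": 11,
    "체험": 11,
    "나눔(건강검진)": 10,
    "온라인공연": 2,
    "전시, 온라인공연": 1,
    "전지, 체험": 1,
}

word_counts = Counter()
for value, count in value_counts.items():
    for token in value.split():
        # Strip punctuation at both ends only; inner characters are kept.
        word = token.strip(string.punctuation)
        if word:
            word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(word, count, f"{100 * count / total_words:.1f}%")
```

The percentages are taken over the 51 words rather than the 49 rows, which is why 전시 appears as 27.5% here and 26.5% in the value table.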

건수
Real number (ℝ)

MISSING 

Distinct: 36
Distinct (%): 75.0%
Missing: 1
Missing (%): 2.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49.270833
Minimum: 1
Maximum: 133
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 1
5-th percentile: 8
Q1: 13.5
Median: 34.5
Q3: 69.25
95-th percentile: 123.5
Maximum: 133
Range: 132
Interquartile range (IQR): 55.75

Descriptive statistics

Standard deviation: 39.598377
Coefficient of variation (CV): 0.80368799
Kurtosis: -0.54924232
Mean: 49.270833
Median Absolute Deviation (MAD): 24
Skewness: 0.81023171
Sum: 2365
Variance: 1568.0315
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=36)]
Common values

Value | Count | Frequency (%)
26 | 4 | 8.2%
12 | 3 | 6.1%
30 | 2 | 4.1%
133 | 2 | 4.1%
115 | 2 | 4.1%
9 | 2 | 4.1%
73 | 2 | 4.1%
8 | 2 | 4.1%
11 | 2 | 4.1%
68 | 1 | 2.0%
Other values (26) | 26 | 53.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.0%
7 | 1 | 2.0%
8 | 2 | 4.1%
9 | 2 | 4.1%
10 | 1 | 2.0%
11 | 2 | 4.1%
12 | 3 | 6.1%
14 | 1 | 2.0%
21 | 1 | 2.0%
24 | 1 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
133 | 2 | 4.1%
127 | 1 | 2.0%
117 | 1 | 2.0%
115 | 2 | 4.1%
114 | 1 | 2.0%
111 | 1 | 2.0%
97 | 1 | 2.0%
85 | 1 | 2.0%
73 | 2 | 4.1%
68 | 1 | 2.0%

참여단체
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 16.3%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Most frequent values: 공공단체_ 협회 (12), 동호회_ 공연단 (11), 학교_ 종교단체 (11), 병원_기업_개인 등 (10), 지역예술단체; Other values (3)

Length

Max length: 10
Median length: 8
Mean length: 8.3673469
Min length: 6

Unique

Unique: 3
Unique (%): 6.1%

Sample

1st row: 공공단체_협회
2nd row: 동호회_ 공연단
3rd row: 학교_ 종교단체
4th row: 병원_기업_개인 등
5th row: 공공단체_ 협회

Common Values

Value | Count | Frequency (%)
공공단체_ 협회 | 12 | 24.5%
동호회_ 공연단 | 11 | 22.4%
학교_ 종교단체 | 11 | 22.4%
병원_기업_개인 등 | 10 | 20.4%
지역예술단체 | 2 | 4.1%
공공단체_협회 | 1 | 2.0%
공연단체_협회 등 | 1 | 2.0%
공공기관_동호회 등 | 1 | 2.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

Words

Value | Count | Frequency (%)
공공단체 | 12 | 12.6%
협회 | 12 | 12.6%
등 | 12 | 12.6%
동호회 | 11 | 11.6%
공연단 | 11 | 11.6%
학교 | 11 | 11.6%
종교단체 | 11 | 11.6%
병원_기업_개인 | 10 | 10.5%
지역예술단체 | 2 | 2.1%
공공단체_협회 | 1 | 1.1%
Other values (2) | 2 | 2.1%

단체
Real number (ℝ)

ZEROS 

Distinct: 34
Distinct (%): 69.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.673469
Minimum: 0
Maximum: 95
Zeros: 1
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 0
5-th percentile: 7.4
Q1: 15
Median: 31
Q3: 51
95-th percentile: 77.2
Maximum: 95
Range: 95
Interquartile range (IQR): 36

Descriptive statistics

Standard deviation: 22.835816
Coefficient of variation (CV): 0.64013443
Kurtosis: -0.17884288
Mean: 35.673469
Median Absolute Deviation (MAD): 16
Skewness: 0.62977207
Sum: 1748
Variance: 521.47449
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=34)]
Common values

Value | Count | Frequency (%)
27 | 4 | 8.2%
15 | 3 | 6.1%
34 | 3 | 6.1%
8 | 2 | 4.1%
42 | 2 | 4.1%
11 | 2 | 4.1%
51 | 2 | 4.1%
30 | 2 | 4.1%
31 | 2 | 4.1%
43 | 2 | 4.1%
Other values (24) | 25 | 51.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 1 | 2.0%
2 | 1 | 2.0%
7 | 1 | 2.0%
8 | 2 | 4.1%
10 | 2 | 4.1%
11 | 2 | 4.1%
12 | 1 | 2.0%
15 | 3 | 6.1%
23 | 1 | 2.0%
24 | 1 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
95 | 1 | 2.0%
81 | 1 | 2.0%
80 | 1 | 2.0%
73 | 1 | 2.0%
72 | 1 | 2.0%
70 | 1 | 2.0%
68 | 1 | 2.0%
59 | 1 | 2.0%
56 | 1 | 2.0%
54 | 1 | 2.0%

Interactions

[Figures: 9 interaction plots; images not included]

Correlations

[Figure: correlation heatmap]

Variable | 구 분 | 개최년도 | 개최기간 | 행사장소 | 행사내용 | 건수 | 참여단체 | 단체
구 분 | 1.000 | 1.000 | 1.000 | 1.000 | 0.811 | 0.000 | 0.792 | 0.228
개최년도 | 1.000 | 1.000 | 1.000 | 0.968 | 0.419 | 0.520 | 0.419 | 0.085
개최기간 | 1.000 | 1.000 | 1.000 | 1.000 | 0.811 | 0.000 | 0.792 | 0.228
행사장소 | 1.000 | 0.968 | 1.000 | 1.000 | 0.708 | 0.000 | 0.663 | 0.667
행사내용 | 0.811 | 0.419 | 0.811 | 0.708 | 1.000 | 0.506 | 0.985 | 0.540
건수 | 0.000 | 0.520 | 0.000 | 0.000 | 0.506 | 1.000 | 0.481 | 0.567
참여단체 | 0.792 | 0.419 | 0.792 | 0.663 | 0.985 | 0.481 | 1.000 | 0.539
단체 | 0.228 | 0.085 | 0.228 | 0.667 | 0.540 | 0.567 | 0.539 | 1.000

[Figure: correlation heatmap]

Variable | 참여단체 | 개최기간 | 행사내용 | 행사장소 | 구 분
참여단체 | 1.000 | 0.422 | 0.961 | 0.379 | 0.422
개최기간 | 0.422 | 1.000 | 0.465 | 0.906 | 1.000
행사내용 | 0.961 | 0.465 | 1.000 | 0.439 | 0.465
행사장소 | 0.379 | 0.906 | 0.439 | 1.000 | 0.906
구 분 | 0.422 | 1.000 | 0.465 | 0.906 | 1.000

[Figure: correlation heatmap]

Variable | 개최년도 | 건수 | 단체 | 구 분 | 개최기간 | 행사장소 | 행사내용 | 참여단체
개최년도 | 1.000 | -0.489 | -0.393 | 0.906 | 0.906 | 0.698 | 0.247 | 0.205
건수 | -0.489 | 1.000 | 0.382 | 0.000 | 0.000 | 0.000 | 0.267 | 0.238
단체 | -0.393 | 0.382 | 1.000 | 0.000 | 0.000 | 0.252 | 0.292 | 0.278
구 분 | 0.906 | 0.000 | 0.000 | 1.000 | 1.000 | 0.906 | 0.465 | 0.422
개최기간 | 0.906 | 0.000 | 0.000 | 1.000 | 1.000 | 0.906 | 0.465 | 0.422
행사장소 | 0.698 | 0.000 | 0.252 | 0.906 | 0.906 | 1.000 | 0.439 | 0.379
행사내용 | 0.247 | 0.267 | 0.292 | 0.465 | 0.465 | 0.439 | 1.000 | 0.961
참여단체 | 0.205 | 0.238 | 0.278 | 0.422 | 0.422 | 0.379 | 0.961 | 1.000

Missing values

[Figure: count of non-missing values per column]
A simple visualization of nullity by column.

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

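First rows

Index | 구 분 | 개최년도 | 개최기간 | 행사장소 | 행사내용 | 건수 | 참여단체 | 단체
0 | 제1회 | 2010 | 11-24~11-30 | 1-2호선 41개역 | 공연 | 73 | 공공단체_협회 | 27
1 | 제1회 | 2010 | 11-24~11-30 | 1-2호선 41개역 | 전시 | 32 | 동호회_ 공연단 | 23
2 | 제1회 | 2010 | 11-24~11-30 | 1-2호선 41개역 | 체험 | 11 | 학교_ 종교단체 | 33
3 | 제1회 | 2010 | 11-24~11-30 | 1-2호선 41개역 | 나눔(건강검진) | 24 | 병원_기업_개인 등 | 15
4 | 제2회 | 2011 | 11-16~11-22 | 1-2호선 41개역 | 공연 | 97 | 공공단체_ 협회 | 27
5 | 제2회 | 2011 | 11-16~11-22 | 1-2호선 41개역 | 전시 | 37 | 동호회_ 공연단 | 54
6 | 제2회 | 2011 | 11-16~11-22 | 1-2호선 41개역 | 체험 | 44 | 학교_ 종교단체 | 43
7 | 제2회 | 2011 | 11-16~11-22 | 1-2호선 41개역 | 나눔(건강검진) | 30 | 병원_기업_개인 등 | 27
8 | 제3회 | 2012 | 10-11~10-17 | 1-2호선 41개역 | 공연 | 114 | 공공단체_ 협회 | 31
9 | 제3회 | 2012 | 10-11~10-17 | 1-2호선 41개역 | 전시 | 56 | 동호회_ 공연단 | 70

Last rows

Index | 구 분 | 개최년도 | 개최기간 | 행사장소 | 행사내용 | 건수 | 참여단체 | 단체
39 | 제11회 | 2019 | 10-29~10-31 | 1-2-3호선 23개역 | 공연 | 68 | 공공단체_ 협회 | 15
40 | 제11회 | 2019 | 10-29~10-31 | 1-2-3호선 23개역 | 전시 | 8 | 동호회_ 공연단 | 72
41 | 제11회 | 2019 | 10-29~10-31 | 1-2-3호선 23개역 | 체험 | 12 | 학교_ 종교단체 | 12
42 | 제11회 | 2019 | 10-29~10-31 | 1-2-3호선 23개역 | 나눔(건강검진) | 9 | 병원_기업_개인 등 | 11
43 | 제12회 | 2020 | 11-02~11-09 | 공사 유튜브 채널 | 온라인공연 | 10 | 지역예술단체 | 10
44 | 제13회 | 2021 | 02-22~03-02 | 1-2-3호선 8개역 | 전시 | 8 | 공공단체_ 협회 | 8
45 | 제14회 | 2021 | 11-05~11-10 | 공사 유튜브 채널 | 온라인공연 | 11 | 지역예술단체 | 11
46 | 제15회 | 2022 | 02-21~03-04 | 1-2-3호선 7개역 | 전시 | 7 | 공공단체_ 협회 | 7
47 | 제16회 | 2022 | 11.10~11.16 | 1-2-3호선 주요역사 | 전시, 온라인공연 | 52 | 공연단체_협회 등 | 52
48 | 제17회 | 2023 | 02.15~02.28 | 1-2-3호선 주요역사 | 전지, 체험 | 12 | 공공기관_동호회 등 | 8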