Overview

Dataset statistics

Number of variables7
Number of observations60
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory60.2 B

Variable types

Numeric2
Categorical5

Alerts

ORIGIN_CL_NM has constant value ""Constant
SRCHWRD_NM is highly overall correlated with ORIGIN_ID and 1 other fieldsHigh correlation
REPRSNT_KWRD_NM is highly overall correlated with ORIGIN_ID and 1 other fieldsHigh correlation
ORIGIN_ID is highly overall correlated with REPRSNT_KWRD_NM and 1 other fieldsHigh correlation
SEQ_NO has unique valuesUnique
SCCNT_VALUE has unique valuesUnique

Reproduction

Analysis started2023-12-10 09:59:42.628801
Analysis finished2023-12-10 09:59:44.750898
Duration2.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

SEQ_NO
Real number (ℝ)

UNIQUE 

Distinct60
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean131540.25
Minimum122563
Maximum139381
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2023-12-10T18:59:44.943420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum122563
5-th percentile122900.35
Q1125165.75
median131156
Q3138411
95-th percentile139375.2
Maximum139381
Range16818
Interquartile range (IQR)13245.25

Descriptive statistics

Standard deviation6934.4625
Coefficient of variation (CV)0.052717419
Kurtosis-1.9467811
Mean131540.25
Median Absolute Deviation (MAD)6935
Skewness-0.020426443
Sum7892415
Variance48086770
MonotonicityNot monotonic
2023-12-10T18:59:45.232119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
125233 1
 
1.7%
138282 1
 
1.7%
136277 1
 
1.7%
138615 1
 
1.7%
139379 1
 
1.7%
138341 1
 
1.7%
138969 1
 
1.7%
139340 1
 
1.7%
139373 1
 
1.7%
138342 1
 
1.7%
Other values (50) 50
83.3%
ValueCountFrequency (%)
122563 1
1.7%
122564 1
1.7%
122565 1
1.7%
122918 1
1.7%
122919 1
1.7%
122920 1
1.7%
123341 1
1.7%
123342 1
1.7%
123343 1
1.7%
124904 1
1.7%
ValueCountFrequency (%)
139381 1
1.7%
139380 1
1.7%
139379 1
1.7%
139375 1
1.7%
139374 1
1.7%
139373 1
1.7%
139342 1
1.7%
139341 1
1.7%
139340 1
1.7%
138971 1
1.7%

SCCNT_YM
Categorical

Distinct6
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
2020-07
10 
2020-08
10 
2020-09
10 
2020-10
10 
2020-11
10 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-07
2nd row2020-07
3rd row2020-07
4th row2020-07
5th row2020-07

Common Values

ValueCountFrequency (%)
2020-07 10
16.7%
2020-08 10
16.7%
2020-09 10
16.7%
2020-10 10
16.7%
2020-11 10
16.7%
2020-12 10
16.7%

Length

2023-12-10T18:59:45.540802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:59:45.742981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-07 10
16.7%
2020-08 10
16.7%
2020-09 10
16.7%
2020-10 10
16.7%
2020-11 10
16.7%
2020-12 10
16.7%

ORIGIN_ID
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
FC001840
FC001247
FC001865
FC001900
FC001667
Other values (5)
30 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFC001840
2nd rowFC001247
3rd rowFC001865
4th rowFC001900
5th rowFC001667

Common Values

ValueCountFrequency (%)
FC001840 6
10.0%
FC001247 6
10.0%
FC001865 6
10.0%
FC001900 6
10.0%
FC001667 6
10.0%
FC000137 6
10.0%
FC001649 6
10.0%
FC002067 6
10.0%
FC001950 6
10.0%
FC000001 6
10.0%

Length

2023-12-10T18:59:45.975244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:59:46.194173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
fc001840 6
10.0%
fc001247 6
10.0%
fc001865 6
10.0%
fc001900 6
10.0%
fc001667 6
10.0%
fc000137 6
10.0%
fc001649 6
10.0%
fc002067 6
10.0%
fc001950 6
10.0%
fc000001 6
10.0%

REPRSNT_KWRD_NM
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
인천대학교
올림픽공원
청주대학교
한성대학교
한밭대학교
Other values (5)
30 

Length

Max length10
Median length5
Mean length5.6
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천대학교
2nd row올림픽공원
3rd row청주대학교
4th row한성대학교
5th row한밭대학교

Common Values

ValueCountFrequency (%)
인천대학교 6
10.0%
올림픽공원 6
10.0%
청주대학교 6
10.0%
한성대학교 6
10.0%
한밭대학교 6
10.0%
한국예술종합학교 6
10.0%
국기원 6
10.0%
씨마크호텔 [강릉] 6
10.0%
대진대학교 6
10.0%
예술의전당 6
10.0%

Length

2023-12-10T18:59:46.560333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:59:46.837447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천대학교 6
9.1%
올림픽공원 6
9.1%
청주대학교 6
9.1%
한성대학교 6
9.1%
한밭대학교 6
9.1%
한국예술종합학교 6
9.1%
국기원 6
9.1%
씨마크호텔 6
9.1%
강릉 6
9.1%
대진대학교 6
9.1%

SRCHWRD_NM
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
인천대학교
올림픽공원
청주대학교
한성대학교
한밭대학교
Other values (5)
30 

Length

Max length8
Median length5
Mean length5.3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천대학교
2nd row올림픽공원
3rd row청주대학교
4th row한성대학교
5th row한밭대학교

Common Values

ValueCountFrequency (%)
인천대학교 6
10.0%
올림픽공원 6
10.0%
청주대학교 6
10.0%
한성대학교 6
10.0%
한밭대학교 6
10.0%
한국예술종합학교 6
10.0%
국기원 6
10.0%
강릉씨마크호텔 6
10.0%
대진대학교 6
10.0%
예술의전당 6
10.0%

Length

2023-12-10T18:59:47.156295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:59:47.476183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천대학교 6
10.0%
올림픽공원 6
10.0%
청주대학교 6
10.0%
한성대학교 6
10.0%
한밭대학교 6
10.0%
한국예술종합학교 6
10.0%
국기원 6
10.0%
강릉씨마크호텔 6
10.0%
대진대학교 6
10.0%
예술의전당 6
10.0%

SCCNT_VALUE
Real number (ℝ)

UNIQUE 

Distinct60
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90999.167
Minimum27100
Maximum192900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2023-12-10T18:59:47.930157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum27100
5-th percentile45543.5
Q168852.5
median80550
Q3107350
95-th percentile165455
Maximum192900
Range165800
Interquartile range (IQR)38497.5

Descriptive statistics

Standard deviation36046.207
Coefficient of variation (CV)0.39611579
Kurtosis0.72906303
Mean90999.167
Median Absolute Deviation (MAD)19550
Skewness0.95275243
Sum5459950
Variance1.2993291 × 109
MonotonicityNot monotonic
2023-12-10T18:59:48.239928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
77000 1
 
1.7%
70100 1
 
1.7%
86400 1
 
1.7%
70700 1
 
1.7%
66200 1
 
1.7%
134100 1
 
1.7%
72500 1
 
1.7%
117600 1
 
1.7%
42000 1
 
1.7%
59250 1
 
1.7%
Other values (50) 50
83.3%
ValueCountFrequency (%)
27100 1
1.7%
35700 1
1.7%
42000 1
1.7%
45730 1
1.7%
47300 1
1.7%
47700 1
1.7%
56700 1
1.7%
59250 1
1.7%
59600 1
1.7%
60800 1
1.7%
ValueCountFrequency (%)
192900 1
1.7%
184100 1
1.7%
174100 1
1.7%
165000 1
1.7%
147800 1
1.7%
147300 1
1.7%
141800 1
1.7%
134100 1
1.7%
133700 1
1.7%
129100 1
1.7%

ORIGIN_CL_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
KOPIS_공연시설
60 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKOPIS_공연시설
2nd rowKOPIS_공연시설
3rd rowKOPIS_공연시설
4th rowKOPIS_공연시설
5th rowKOPIS_공연시설

Common Values

ValueCountFrequency (%)
KOPIS_공연시설 60
100.0%

Length

2023-12-10T18:59:48.516629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:59:48.689037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kopis_공연시설 60
100.0%

Interactions

2023-12-10T18:59:43.628958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:59:43.240891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:59:43.824051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:59:43.440510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T18:59:48.819812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SEQ_NOSCCNT_YMORIGIN_IDREPRSNT_KWRD_NMSRCHWRD_NMSCCNT_VALUE
SEQ_NO1.0000.5660.8450.8450.8450.000
SCCNT_YM0.5661.0000.0000.0000.0000.000
ORIGIN_ID0.8450.0001.0001.0001.0000.000
REPRSNT_KWRD_NM0.8450.0001.0001.0001.0000.000
SRCHWRD_NM0.8450.0001.0001.0001.0000.000
SCCNT_VALUE0.0000.0000.0000.0000.0001.000
2023-12-10T18:59:48.995473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SRCHWRD_NMREPRSNT_KWRD_NMSCCNT_YMORIGIN_ID
SRCHWRD_NM1.0001.0000.0001.000
REPRSNT_KWRD_NM1.0001.0000.0001.000
SCCNT_YM0.0000.0001.0000.000
ORIGIN_ID1.0001.0000.0001.000
2023-12-10T18:59:49.186379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SEQ_NOSCCNT_VALUESCCNT_YMORIGIN_IDREPRSNT_KWRD_NMSRCHWRD_NM
SEQ_NO1.0000.0730.4210.4890.4890.489
SCCNT_VALUE0.0731.0000.0000.0000.0000.000
SCCNT_YM0.4210.0001.0000.0000.0000.000
ORIGIN_ID0.4890.0000.0001.0001.0001.000
REPRSNT_KWRD_NM0.4890.0000.0001.0001.0001.000
SRCHWRD_NM0.4890.0000.0001.0001.0001.000

Missing values

2023-12-10T18:59:44.434003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:59:44.662299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

SEQ_NOSCCNT_YMORIGIN_IDREPRSNT_KWRD_NMSRCHWRD_NMSCCNT_VALUEORIGIN_CL_NM
01252332020-07FC001840인천대학교인천대학교77000KOPIS_공연시설
11249622020-07FC001247올림픽공원올림픽공원45730KOPIS_공연시설
21256282020-07FC001865청주대학교청주대학교47300KOPIS_공연시설
31260332020-07FC001900한성대학교한성대학교59600KOPIS_공연시설
41260242020-07FC001667한밭대학교한밭대학교65600KOPIS_공연시설
51260032020-07FC000137한국예술종합학교한국예술종합학교56700KOPIS_공연시설
61229182020-07FC001649국기원국기원75100KOPIS_공연시설
71225632020-07FC002067씨마크호텔 [강릉]강릉씨마크호텔192900KOPIS_공연시설
81233412020-07FC001950대진대학교대진대학교60800KOPIS_공연시설
91249042020-07FC000001예술의전당예술의전당100800KOPIS_공연시설
SEQ_NOSCCNT_YMORIGIN_IDREPRSNT_KWRD_NMSRCHWRD_NMSCCNT_VALUEORIGIN_CL_NM
501382842020-12FC000001예술의전당예술의전당47700KOPIS_공연시설
511362792020-12FC001649국기원국기원76500KOPIS_공연시설
521367042020-12FC001950대진대학교대진대학교110200KOPIS_공연시설
531379002020-12FC002067씨마크호텔 [강릉]강릉씨마크호텔68380KOPIS_공연시설
541393812020-12FC001900한성대학교한성대학교92800KOPIS_공연시설
551383432020-12FC001247올림픽공원올림픽공원27100KOPIS_공연시설
561386172020-12FC001840인천대학교인천대학교174100KOPIS_공연시설
571389712020-12FC001865청주대학교청주대학교99900KOPIS_공연시설
581393422020-12FC000137한국예술종합학교한국예술종합학교76200KOPIS_공연시설
591393752020-12FC001667한밭대학교한밭대학교147800KOPIS_공연시설