Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 634.8 KiB
Average record size in memory: 65.0 B
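
The average record size is the total in-memory size divided by the number of observations. A minimal sketch of that arithmetic, assuming the report's KiB is the binary unit (1024 bytes); the function and constant names are illustrative, not part of the report:

```python
# Figures copied from the dataset statistics above.
TOTAL_SIZE_KIB = 634.8
N_OBSERVATIONS = 10_000
BYTES_PER_KIB = 1024  # assumption: KiB means 1024 bytes


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the mean in-memory size of one row, in bytes."""
    return total_kib * BYTES_PER_KIB / n_rows


avg_bytes = average_record_size(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")
```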

Variable types

DateTime: 1
Categorical: 3
Text: 2
Numeric: 1

Dataset

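Description: Data on the number of venture-company advertisements by broadcast time slot. It lists ad counts by broadcast date, broadcast day of the week, time slot, and industry category.
Author: 한국방송광고진흥공사 (Korea Broadcast Advertising Corporation, KOBACO)
URL: https://www.data.go.kr/data/15111057/fileData.do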

Reproduction

Analysis started: 2024-03-14 08:47:16.479128
Analysis finished: 2024-03-14 08:47:18.010878
Duration: 1.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
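
A report of this kind can be regenerated with ydata-profiling's `ProfileReport`. The sketch below is one plausible way to do it, not the exact configuration behind this report: the CSV file name, the encoding default, and the report title are assumptions, and `build_report` is a hypothetical helper.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "utf-8") -> Path:
    """Profile the CSV at csv_path and write an HTML report to out_path."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # 방송일 is the date column named in the Sample table; parse it as datetime.
    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["방송일"])

    # The title is illustrative only.
    profile = ProfileReport(df, title="Venture-company ads by broadcast time slot")
    profile.to_file(out_path)
    return Path(out_path)


# Usage (hypothetical file name):
#   build_report("venture_ads.csv")
```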

Variables

방송일 (broadcast date)
DateTime

Distinct: 1293
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2012-02-05 00:00:00
Maximum: 2015-11-11 00:00:00

Histogram with fixed size bins (bins=50)
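
For orientation, the width of each histogram bin can be worked out from the minimum and maximum dates. This sketch assumes 50 equal-width bins spanning exactly the observed range; it also compares the 1293 distinct dates with the number of calendar days in that range.

```python
from datetime import date

# Figures copied from the variable summary above.
first_day = date(2012, 2, 5)
last_day = date(2015, 11, 11)
N_BINS = 50
N_DISTINCT_DATES = 1293

span_days = (last_day - first_day).days      # length of the observed range
bin_width_days = span_days / N_BINS          # assumes equal-width bins
calendar_days = span_days + 1                # both endpoints included
coverage = N_DISTINCT_DATES / calendar_days  # share of days with at least one row

print(span_days, bin_width_days, f"{coverage:.1%}")
```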

방송요일 (broadcast day of week)
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
 | 1519
 | 1494
 | 1466
 | 1443
 | 1377
Other values (2) | 2701

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value | Count | Frequency (%)
 | 1519 | 15.2%
 | 1494 | 14.9%
 | 1466 | 14.7%
 | 1443 | 14.4%
 | 1377 | 13.8%
 | 1370 | 13.7%
 | 1331 | 13.3%

Length

Histogram of lengths of the category


시간대 (time slot)
Categorical

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
16시~19시 | 2366
08시~11시 | 2288
20시~23시 | 1939
04시~07시 | 1592
12시~15시 | 1445

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 08시~11시
2nd row: 08시~11시
3rd row: 16시~19시
4th row: 08시~11시
5th row: 04시~07시

Common Values

Value | Count | Frequency (%)
16시~19시 | 2366 | 23.7%
08시~11시 | 2288 | 22.9%
20시~23시 | 1939 | 19.4%
04시~07시 | 1592 | 15.9%
12시~15시 | 1445 | 14.4%
00시~03시 | 370 | 3.7%

Length

Histogram of lengths of the category


대업종 (major industry category)
Categorical

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
가정용품 | 1442
제약및의료 | 1029
서비스 | 875
산업기기 | 854
식품 | 828
Other values (15) | 4972

Length

Max length: 9
Median length: 7
Mean length: 4.7962
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 산업기기
2nd row: 가정용품
3rd row: 가정용품
4th row: 유통
5th row: 가정용품

Common Values

Value | Count | Frequency (%)
가정용품 | 1442 | 14.4%
제약및의료 | 1029 | 10.3%
서비스 | 875 | 8.8%
산업기기 | 854 | 8.5%
식품 | 828 | 8.3%
컴퓨터및정보통신 | 711 | 7.1%
건설,건재및부동산 | 505 | 5.1%
수송기기 | 492 | 4.9%
가정용전기전자 | 468 | 4.7%
출판 | 444 | 4.4%
Other values (10) | 2352 | 23.5%
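
The "Other values" rows follow from the listed counts: whatever the top k categories do not cover is rolled into a single row. A minimal sketch that reproduces both roll-ups for 대업종 (15 categories after the top 5, 10 categories after the top 10); `other_values` is an illustrative helper, not a ydata-profiling function.

```python
from collections import Counter

N_ROWS = 10_000   # number of observations in the report
N_DISTINCT = 20   # distinct 대업종 values in the report

# The ten most frequent 대업종 values, copied from the table above.
top_counts = Counter({
    "가정용품": 1442, "제약및의료": 1029, "서비스": 875, "산업기기": 854,
    "식품": 828, "컴퓨터및정보통신": 711, "건설,건재및부동산": 505,
    "수송기기": 492, "가정용전기전자": 468, "출판": 444,
})


def other_values(counts: Counter, total: int, n_distinct: int, k: int) -> tuple[int, int]:
    """Return (number of rolled-up categories, rows they cover) after the top k."""
    shown = sum(count for _, count in counts.most_common(k))
    return n_distinct - k, total - shown


print(other_values(top_counts, N_ROWS, N_DISTINCT, 5))
print(other_values(top_counts, N_ROWS, N_DISTINCT, 10))
```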

Length

Histogram of lengths of the category

중업종 (mid-level industry category)
Text

Distinct: 90
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 8
Mean length: 5.4946
Min length: 2

Characters and Unicode

Total characters: 54946
Distinct characters: 141
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
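
As an illustration of those character properties, Python's standard `unicodedata` module returns the general category of each code point. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Po` is Other Punctuation, `Zs` is Space Separator, and `Lu` is Uppercase Letter. The sketch below counts categories over the five sample values of this variable; it does not cover scripts or blocks, which the standard library does not expose.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample values listed for this variable in the report.
sample = ["상업및공업용기기", "난방기기", "생활잡화및기기", "소형,소매유통", "가구류"]
counts = category_counts(sample)
print(counts)
```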

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상업및공업용기기
2nd row: 난방기기
3rd row: 생활잡화및기기
4th row: 소형,소매유통
5th row: 가구류

Value | Count | Frequency (%)
가정용인테리어 | 478 | 4.7%
건재 | 442 | 4.3%
수송기기부품및용품 | 422 | 4.1%
농산품 | 411 | 4.0%
제약및의료기타 | 378 | 3.7%
소형,소매유통 | 361 | 3.5%
의료용품 | 345 | 3.4%
출판기타 | 314 | 3.1%
건강식품 | 311 | 3.0%
산업기기기타 | 276 | 2.7%
Other values (82) | 6462 | 63.4%
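
The counts in the table above sum to 10,200 rather than the 10,000 rows in the dataset. That is consistent with counting whitespace-separated words instead of whole values, since the character statistics for this variable report 200 space separators. The sketch below shows that kind of word count on hypothetical English values; the lowercasing step is an assumption, and the report's actual tokenizer may differ.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated, lowercased words across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts


# Hypothetical values: three rows, two of which contain one space each.
toy_values = ["health food", "furniture", "Health food"]
toy_counts = word_counts(toy_values)

# Figures copied from the report for this variable.
N_ROWS = 10_000
SPACE_SEPARATORS = 200
TOP10_TOTAL = 478 + 442 + 422 + 411 + 378 + 361 + 345 + 314 + 311 + 276
OTHER_TOTAL = 6462

# One extra word per space explains the surplus over the row count.
print(TOP10_TOTAL + OTHER_TOTAL, N_ROWS + SPACE_SEPARATORS)
```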

Most occurring characters

Value | Count | Frequency (%)
 | 7494 | 13.6%
 | 3032 | 5.5%
 | 2561 | 4.7%
 | 2357 | 4.3%
 | 2099 | 3.8%
 | 1089 | 2.0%
 | 998 | 1.8%
 | 989 | 1.8%
 | 942 | 1.7%
 | 858 | 1.6%
Other values (131) | 32527 | 59.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 54147 | 98.5%
Other Punctuation | 447 | 0.8%
Space Separator | 200 | 0.4%
Uppercase Letter | 152 | 0.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 7494 | 13.8%
 | 3032 | 5.6%
 | 2561 | 4.7%
 | 2357 | 4.4%
 | 2099 | 3.9%
 | 1089 | 2.0%
 | 998 | 1.8%
 | 989 | 1.8%
 | 942 | 1.7%
 | 858 | 1.6%
Other values (126) | 31728 | 58.6%

Other Punctuation
Value | Count | Frequency (%)
, | 371 | 83.0%
/ | 76 | 17.0%

Uppercase Letter
Value | Count | Frequency (%)
W | 76 | 50.0%
S | 76 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 200 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 54147 | 98.5%
Common | 647 | 1.2%
Latin | 152 | 0.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 7494 | 13.8%
 | 3032 | 5.6%
 | 2561 | 4.7%
 | 2357 | 4.4%
 | 2099 | 3.9%
 | 1089 | 2.0%
 | 998 | 1.8%
 | 989 | 1.8%
 | 942 | 1.7%
 | 858 | 1.6%
Other values (126) | 31728 | 58.6%

Common
Value | Count | Frequency (%)
, | 371 | 57.3%
(space) | 200 | 30.9%
/ | 76 | 11.7%

Latin
Value | Count | Frequency (%)
W | 76 | 50.0%
S | 76 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 54147 | 98.5%
ASCII | 799 | 1.5%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 7494 | 13.8%
 | 3032 | 5.6%
 | 2561 | 4.7%
 | 2357 | 4.4%
 | 2099 | 3.9%
 | 1089 | 2.0%
 | 998 | 1.8%
 | 989 | 1.8%
 | 942 | 1.7%
 | 858 | 1.6%
Other values (126) | 31728 | 58.6%

ASCII
Value | Count | Frequency (%)
, | 371 | 46.4%
(space) | 200 | 25.0%
/ | 76 | 9.5%
W | 76 | 9.5%
S | 76 | 9.5%

소업종 (sub-industry category)
Text

Distinct: 145
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 10
Mean length: 6.0053
Min length: 2

Characters and Unicode

Total characters: 60053
Distinct characters: 199
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 요식업소용기기
2nd row: 온냉수기
3rd row: 생활잡화및기기제품종합
4th row: 가구대리점
5th row: 사무용가구

Value | Count | Frequency (%)
수송기기용품 | 422 | 4.2%
가구대리점 | 361 | 3.6%
출판기타 | 314 | 3.1%
제약및의료기업pr | 309 | 3.1%
침구류및솜 | 278 | 2.8%
산업기기기타 | 273 | 2.7%
창호 | 264 | 2.6%
컴퓨터통합솔루션 | 239 | 2.4%
서비스기타 | 226 | 2.2%
생활잡화및기기제품종합 | 218 | 2.2%
Other values (137) | 7175 | 71.2%

Most occurring characters

Value | Count | Frequency (%)
 | 8298 | 13.8%
 | 3097 | 5.2%
 | 2985 | 5.0%
 | 2115 | 3.5%
 | 1755 | 2.9%
 | 1627 | 2.7%
 | 1188 | 2.0%
 | 1082 | 1.8%
 | 1056 | 1.8%
 | 1056 | 1.8%
Other values (189) | 35794 | 59.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 58962 | 98.2%
Uppercase Letter | 936 | 1.6%
Space Separator | 79 | 0.1%
Other Punctuation | 76 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 8298 | 14.1%
 | 3097 | 5.3%
 | 2985 | 5.1%
 | 2115 | 3.6%
 | 1755 | 3.0%
 | 1627 | 2.8%
 | 1188 | 2.0%
 | 1082 | 1.8%
 | 1056 | 1.8%
 | 1056 | 1.8%
Other values (182) | 34703 | 58.9%

Uppercase Letter
Value | Count | Frequency (%)
P | 392 | 41.9%
R | 387 | 41.3%
W | 76 | 8.1%
S | 76 | 8.1%
C | 5 | 0.5%

Space Separator
Value | Count | Frequency (%)
(space) | 79 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 76 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 58962 | 98.2%
Latin | 936 | 1.6%
Common | 155 | 0.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 8298 | 14.1%
 | 3097 | 5.3%
 | 2985 | 5.1%
 | 2115 | 3.6%
 | 1755 | 3.0%
 | 1627 | 2.8%
 | 1188 | 2.0%
 | 1082 | 1.8%
 | 1056 | 1.8%
 | 1056 | 1.8%
Other values (182) | 34703 | 58.9%

Latin
Value | Count | Frequency (%)
P | 392 | 41.9%
R | 387 | 41.3%
W | 76 | 8.1%
S | 76 | 8.1%
C | 5 | 0.5%

Common
Value | Count | Frequency (%)
(space) | 79 | 51.0%
/ | 76 | 49.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 58962 | 98.2%
ASCII | 1091 | 1.8%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 8298 | 14.1%
 | 3097 | 5.3%
 | 2985 | 5.1%
 | 2115 | 3.6%
 | 1755 | 3.0%
 | 1627 | 2.8%
 | 1188 | 2.0%
 | 1082 | 1.8%
 | 1056 | 1.8%
 | 1056 | 1.8%
Other values (182) | 34703 | 58.9%

ASCII
Value | Count | Frequency (%)
P | 392 | 35.9%
R | 387 | 35.5%
(space) | 79 | 7.2%
W | 76 | 7.0%
/ | 76 | 7.0%
S | 76 | 7.0%
C | 5 | 0.5%

광고 수 (number of ads)
Real number (ℝ)

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6507
Minimum: 1
Maximum: 13
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 4
Maximum: 13
Range: 12
Interquartile range (IQR): 1
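
These quantiles can be recomputed from the value counts reported for 광고 수. The sketch below expands the counts into a sorted list and applies a linear-interpolation quantile rule; that rule is an assumption about how the report computes quantiles, although for this integer-valued data the interpolation never falls between two different values.

```python
from itertools import chain, repeat

# Value counts for 광고 수, copied from the report's frequency tables.
value_counts = {1: 6426, 2: 2124, 3: 680, 4: 423, 5: 162, 6: 97, 7: 51,
                8: 11, 9: 9, 10: 9, 11: 5, 12: 1, 13: 2}


def expand(counts):
    """Turn {value: count} into a sorted list of individual observations."""
    return sorted(chain.from_iterable(repeat(v, c) for v, c in counts.items()))


def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


data = expand(value_counts)
q1, median, q3 = (quantile(data, q) for q in (0.25, 0.5, 0.75))
iqr = q3 - q1
print(q1, median, q3, quantile(data, 0.95), iqr)
```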

Descriptive statistics

Standard deviation: 1.1842423
Coefficient of variation (CV): 0.71741823
Kurtosis: 11.443955
Mean: 1.6507
Median Absolute Deviation (MAD): 0
Skewness: 2.8364303
Sum: 16507
Variance: 1.4024298
Monotonicity: Not monotonic
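
The sum, mean, variance, standard deviation, and coefficient of variation can be checked against the value counts for 광고 수. The sketch below uses the sample variance (divisor n - 1), which is the convention that reproduces the reported 1.4024298; variable names are illustrative.

```python
import math

# Value counts for 광고 수, copied from the report's frequency tables.
value_counts = {1: 6426, 2: 2124, 3: 680, 4: 423, 5: 162, 6: 97, 7: 51,
                8: 11, 9: 9, 10: 9, 11: 5, 12: 1, 13: 2}

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(count * (value - mean) ** 2
               for value, count in value_counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total, mean, round(variance, 7), round(std, 7), round(cv, 8))
```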
Histogram with fixed size bins (bins=13)

Common Values

Value | Count | Frequency (%)
1 | 6426 | 64.3%
2 | 2124 | 21.2%
3 | 680 | 6.8%
4 | 423 | 4.2%
5 | 162 | 1.6%
6 | 97 | 1.0%
7 | 51 | 0.5%
8 | 11 | 0.1%
9 | 9 | 0.1%
10 | 9 | 0.1%
Other values (3) | 8 | 0.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 6426 | 64.3%
2 | 2124 | 21.2%
3 | 680 | 6.8%
4 | 423 | 4.2%
5 | 162 | 1.6%
6 | 97 | 1.0%
7 | 51 | 0.5%
8 | 11 | 0.1%
9 | 9 | 0.1%
10 | 9 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
13 | 2 | < 0.1%
12 | 1 | < 0.1%
11 | 5 | 0.1%
10 | 9 | 0.1%
9 | 9 | 0.1%
8 | 11 | 0.1%
7 | 51 | 0.5%
6 | 97 | 1.0%
5 | 162 | 1.6%
4 | 423 | 4.2%

Interactions


Correlations

 | 방송요일 | 시간대 | 대업종 | 중업종 | 광고 수
방송요일 | 1.000 | 0.028 | 0.045 | 0.099 | 0.026
시간대 | 0.028 | 1.000 | 0.332 | 0.619 | 0.163
대업종 | 0.045 | 0.332 | 1.000 | 1.000 | 0.382
중업종 | 0.099 | 0.619 | 1.000 | 1.000 | 0.444
광고 수 | 0.026 | 0.163 | 0.382 | 0.444 | 1.000

 | 시간대 | 대업종 | 방송요일
시간대 | 1.000 | 0.160 | 0.017
대업종 | 0.160 | 1.000 | 0.020
방송요일 | 0.017 | 0.020 | 1.000

 | 광고 수 | 방송요일 | 시간대 | 대업종
광고 수 | 1.000 | 0.012 | 0.086 | 0.130
방송요일 | 0.012 | 1.000 | 0.017 | 0.020
시간대 | 0.086 | 0.017 | 1.000 | 0.160
대업종 | 0.130 | 0.020 | 0.160 | 1.000
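
The report text does not name the association measure behind each matrix. As an illustration only, the sketch below implements the plain (not bias-corrected) Cramér's V, one common measure for pairs of categorical variables. On a toy example it shows how a value of 1.000 can arise when one variable is fully nested in another, which is what the names 대업종 and 중업종 suggest; the toy categories are hypothetical.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (x, y) category pairs.

    Assumes both variables take at least two distinct values.
    """
    n = len(pairs)
    joint = Counter(pairs)
    x_totals = Counter(x for x, _ in pairs)
    y_totals = Counter(y for _, y in pairs)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            chi2 += (joint[(x, y)] - expected) ** 2 / expected

    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Nested toy data: every sub-category belongs to exactly one major category.
nested = [("A", "a1")] * 10 + [("A", "a2")] * 20 + [("B", "b1")] * 30
# Independent toy data: the two variables carry no information about each other.
independent = ([("A", "x")] * 10 + [("A", "y")] * 10
               + [("B", "x")] * 10 + [("B", "y")] * 10)

print(cramers_v(nested), cramers_v(independent))
```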

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

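 | 방송일 | 방송요일 | 시간대 | 대업종 | 중업종 | 소업종 | 광고 수
17296 | 2013-07-01 |  | 08시~11시 | 산업기기 | 상업및공업용기기 | 요식업소용기기 | 1
58544 | 2014-10-10 |  | 08시~11시 | 가정용품 | 난방기기 | 온냉수기 | 1
6408 | 2013-01-02 |  | 16시~19시 | 가정용품 | 생활잡화및기기 | 생활잡화및기기제품종합 | 1
32137 | 2014-01-09 |  | 08시~11시 | 유통 | 소형,소매유통 | 가구대리점 | 1
86758 | 2015-06-17 |  | 04시~07시 | 가정용품 | 가구류 | 사무용가구 | 1
43006 | 2014-05-20 |  | 16시~19시 | 가정용품 | 가구류 | 가구류기타 | 2
14483 | 2013-05-27 |  | 20시~23시 | 가정용품 | 생활잡화및기기 | 생활잡화및기기제품종합 | 2
70817 | 2015-01-22 |  | 16시~19시 | 가정용품 | 주방용품 | 조리용구 | 3
81559 | 2015-04-30 |  | 04시~07시 | 가정용품 | 가구류 | 사무용가구 | 1
9265 | 2013-03-05 |  | 08시~11시 | 패션 | 신발류 | 신발류기타 | 1

 | 방송일 | 방송요일 | 시간대 | 대업종 | 중업종 | 소업종 | 광고 수
3089 | 2012-10-02 |  | 20시~23시 | 컴퓨터및정보통신 | 컴퓨터저장장치 | 컴퓨터저장장치제품종합 | 2
46526 | 2014-06-26 |  | 04시~07시 | 제약및의료 | 의료용품 | 간이치료용품 | 2
22153 | 2013-09-03 |  | 20시~23시 | 서비스 | 음식및숙박 | 대중음식점 | 2
14689 | 2013-05-30 |  | 08시~11시 | 식품 | 건강식품 | 건강식품기타 | 7
18185 | 2013-07-11 |  | 16시~19시 | 가정용전기전자 | 음향기기 | 음향기기기타 | 1
52206 | 2014-08-14 |  | 04시~07시 | 건설,건재및부동산 | 건재 | 위생설비 | 1
56345 | 2014-09-20 |  | 16시~19시 | 가정용품 | 가정용인테리어 | 침구류및솜 | 4
24915 | 2013-10-11 |  | 20시~23시 | 서비스 | 음식및숙박 | 대중음식점 | 2
36064 | 2014-02-25 |  | 16시~19시 | 컴퓨터및정보통신 | 컴퓨터및정보통신기타 | 컴퓨터및정보통신기타 | 1
88686 | 2015-07-07 |  | 04시~07시 | 기초재 | 농축수산기초재 | 종묘 | 1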