Overview

Dataset statistics

Number of variables: 9
Number of observations: 999
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 75.2 KiB
Average record size in memory: 77.1 B
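
The last two figures are linked by simple arithmetic: the average record size is the total in-memory size divided by the number of observations. A minimal check, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
total_size_kib = 75.2   # total size in memory, in KiB
n_observations = 999    # number of observations

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

Rounded to one decimal place, the result agrees with the reported average record size.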

Variable types

Numeric: 5
Text: 3
Categorical: 1

Dataset

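Description: Attachment-file data covering the broadcasting headquarters code, bulletin board code, post number, sequence, file name, storage path, size, and type
Author: 도로교통공단 (Korea Road Traffic Authority)
URL: https://www.data.go.kr/data/15089359/fileData.do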

Alerts

방송본부 구분 코드 is highly overall correlated with 게시판구분코드 and 2 other fields (High correlation)
게시판구분코드 is highly overall correlated with 방송본부 구분 코드 and 2 other fields (High correlation)
게시글번호 is highly overall correlated with 방송본부 구분 코드 and 4 other fields (High correlation)
시퀀스 is highly overall correlated with 방송본부 구분 코드 and 3 other fields (High correlation)
크기 is highly overall correlated with 게시글번호 and 1 other field (High correlation)
타입 is highly overall correlated with 게시글번호 (High correlation)
시퀀스 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 12:37:47.521168
Analysis finished: 2023-12-12 12:37:52.053224
Duration: 4.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
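
The reported duration can be recovered from the two timestamps. A small sketch using only the standard library (the format string is an assumption about how the timestamps are written, inferred from the two values above):

```python
from datetime import datetime

# Assumed timestamp layout, matching the two values in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 12:37:47.521168", FMT)
finished = datetime.strptime("2023-12-12 12:37:52.053224", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```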

Variables

방송본부 구분 코드 (broadcasting headquarters code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 32
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 281.22823
Minimum: 146
Maximum: 443
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.9 KiB

Quantile statistics

Minimum: 146
5-th percentile: 155
Q1: 172
Median: 224
Q3: 416
95-th percentile: 416
Maximum: 443
Range: 297
Interquartile range (IQR): 244

Descriptive statistics

Standard deviation: 115.33289
Coefficient of variation (CV): 0.41010424
Kurtosis: -1.8238318
Mean: 281.22823
Median Absolute Deviation (MAD): 78
Skewness: 0.17656133
Sum: 280947
Variance: 13301.675
Monotonicity: Not monotonic
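
Several of these figures are derived from one another. The sketch below recomputes the mean, range, IQR, coefficient of variation and variance from the other reported values for this variable, using the usual textbook definitions (the variable names are illustrative; the report does not show how it computes them):

```python
# Values copied from the statistics reported for this variable.
n_observations = 999
total = 280947
minimum, maximum = 146, 443
q1, q3 = 172, 416
std = 115.33289

mean = total / n_observations    # Mean = Sum / number of observations
value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # IQR = Q3 - Q1
cv = std / mean                  # CV = standard deviation / mean
variance = std ** 2              # Variance = standard deviation squared

print(f"mean={mean:.5f} range={value_range} iqr={iqr}")
print(f"cv={cv:.8f} variance={variance:.3f}")
```

Each recomputed value agrees with the reported one to the precision shown in the report.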
Histogram with fixed size bins (bins=32)

Common values

Value | Count | Frequency (%)
416 | 319 | 31.9%
172 | 135 | 13.5%
155 | 59 | 5.9%
324 | 54 | 5.4%
163 | 47 | 4.7%
199 | 43 | 4.3%
396 | 42 | 4.2%
174 | 41 | 4.1%
146 | 36 | 3.6%
224 | 33 | 3.3%
Other values (22) | 190 | 19.0%

Minimum 10 values

Value | Count | Frequency (%)
146 | 36 | 3.6%
155 | 59 | 5.9%
158 | 15 | 1.5%
159 | 5 | 0.5%
162 | 19 | 1.9%
163 | 47 | 4.7%
166 | 4 | 0.4%
169 | 7 | 0.7%
171 | 7 | 0.7%
172 | 135 | 13.5%

Maximum 10 values

Value | Count | Frequency (%)
443 | 14 | 1.4%
441 | 8 | 0.8%
416 | 319 | 31.9%
404 | 5 | 0.5%
399 | 1 | 0.1%
396 | 42 | 4.2%
388 | 2 | 0.2%
337 | 1 | 0.1%
333 | 2 | 0.2%
326 | 14 | 1.4%

Unnamed: 1
Text

Distinct: 60
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 7.9 KiB

Length

Max length: 13
Median length: 11
Mean length: 9.6976977
Min length: 4

Characters and Unicode

Total characters: 9688
Distinct characters: 81
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 3.3%

Sample

1st row: TBN 광주매거진
2nd row: TBN 부산매거진
3rd row: TBN 부산매거진
4th row: TBN 부산매거진
5th row: TBN 부산매거진

Words

Value | Count | Frequency (%)
tbn | 343 | 16.6%
박철의 | 319 | 15.4%
방방곡곡 | 319 | 15.4%
차차차(광주 | 135 | 6.5%
달리는 | 107 | 5.2%
한밤의 | 59 | 2.9%
교차로 | 59 | 2.9%
차차차(경남 | 54 | 2.6%
출발 | 53 | 2.6%
라디오(부산 | 47 | 2.3%
Other values (63) | 571 | 27.6%

Most occurring characters

ValueCountFrequency (%)
1067
 
11.0%
1028
 
10.6%
638
 
6.6%
638
 
6.6%
( 460
 
4.7%
) 460
 
4.7%
378
 
3.9%
T 347
 
3.6%
B 347
 
3.6%
N 347
 
3.6%
Other values (71) 3978
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6479
66.9%
Space Separator 1067
 
11.0%
Uppercase Letter 1041
 
10.7%
Open Punctuation 460
 
4.7%
Close Punctuation 460
 
4.7%
Decimal Number 128
 
1.3%
Other Punctuation 53
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1028
15.9%
638
 
9.8%
638
 
9.8%
378
 
5.8%
338
 
5.2%
319
 
4.9%
319
 
4.9%
220
 
3.4%
172
 
2.7%
142
 
2.2%
Other values (54) 2287
35.3%
Decimal Number
ValueCountFrequency (%)
1 35
27.3%
0 30
23.4%
9 20
15.6%
7 11
 
8.6%
2 10
 
7.8%
4 8
 
6.2%
5 4
 
3.1%
6 4
 
3.1%
8 3
 
2.3%
3 3
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
T 347
33.3%
B 347
33.3%
N 347
33.3%
Space Separator
ValueCountFrequency (%)
1067
100.0%
Open Punctuation
ValueCountFrequency (%)
( 460
100.0%
Close Punctuation
ValueCountFrequency (%)
) 460
100.0%
Other Punctuation
ValueCountFrequency (%)
! 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6479
66.9%
Common 2168
 
22.4%
Latin 1041
 
10.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1028
15.9%
638
 
9.8%
638
 
9.8%
378
 
5.8%
338
 
5.2%
319
 
4.9%
319
 
4.9%
220
 
3.4%
172
 
2.7%
142
 
2.2%
Other values (54) 2287
35.3%
Common
ValueCountFrequency (%)
1067
49.2%
( 460
21.2%
) 460
21.2%
! 53
 
2.4%
1 35
 
1.6%
0 30
 
1.4%
9 20
 
0.9%
7 11
 
0.5%
2 10
 
0.5%
4 8
 
0.4%
Other values (4) 14
 
0.6%
Latin
ValueCountFrequency (%)
T 347
33.3%
B 347
33.3%
N 347
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6479
66.9%
ASCII 3209
33.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1067
33.3%
( 460
14.3%
) 460
14.3%
T 347
 
10.8%
B 347
 
10.8%
N 347
 
10.8%
! 53
 
1.7%
1 35
 
1.1%
0 30
 
0.9%
9 20
 
0.6%
Other values (7) 43
 
1.3%
Hangul
ValueCountFrequency (%)
1028
15.9%
638
 
9.8%
638
 
9.8%
378
 
5.8%
338
 
5.2%
319
 
4.9%
319
 
4.9%
220
 
3.4%
172
 
2.7%
142
 
2.2%
Other values (54) 2287
35.3%

게시판구분코드 (bulletin board code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 56
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3676.9469
Minimum: 1888
Maximum: 7099
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.9 KiB

Quantile statistics

Minimum: 1888
5-th percentile: 2005
Q1: 2226
Median: 2902
Q3: 5402
95-th percentile: 5402
Maximum: 7099
Range: 5211
Interquartile range (IQR): 3176

Descriptive statistics

Standard deviation: 1554.0919
Coefficient of variation (CV): 0.42265824
Kurtosis: -1.5435817
Mean: 3676.9469
Median Absolute Deviation (MAD): 1014
Skewness: 0.2873381
Sum: 3673270
Variance: 2415201.8
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
5402 | 243 | 24.3%
2226 | 103 | 10.3%
2005 | 59 | 5.9%
4203 | 46 | 4.6%
5400 | 45 | 4.5%
2577 | 43 | 4.3%
5140 | 42 | 4.2%
1888 | 36 | 3.6%
2902 | 31 | 3.1%
2228 | 30 | 3.0%
Other values (46) | 321 | 32.1%

Minimum 10 values

Value | Count | Frequency (%)
1888 | 36 | 3.6%
2005 | 59 | 5.9%
2042 | 2 | 0.2%
2044 | 6 | 0.6%
2045 | 3 | 0.3%
2046 | 1 | 0.1%
2048 | 3 | 0.3%
2057 | 5 | 0.5%
2096 | 18 | 1.8%
2097 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
7099 | 13 | 1.3%
7097 | 1 | 0.1%
7067 | 8 | 0.8%
5402 | 243 | 24.3%
5400 | 45 | 4.5%
5398 | 14 | 1.4%
5396 | 17 | 1.7%
5244 | 5 | 0.5%
5177 | 1 | 0.1%
5140 | 42 | 4.2%

게시글번호 (post number)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 798
Distinct (%): 79.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 331131.78
Minimum: 10473
Maximum: 491211
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.9 KiB

Quantile statistics

Minimum: 10473
5-th percentile: 151114.9
Q1: 202486.5
Median: 425533
Q3: 434502.5
95-th percentile: 482874.3
Maximum: 491211
Range: 480738
Interquartile range (IQR): 232016

Descriptive statistics

Standard deviation: 130132.7
Coefficient of variation (CV): 0.39299369
Kurtosis: -1.2779329
Mean: 331131.78
Median Absolute Deviation (MAD): 60286
Skewness: -0.41645022
Sum: 3.3080065 × 10^8
Variance: 1.693452 × 10^10
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
211101 | 5 | 0.5%
225020 | 5 | 0.5%
485169 | 5 | 0.5%
209706 | 5 | 0.5%
485168 | 4 | 0.4%
460679 | 4 | 0.4%
430322 | 4 | 0.4%
209979 | 4 | 0.4%
237894 | 4 | 0.4%
430168 | 4 | 0.4%
Other values (788) | 955 | 95.6%

Minimum 10 values

Value | Count | Frequency (%)
10473 | 1 | 0.1%
10474 | 1 | 0.1%
10475 | 1 | 0.1%
10476 | 1 | 0.1%
10477 | 1 | 0.1%
10478 | 1 | 0.1%
10479 | 1 | 0.1%
10480 | 1 | 0.1%
10481 | 1 | 0.1%
10482 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
491211 | 1 | 0.1%
491176 | 1 | 0.1%
491116 | 1 | 0.1%
491055 | 1 | 0.1%
491008 | 1 | 0.1%
490656 | 1 | 0.1%
490535 | 1 | 0.1%
490366 | 1 | 0.1%
490245 | 1 | 0.1%
490028 | 1 | 0.1%

시퀀스 (sequence)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 999
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9722.6126
Minimum: 539
Maximum: 12236
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.9 KiB

Quantile statistics

Minimum: 539
5-th percentile: 5953.9
Q1: 6498.5
Median: 11608
Q3: 11921
95-th percentile: 12173.1
Maximum: 12236
Range: 11697
Interquartile range (IQR): 5422.5

Descriptive statistics

Standard deviation: 2788.5927
Coefficient of variation (CV): 0.28681516
Kurtosis: -0.043321104
Mean: 9722.6126
Median Absolute Deviation (MAD): 548
Skewness: -0.94019808
Sum: 9712890
Variance: 7776249.1
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
11561 | 1 | 0.1%
11776 | 1 | 0.1%
11760 | 1 | 0.1%
11761 | 1 | 0.1%
11762 | 1 | 0.1%
11763 | 1 | 0.1%
11764 | 1 | 0.1%
11765 | 1 | 0.1%
11766 | 1 | 0.1%
11767 | 1 | 0.1%
Other values (989) | 989 | 99.0%

Minimum 10 values

Value | Count | Frequency (%)
539 | 1 | 0.1%
540 | 1 | 0.1%
541 | 1 | 0.1%
542 | 1 | 0.1%
979 | 1 | 0.1%
980 | 1 | 0.1%
981 | 1 | 0.1%
982 | 1 | 0.1%
983 | 1 | 0.1%
984 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
12236 | 1 | 0.1%
12235 | 1 | 0.1%
12233 | 1 | 0.1%
12232 | 1 | 0.1%
12231 | 1 | 0.1%
12229 | 1 | 0.1%
12228 | 1 | 0.1%
12225 | 1 | 0.1%
12222 | 1 | 0.1%
12221 | 1 | 0.1%

파일명 (file name)
Text

Distinct: 873
Distinct (%): 87.4%
Missing: 0
Missing (%): 0.0%
Memory size: 7.9 KiB

Length

Max length: 59
Median length: 42
Mean length: 16.887888
Min length: 5

Characters and Unicode

Total characters: 16871
Distinct characters: 549
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 825
Unique (%): 82.6%

Sample

1st row: 포스터 디자인.pdf
2nd row: 양선호 - 1집 내 사랑 지니 - 01 - 내사랑 지니.mp3
3rd row: CF_184hJ_NPM1_16_1.jpg
4th row: 25.jpg
5th row: 트부성_final_전단.jpg

Words

Value | Count | Frequency (%)
신용회복 | 27 | 1.6%
사진 | 25 | 1.4%
20190724152000_양식_개인정보활용동의서.hwp | 24 | 1.4%
the | 19 | 1.1%
양식_개인정보활용동의서.hwp | 19 | 1.1%
수기 | 17 | 1.0%
가수 | 17 | 1.0%
2019 | 15 | 0.9%
(value not preserved) | 14 | 0.8%
사진.jpg | 12 | 0.7%
Other values (1248) | 1540 | 89.1%

Most occurring characters

ValueCountFrequency (%)
0 1230
 
7.3%
1 1077
 
6.4%
. 1035
 
6.1%
2 950
 
5.6%
p 801
 
4.7%
733
 
4.3%
g 556
 
3.3%
j 531
 
3.1%
_ 508
 
3.0%
5 441
 
2.6%
Other values (539) 9009
53.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5469
32.4%
Other Letter 4530
26.9%
Lowercase Letter 3169
18.8%
Uppercase Letter 1103
 
6.5%
Other Punctuation 1089
 
6.5%
Space Separator 733
 
4.3%
Connector Punctuation 508
 
3.0%
Dash Punctuation 89
 
0.5%
Close Punctuation 86
 
0.5%
Open Punctuation 84
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
188
 
4.2%
137
 
3.0%
131
 
2.9%
128
 
2.8%
125
 
2.8%
118
 
2.6%
117
 
2.6%
112
 
2.5%
102
 
2.3%
101
 
2.2%
Other values (462) 3271
72.2%
Lowercase Letter
ValueCountFrequency (%)
p 801
25.3%
g 556
17.5%
j 531
16.8%
h 271
 
8.6%
w 245
 
7.7%
e 100
 
3.2%
a 85
 
2.7%
t 63
 
2.0%
o 60
 
1.9%
f 50
 
1.6%
Other values (16) 407
12.8%
Uppercase Letter
ValueCountFrequency (%)
P 228
20.7%
G 224
20.3%
J 145
13.1%
S 81
 
7.3%
I 60
 
5.4%
C 58
 
5.3%
N 52
 
4.7%
D 52
 
4.7%
M 37
 
3.4%
T 35
 
3.2%
Other values (14) 131
11.9%
Decimal Number
ValueCountFrequency (%)
0 1230
22.5%
1 1077
19.7%
2 950
17.4%
5 441
 
8.1%
3 358
 
6.5%
4 322
 
5.9%
9 313
 
5.7%
6 268
 
4.9%
7 258
 
4.7%
8 252
 
4.6%
Other Punctuation
ValueCountFrequency (%)
. 1035
95.0%
% 21
 
1.9%
, 16
 
1.5%
; 8
 
0.7%
! 8
 
0.7%
& 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 78
90.7%
] 7
 
8.1%
1
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 76
90.5%
[ 7
 
8.3%
1
 
1.2%
Math Symbol
ValueCountFrequency (%)
~ 8
72.7%
+ 3
 
27.3%
Space Separator
ValueCountFrequency (%)
733
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 508
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8069
47.8%
Hangul 4526
26.8%
Latin 4272
25.3%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
188
 
4.2%
137
 
3.0%
131
 
2.9%
128
 
2.8%
125
 
2.8%
118
 
2.6%
117
 
2.6%
112
 
2.5%
102
 
2.3%
101
 
2.2%
Other values (458) 3267
72.2%
Latin
ValueCountFrequency (%)
p 801
18.8%
g 556
13.0%
j 531
12.4%
h 271
 
6.3%
w 245
 
5.7%
P 228
 
5.3%
G 224
 
5.2%
J 145
 
3.4%
e 100
 
2.3%
a 85
 
2.0%
Other values (40) 1086
25.4%
Common
ValueCountFrequency (%)
0 1230
15.2%
1 1077
13.3%
. 1035
12.8%
2 950
11.8%
733
9.1%
_ 508
6.3%
5 441
 
5.5%
3 358
 
4.4%
4 322
 
4.0%
9 313
 
3.9%
Other values (17) 1102
13.7%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12339
73.1%
Hangul 4437
 
26.3%
Compat Jamo 89
 
0.5%
CJK 3
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1230
 
10.0%
1 1077
 
8.7%
. 1035
 
8.4%
2 950
 
7.7%
p 801
 
6.5%
733
 
5.9%
g 556
 
4.5%
j 531
 
4.3%
_ 508
 
4.1%
5 441
 
3.6%
Other values (65) 4477
36.3%
Hangul
ValueCountFrequency (%)
188
 
4.2%
137
 
3.1%
131
 
3.0%
128
 
2.9%
125
 
2.8%
118
 
2.7%
117
 
2.6%
112
 
2.5%
102
 
2.3%
101
 
2.3%
Other values (435) 3178
71.6%
Compat Jamo
ValueCountFrequency (%)
19
21.3%
10
11.2%
6
 
6.7%
6
 
6.7%
6
 
6.7%
6
 
6.7%
5
 
5.6%
5
 
5.6%
4
 
4.5%
3
 
3.4%
Other values (13) 19
21.3%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

저장경로 (storage path)
Text

Distinct: 956
Distinct (%): 95.7%
Missing: 0
Missing (%): 0.0%
Memory size: 7.9 KiB

Length

Max length: 65
Median length: 45
Mean length: 43.025025
Min length: 17

Characters and Unicode

Total characters: 42982
Distinct characters: 281
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 944
Unique (%): 94.5%

Sample

1st row: /file/broadcast/attach/293/15270598316290.pdf
2nd row: /file/oldprogram/B_STORY/양선호 - 1집 내 사랑 지니 - 01 - 내사랑 지니.mp3
3rd row: /file/oldprogram/B_STORY/CF_184hJ_NPM1_16_1.jpg
4th row: /file/oldprogram/B_STORY/25.jpg
5th row: /file/oldprogram/B_STORY/트부성_final_전단.jpg

Words

Value | Count | Frequency (%)
file/oldprogram/b_story/사진 | 24 | 2.0%
old/files/notice | 16 | 1.4%
file/oldprogram/b_story/사진.jpg | 12 | 1.0%
file/oldprogram/b_story/가수안병현.jpg | 8 | 0.7%
(value not preserved) | 8 | 0.7%
file/oldprogram/b_story/10년 | 8 | 0.7%
001.jpg | 6 | 0.5%
file/oldprogram/b_story/2009-04-02 | 3 | 0.3%
06;57;03pm.jpg | 3 | 0.3%
011.jpg | 3 | 0.3%
Other values (1045) | 1084 | 92.3%

Most occurring characters

ValueCountFrequency (%)
/ 4720
 
11.0%
a 3210
 
7.5%
1 2466
 
5.7%
t 2254
 
5.2%
6 1717
 
4.0%
0 1614
 
3.8%
c 1504
 
3.5%
4 1354
 
3.2%
o 1269
 
3.0%
l 1265
 
2.9%
Other values (271) 21609
50.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 20217
47.0%
Decimal Number 13597
31.6%
Other Punctuation 5742
 
13.4%
Uppercase Letter 2183
 
5.1%
Other Letter 707
 
1.6%
Connector Punctuation 295
 
0.7%
Space Separator 177
 
0.4%
Dash Punctuation 26
 
0.1%
Open Punctuation 16
 
< 0.1%
Close Punctuation 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
7.4%
45
 
6.4%
27
 
3.8%
26
 
3.7%
16
 
2.3%
15
 
2.1%
14
 
2.0%
13
 
1.8%
12
 
1.7%
11
 
1.6%
Other values (204) 476
67.3%
Lowercase Letter
ValueCountFrequency (%)
a 3210
15.9%
t 2254
11.1%
c 1504
 
7.4%
o 1269
 
6.3%
l 1265
 
6.3%
r 1226
 
6.1%
f 1038
 
5.1%
i 1034
 
5.1%
p 1029
 
5.1%
e 1027
 
5.1%
Other values (14) 5361
26.5%
Uppercase Letter
ValueCountFrequency (%)
S 274
12.6%
B 250
11.5%
T 226
10.4%
Y 225
10.3%
R 225
10.3%
O 225
10.3%
G 190
8.7%
P 184
8.4%
J 127
5.8%
N 49
 
2.2%
Other values (10) 208
9.5%
Decimal Number
ValueCountFrequency (%)
1 2466
18.1%
6 1717
12.6%
0 1614
11.9%
4 1354
10.0%
5 1242
9.1%
3 1212
8.9%
2 1151
8.5%
7 994
7.3%
9 950
 
7.0%
8 897
 
6.6%
Other Punctuation
ValueCountFrequency (%)
/ 4720
82.2%
. 995
 
17.3%
% 19
 
0.3%
; 8
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 12
75.0%
[ 4
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 12
75.0%
] 4
 
25.0%
Math Symbol
ValueCountFrequency (%)
~ 5
83.3%
+ 1
 
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 295
100.0%
Space Separator
ValueCountFrequency (%)
177
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 22400
52.1%
Common 19875
46.2%
Hangul 707
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
7.4%
45
 
6.4%
27
 
3.8%
26
 
3.7%
16
 
2.3%
15
 
2.1%
14
 
2.0%
13
 
1.8%
12
 
1.7%
11
 
1.6%
Other values (204) 476
67.3%
Latin
ValueCountFrequency (%)
a 3210
14.3%
t 2254
 
10.1%
c 1504
 
6.7%
o 1269
 
5.7%
l 1265
 
5.6%
r 1226
 
5.5%
f 1038
 
4.6%
i 1034
 
4.6%
p 1029
 
4.6%
e 1027
 
4.6%
Other values (34) 7544
33.7%
Common
ValueCountFrequency (%)
/ 4720
23.7%
1 2466
12.4%
6 1717
 
8.6%
0 1614
 
8.1%
4 1354
 
6.8%
5 1242
 
6.2%
3 1212
 
6.1%
2 1151
 
5.8%
. 995
 
5.0%
7 994
 
5.0%
Other values (13) 2410
12.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42275
98.4%
Hangul 706
 
1.6%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 4720
 
11.2%
a 3210
 
7.6%
1 2466
 
5.8%
t 2254
 
5.3%
6 1717
 
4.1%
0 1614
 
3.8%
c 1504
 
3.6%
4 1354
 
3.2%
o 1269
 
3.0%
l 1265
 
3.0%
Other values (57) 20902
49.4%
Hangul
ValueCountFrequency (%)
52
 
7.4%
45
 
6.4%
27
 
3.8%
26
 
3.7%
16
 
2.3%
15
 
2.1%
14
 
2.0%
13
 
1.8%
12
 
1.7%
11
 
1.6%
Other values (203) 475
67.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

크기 (size)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 788
Distinct (%): 78.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 732972.28
Minimum: 5
Maximum: 30782026
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.9 KiB

Quantile statistics

Minimum: 5
5-th percentile: 96
Q1: 2972
Median: 76123
Q3: 507374.5
95-th percentile: 4312105.3
Maximum: 30782026
Range: 30782021
Interquartile range (IQR): 504402.5

Descriptive statistics

Standard deviation: 1832587.8
Coefficient of variation (CV): 2.5002144
Kurtosis: 78.509701
Mean: 732972.28
Median Absolute Deviation (MAD): 75812
Skewness: 6.515575
Sum: 7.3223931 × 10^8
Variance: 3.3583781 × 10^12
Monotonicity: Not monotonic
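
The mean is roughly ten times the median and the skewness is about 6.5, which points to a long right tail of large files. To make the magnitudes easier to read, the sketch below renders a few of the reported values in binary units. It assumes that 크기 is measured in bytes, which the report does not state; `format_bytes` is a hypothetical helper, not part of ydata-profiling.

```python
def format_bytes(n_bytes: float) -> str:
    """Render a byte count using binary (1024-based) units."""
    units = ["B", "KiB", "MiB", "GiB"]
    value = float(n_bytes)
    for unit in units:
        # Stop at the first unit that keeps the value under 1024,
        # or at the largest unit available.
        if value < 1024 or unit == units[-1]:
            return f"{value:.1f} {unit}"
        value /= 1024


# Values copied from the statistics above (assumed to be bytes).
for label, size in [("median", 76123), ("mean", 732972.28), ("maximum", 30782026)]:
    print(label, format_bytes(size))
```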
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
17408 | 37 | 3.7%
32768 | 12 | 1.2%
16896 | 12 | 1.2%
41 | 10 | 1.0%
62995 | 9 | 0.9%
143563 | 9 | 0.9%
17920 | 8 | 0.8%
32256 | 7 | 0.7%
18432 | 7 | 0.7%
16384 | 6 | 0.6%
Other values (778) | 882 | 88.3%

Minimum 10 values

Value | Count | Frequency (%)
5 | 1 | 0.1%
7 | 1 | 0.1%
13 | 1 | 0.1%
14 | 1 | 0.1%
15 | 1 | 0.1%
16 | 3 | 0.3%
17 | 1 | 0.1%
21 | 1 | 0.1%
28 | 1 | 0.1%
30 | 3 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
30782026 | 1 | 0.1%
13458145 | 1 | 0.1%
10296827 | 1 | 0.1%
9656832 | 1 | 0.1%
8832521 | 2 | 0.2%
8460202 | 1 | 0.1%
7995996 | 1 | 0.1%
7856945 | 1 | 0.1%
7665812 | 1 | 0.1%
7613861 | 1 | 0.1%

타입 (type)
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 7.9 KiB

Length

Max length: 71
Median length: 28
Mean length: 11.846847
Min length: 4

Unique

Unique: 4
Unique (%): 0.4%

Sample

1st row: application/pdf
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
image/jpeg | 301 | 30.1%
<NA> | 259 | 25.9%
application/haansofthwp | 201 | 20.1%
image/pjpeg | 127 | 12.7%
image/png | 49 | 4.9%
application/pdf | 16 | 1.6%
application/octet-stream | 15 | 1.5%
application/vnd.openxmlformats-officedocument.wordprocessingml.document | 6 | 0.6%
audio/mpeg | 6 | 0.6%
application/x-hwp | 6 | 0.6%
Other values (7) | 13 | 1.3%
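
The Frequency (%) column follows directly from the counts: each count is divided by the total number of observations. The sketch below recomputes the percentages from the counts listed for this variable. Note that `<NA>` appears as an ordinary category here while the variable reports 0 missing values. The dictionary is transcribed by hand from the report and the rounding to one decimal place is an assumption about how the report formats its output.

```python
# Counts copied from the common values reported for this variable.
counts = {
    "image/jpeg": 301,
    "<NA>": 259,
    "application/haansofthwp": 201,
    "image/pjpeg": 127,
    "image/png": 49,
    "application/pdf": 16,
    "application/octet-stream": 15,
    "application/vnd.openxmlformats-officedocument.wordprocessingml.document": 6,
    "audio/mpeg": 6,
    "application/x-hwp": 6,
    "Other values (7)": 13,
}

# The counts should add up to the number of observations in the dataset.
n_observations = sum(counts.values())

# Frequency (%) = 100 * count / number of observations, to one decimal place.
frequencies = {
    value: round(100 * count / n_observations, 1)
    for value, count in counts.items()
}

for value, pct in frequencies.items():
    print(f"{value}: {pct}%")
```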

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
image/jpeg | 301 | 30.1%
na | 259 | 25.9%
application/haansofthwp | 201 | 20.1%
image/pjpeg | 127 | 12.7%
image/png | 49 | 4.9%
application/pdf | 16 | 1.6%
application/octet-stream | 15 | 1.5%
application/x-hwp | 6 | 0.6%
audio/mpeg | 6 | 0.6%
application/vnd.openxmlformats-officedocument.wordprocessingml.document | 6 | 0.6%
Other values (7) | 13 | 1.3%

Interactions

[25 pairwise interaction plots, one per pair of the five numeric variables; images not included]

Correlations

Variable | 방송본부 구분 코드 | Unnamed: 1 | 게시판구분코드 | 게시글번호 | 시퀀스 | 크기 | 타입
방송본부 구분 코드 | 1.000 | 1.000 | 0.946 | 0.711 | 0.595 | 0.360 | 0.828
Unnamed: 1 | 1.000 | 1.000 | 1.000 | 0.909 | 0.837 | 0.595 | 0.829
게시판구분코드 | 0.946 | 1.000 | 1.000 | 0.719 | 0.572 | 0.467 | 0.736
게시글번호 | 0.711 | 0.909 | 0.719 | 1.000 | 0.907 | 0.215 | 0.778
시퀀스 | 0.595 | 0.837 | 0.572 | 0.907 | 1.000 | 0.105 | 0.669
크기 | 0.360 | 0.595 | 0.467 | 0.215 | 0.105 | 1.000 | 0.764
타입 | 0.828 | 0.829 | 0.736 | 0.778 | 0.669 | 0.764 | 1.000

Variable | 방송본부 구분 코드 | 게시판구분코드 | 게시글번호 | 시퀀스 | 크기 | 타입
방송본부 구분 코드 | 1.000 | 0.990 | 0.653 | 0.653 | 0.251 | 0.445
게시판구분코드 | 0.990 | 1.000 | 0.632 | 0.632 | 0.227 | 0.464
게시글번호 | 0.653 | 0.632 | 1.000 | 1.000 | 0.659 | 0.514
시퀀스 | 0.653 | 0.632 | 1.000 | 1.000 | 0.660 | 0.416
크기 | 0.251 | 0.227 | 0.659 | 0.660 | 1.000 | 0.497
타입 | 0.445 | 0.464 | 0.514 | 0.416 | 0.497 | 1.000
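
The field counts in the Alerts section can be reproduced from the second (six-variable) matrix if an alert is assumed to fire whenever the absolute coefficient reaches 0.5. That threshold is an assumption: the report does not state it, although it yields the same groupings as the listed alerts. `highly_correlated` is a hypothetical helper written for this illustration.

```python
THRESHOLD = 0.5  # assumed cut-off; not stated in the report

columns = ["방송본부 구분 코드", "게시판구분코드", "게시글번호", "시퀀스", "크기", "타입"]

# Coefficients transcribed from the second correlation matrix.
matrix = [
    [1.000, 0.990, 0.653, 0.653, 0.251, 0.445],
    [0.990, 1.000, 0.632, 0.632, 0.227, 0.464],
    [0.653, 0.632, 1.000, 1.000, 0.659, 0.514],
    [0.653, 0.632, 1.000, 1.000, 0.660, 0.416],
    [0.251, 0.227, 0.659, 0.660, 1.000, 0.497],
    [0.445, 0.464, 0.514, 0.416, 0.497, 1.000],
]


def highly_correlated(names, coefficients, threshold):
    """Map each column to the other columns whose |coefficient| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        result[name] = [
            other
            for j, other in enumerate(names)
            if i != j and abs(coefficients[i][j]) >= threshold
        ]
    return result


alerts = highly_correlated(columns, matrix, THRESHOLD)
for name, others in alerts.items():
    print(f"{name}: {len(others)} field(s) -> {others}")
```

Under this assumption 타입 is paired only with 게시글번호, and 게시글번호 is paired with all five other fields, as in the Alerts section.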

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows

Row | 방송본부 구분 코드 | Unnamed: 1 | 게시판구분코드 | 게시글번호 | 시퀀스 | 파일명 | 저장경로 | 크기 | 타입
0 | 293 | TBN 광주매거진 | 3797 | 395087 | 11561 | 포스터 디자인.pdf | /file/broadcast/attach/293/15270598316290.pdf | 6330421 | application/pdf
1 | 162 | TBN 부산매거진 | 2096 | 155851 | 6022 | 양선호 - 1집 내 사랑 지니 - 01 - 내사랑 지니.mp3 | /file/oldprogram/B_STORY/양선호 - 1집 내 사랑 지니 - 01 - 내사랑 지니.mp3 | 4814 | <NA>
2 | 162 | TBN 부산매거진 | 2096 | 155871 | 6023 | CF_184hJ_NPM1_16_1.jpg | /file/oldprogram/B_STORY/CF_184hJ_NPM1_16_1.jpg | 5 | <NA>
3 | 162 | TBN 부산매거진 | 2096 | 155893 | 6024 | 25.jpg | /file/oldprogram/B_STORY/25.jpg | 70 | <NA>
4 | 162 | TBN 부산매거진 | 2096 | 155904 | 6025 | 트부성_final_전단.jpg | /file/oldprogram/B_STORY/트부성_final_전단.jpg | 522 | <NA>
5 | 162 | TBN 부산매거진 | 2096 | 155923 | 6026 | IMG_0052.jpg | /file/oldprogram/B_STORY/IMG_0052.jpg | 1997 | <NA>
6 | 162 | TBN 부산매거진 | 2096 | 221753 | 7414 | yoo.jpg | /file/broadcast/attach/162/13195642623060.jpg | 65528 | image/pjpeg
7 | 162 | TBN 부산매거진 | 2096 | 223293 | 7480 | 가족사진.jpg | /file/broadcast/attach/162/13218267709720.jpg | 321898 | image/pjpeg
8 | 162 | TBN 부산매거진 | 2096 | 223952 | 7516 | 정향 마을 봉사소감문 주형.hwp | /file/broadcast/attach/162/13227108137490.hwp | 710144 | application/haansofthwp
9 | 162 | TBN 부산매거진 | 2096 | 224209 | 7559 | DSCF3861.JPG | /file/broadcast/attach/162/13230509375460.JPG | 1495921 | image/pjpeg

Last 10 rows

Row | 방송본부 구분 코드 | Unnamed: 1 | 게시판구분코드 | 게시글번호 | 시퀀스 | 파일명 | 저장경로 | 크기 | 타입
989 | 155 | 한밤의 교차로 | 2005 | 405759 | 11565 | 566.JPG | /file/broadcast/attach/155/15397779824480.JPG | 2100319 | image/jpeg
990 | 155 | 한밤의 교차로 | 2005 | 414594 | 11571 | 017.JPG | /file/broadcast/attach/155/15506453333730.JPG | 893048 | image/jpeg
991 | 155 | 한밤의 교차로 | 2005 | 414936 | 11572 | 025.jpg | /file/broadcast/attach/155/15511050179620.jpg | 14351 | image/jpeg
992 | 155 | 한밤의 교차로 | 2005 | 415023 | 11573 | 050.jpg | /file/broadcast/attach/155/15511933706510.jpg | 78389 | image/jpeg
993 | 155 | 한밤의 교차로 | 2005 | 415483 | 11574 | 082.jpg | /file/broadcast/attach/155/15517976042660.jpg | 29767 | image/jpeg
994 | 155 | 한밤의 교차로 | 2005 | 416094 | 11576 | 1508534871878.jpg | /file/broadcast/attach/155/15524858635300.jpg | 30448 | image/jpeg
995 | 155 | 한밤의 교차로 | 2005 | 416462 | 11577 | 211.jpg | /file/broadcast/attach/155/15529722733240.jpg | 86402 | image/jpeg
996 | 155 | 한밤의 교차로 | 2005 | 444781 | 11974 | 20200420_015244.jpg | /file/broadcast/attach/155/15873870025370.jpg | 4139506 | image/jpeg
997 | 155 | 한밤의 교차로 | 2005 | 452110 | 12018 | 1595315367805.jpg | /file/broadcast/attach/155/15960810636350.jpg | 339537 | image/jpeg
998 | 155 | 한밤의 교차로 | 2005 | 474784 | 12119 | B612_20210403_165611_087.jpg | /file/broadcast/attach/155/16202156101780.jpg | 954018 | image/jpeg
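
In the sample rows, storage paths under /file/broadcast/attach/ contain a numeric directory that matches the 방송본부 구분 코드 of the same row (293, 162 and 155), while paths under /file/oldprogram/ carry no such component. The sketch below extracts that directory with the standard library. `headquarters_code_from_path` is a hypothetical helper, and the pattern is only an observation from these sample rows, not a documented rule of the dataset.

```python
from pathlib import PurePosixPath
from typing import Optional


def headquarters_code_from_path(path: str) -> Optional[int]:
    """Return the numeric directory that follows 'attach', or None if absent."""
    parts = PurePosixPath(path).parts
    if "attach" in parts:
        idx = parts.index("attach")
        # The component after 'attach' must exist and be purely numeric.
        if idx + 1 < len(parts) and parts[idx + 1].isdigit():
            return int(parts[idx + 1])
    return None


# Paths copied from the sample rows.
print(headquarters_code_from_path("/file/broadcast/attach/155/15397779824480.JPG"))
print(headquarters_code_from_path("/file/oldprogram/B_STORY/25.jpg"))
```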