Overview

Dataset statistics

Number of variables: 3
Number of observations: 200
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.2 KiB
Average record size in memory: 26.6 B

Variable types

Numeric: 2
Text: 1

Dataset

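Description: Analysis data based on the news database "BIGKinds", plus other meta-information. This is news big-data analysis material in which related news and keywords can be looked up. More information is available at https://www.bigkinds.or.kr.
Author: 한국언론진흥재단 (Korea Press Foundation)
URL: https://www.data.go.kr/data/15012945/fileData.do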

Alerts

순위 (rank) is highly overall correlated with 빈도수 (frequency) [High correlation]
빈도수 (frequency) is highly overall correlated with 순위 (rank) [High correlation]
순위 (rank) has unique values [Unique]
키워드 (keyword) has unique values [Unique]

Reproduction

Analysis started: 2024-03-14 12:39:43.990942
Analysis finished: 2024-03-14 12:39:45.122677
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순위 (rank)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.5
Minimum: 1
Maximum: 200
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.95
Q1: 50.75
Median: 100.5
Q3: 150.25
95-th percentile: 190.05
Maximum: 200
Range: 199
Interquartile range (IQR): 99.5
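
The fractional percentiles above are what linear interpolation between order statistics produces. The following is a minimal sketch, assuming that 순위 holds exactly the integers 1 to 200 (consistent with the counts, minimum, maximum and sum reported here, but not stated outright) and that the profiler interpolates linearly. `linear_quantile` is a hypothetical helper written for this illustration, not part of ydata-profiling.

```python
import math

def linear_quantile(values, q):
    """Quantile q (0..1) using linear interpolation between sorted values."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)          # fractional index into the sorted data
    lo = math.floor(pos)
    hi = min(lo + 1, len(ordered) - 1)    # clamp so q = 1.0 stays in range
    return ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])

# Assumption: the 순위 column is the sequence 1, 2, ..., 200.
ranks = list(range(1, 201))

q05, q1, med, q3, q95 = (
    linear_quantile(ranks, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)
)
iqr = q3 - q1                  # interquartile range
value_range = max(ranks) - min(ranks)

print(round(q05, 2), q1, med, q3, round(q95, 2), iqr, value_range)
```

Under these assumptions the sketch yields the same quantiles, range and IQR as the listing above.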

Descriptive statistics

Standard deviation: 57.879185
Coefficient of variation (CV): 0.57591228
Kurtosis: -1.2
Mean: 100.5
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 20100
Variance: 3350
Monotonicity: Strictly increasing
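
Several of these figures can be cross-checked with the Python standard library. This is a sketch only, again assuming 순위 is the sequence 1 to 200. The reported variance of 3350 matches the sample variance (divisor n - 1), not the population variance, so the sample forms are used here.

```python
import statistics

# Assumption: the 순위 column is the sequence 1, 2, ..., 200.
ranks = list(range(1, 201))

total = sum(ranks)
mean = statistics.mean(ranks)
variance = statistics.variance(ranks)     # sample variance, divisor n - 1
std = statistics.stdev(ranks)             # square root of the sample variance
cv = std / mean                           # coefficient of variation
median = statistics.median(ranks)
# Median absolute deviation: median distance from the median.
mad = statistics.median([abs(x - median) for x in ranks])

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```
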
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1                       1  0.5%
139                     1  0.5%
129                     1  0.5%
130                     1  0.5%
131                     1  0.5%
132                     1  0.5%
133                     1  0.5%
134                     1  0.5%
135                     1  0.5%
136                     1  0.5%
Other values (190)    190  95.0%

Value  Count  Frequency (%)
1          1  0.5%
2          1  0.5%
3          1  0.5%
4          1  0.5%
5          1  0.5%
6          1  0.5%
7          1  0.5%
8          1  0.5%
9          1  0.5%
10         1  0.5%

Value  Count  Frequency (%)
200        1  0.5%
199        1  0.5%
198        1  0.5%
197        1  0.5%
196        1  0.5%
195        1  0.5%
194        1  0.5%
193        1  0.5%
192        1  0.5%
191        1  0.5%

키워드 (keyword)
Text

UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 9
Median length: 3
Mean length: 3.565
Min length: 3

Characters and Unicode

Total characters: 713
Distinct characters: 195
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
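
As an illustration of how such properties can be read per code point, here is a minimal sketch using Python's standard `unicodedata` module on three keyword values that appear in this report. The general category comes directly from `unicodedata.category`. The standard library has no script or block lookup, so `block_of` is a hypothetical helper that only distinguishes the two blocks seen in this report by code point range. The report labels the Hangul block simply "Hangul"; the Unicode block name is "Hangul Syllables".

```python
import unicodedata
from collections import Counter

# Three keyword values taken from this report.
keywords = ["대통령", "문재인_정부", "연동형_비례대표제"]

def block_of(ch):
    """Hypothetical helper: coarse block lookup by code point range."""
    cp = ord(ch)
    if cp <= 0x7F:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:            # Hangul Syllables block
        return "Hangul Syllables"
    return "Other"

# Normalize to NFC so each Hangul syllable is a single code point.
chars = [ch for kw in keywords for ch in unicodedata.normalize("NFC", kw)]

# "Lo" is Other Letter, "Pc" is Connector Punctuation.
category_counts = Counter(unicodedata.category(ch) for ch in chars)
block_counts = Counter(block_of(ch) for ch in chars)

print(category_counts)
print(block_counts)
```

Only the underscore falls outside the Other Letter category in these samples, which mirrors the two-category split reported for the full column.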

Unique

Unique: 200
Unique (%): 100.0%

Sample

1st row: 대통령
2nd row: 한국당
3rd row: 청와대
4th row: 민주당
5th row: 위원장
Value               Count  Frequency (%)
한국당                  2  1.0%
대통령                  1  0.5%
김재원                  1  0.5%
신년사                  1  0.5%
탄핵소추안              1  0.5%
문재인_정부             1  0.5%
예비후보                1  0.5%
성탄절                  1  0.5%
체육회                  1  0.5%
연동형_비례대표제       1  0.5%
Other values (189)    189  94.5%

Most occurring characters

Value               Count  Frequency (%)
_                      19  2.7%
[missing]              17  2.4%
[missing]              17  2.4%
[missing]              16  2.2%
[missing]              15  2.1%
[missing]              15  2.1%
[missing]              15  2.1%
[missing]              15  2.1%
[missing]              14  2.0%
[missing]              13  1.8%
Other values (185)    557  78.1%

Most occurring categories

Value                  Count  Frequency (%)
Other Letter             694  97.3%
Connector Punctuation     19  2.7%

Most frequent character per category

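Other Letter
Value               Count  Frequency (%)
[missing]              17  2.4%
[missing]              17  2.4%
[missing]              16  2.3%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              14  2.0%
[missing]              13  1.9%
[missing]              13  1.9%
Other values (184)    544  78.4%

Connector Punctuation
Value  Count  Frequency (%)
_         19  100.0%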

Most occurring scripts

Value   Count  Frequency (%)
Hangul    694  97.3%
Common     19  2.7%

Most frequent character per script

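Hangul
Value               Count  Frequency (%)
[missing]              17  2.4%
[missing]              17  2.4%
[missing]              16  2.3%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              14  2.0%
[missing]              13  1.9%
[missing]              13  1.9%
Other values (184)    544  78.4%

Common
Value  Count  Frequency (%)
_         19  100.0%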

Most occurring blocks

Value   Count  Frequency (%)
Hangul    694  97.3%
ASCII      19  2.7%

Most frequent character per block

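ASCII
Value  Count  Frequency (%)
_         19  100.0%

Hangul
Value               Count  Frequency (%)
[missing]              17  2.4%
[missing]              17  2.4%
[missing]              16  2.3%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              15  2.2%
[missing]              14  2.0%
[missing]              13  1.9%
[missing]              13  1.9%
Other values (184)    544  78.4%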

빈도수 (frequency)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 163
Distinct (%): 81.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 615.32
Minimum: 122
Maximum: 9556
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 122
5-th percentile: 130
Q1: 166
Median: 257
Q3: 486.5
95-th percentile: 1935.9
Maximum: 9556
Range: 9434
Interquartile range (IQR): 320.5

Descriptive statistics

Standard deviation: 1234.0689
Coefficient of variation (CV): 2.0055726
Kurtosis: 31.716273
Mean: 615.32
Median Absolute Deviation (MAD): 112.5
Skewness: 5.2647612
Sum: 123064
Variance: 1522926.2
Monotonicity: Decreasing
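
The derived figures for 빈도수 follow from the primary ones by simple arithmetic. The sketch below recomputes them from the values printed in this report; it does not use the underlying data, so it only checks internal consistency, and the variance agrees only up to the rounding of the printed standard deviation.

```python
# Figures copied from the report for 빈도수.
n = 200                      # number of observations
total = 123064               # Sum
minimum, maximum = 122, 9556
q1, q3 = 166, 486.5
std = 1234.0689              # Standard deviation (rounded as printed)

mean = total / n             # mean = sum / count
value_range = maximum - minimum
iqr = q3 - q1                # interquartile range
cv = std / mean              # coefficient of variation
variance = std ** 2          # approximate, because std is rounded

print(mean, value_range, iqr, round(cv, 7), round(variance, 1))
```

The mean (615.32) is more than twice the median (257), which is consistent with the strong positive skewness reported above.
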
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
222                     4  2.0%
130                     3  1.5%
162                     3  1.5%
175                     3  1.5%
176                     3  1.5%
133                     3  1.5%
301                     3  1.5%
127                     3  1.5%
124                     3  1.5%
189                     2  1.0%
Other values (153)    170  85.0%

Value  Count  Frequency (%)
122        1  0.5%
123        1  0.5%
124        3  1.5%
127        3  1.5%
128        1  0.5%
130        3  1.5%
131        2  1.0%
132        2  1.0%
133        3  1.5%
134        1  0.5%

Value  Count  Frequency (%)
9556       1  0.5%
9547       1  0.5%
6619       1  0.5%
6550       1  0.5%
3791       1  0.5%
3523       1  0.5%
3180       1  0.5%
2842       1  0.5%
2344       1  0.5%
2143       1  0.5%

Interactions

[Four interaction plots for 순위 and 빈도수; images not included]

Correlations

         순위     빈도수
순위     1.000    0.609
빈도수   0.609    1.000

         순위     빈도수
순위     1.000   -1.000
빈도수  -1.000    1.000
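
The report does not say which coefficient each matrix uses. A rank-based coefficient such as Spearman's gives exactly -1 whenever one column strictly decreases as the other increases, which would fit the second matrix; treating it as Spearman is an assumption. The sketch below implements Spearman's coefficient from scratch (Pearson's coefficient applied to average ranks) and runs it on the first ten rows listed in the Sample section. `average_ranks`, `pearson` and `spearman` are names chosen for this illustration.

```python
import math

def average_ranks(values):
    """Rank values from 1 upward; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# First ten rows from the Sample section (순위, 빈도수).
rank = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
freq = [9556, 9547, 6619, 6550, 3791, 3523, 3180, 2842, 2344, 2143]

rho = spearman(rank, freq)   # rank-based, so only the ordering matters
r = pearson(rank, freq)      # linear, so sensitive to the steep drop-off

print(round(rho, 3), round(r, 3))
```

Because frequency falls off steeply and non-linearly with rank, the linear coefficient on these rows is weaker in magnitude than the rank-based one.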

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion visually.]

Sample

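Index  순위 (rank)  키워드 (keyword)  빈도수 (frequency)
0      1            대통령            9556
1      2            한국당            9547
2      3            청와대            6619
3      4            민주당            6550
4      5            위원장            3791
5      6            선거법            3523
6      7            본회의            3180
7      8            트럼프            2842
8      9            원내대표          2344
9      10           예산안            2143

Index  순위 (rank)  키워드 (keyword)  빈도수 (frequency)
190    191          감찰_중단         130
191    192          한반도_상공       128
192    193          시위대            127
193    194          수출규제          127
194    195          비서실장          127
195    196          유엔_안보리       124
196    197          데이터            124
197    198          단체장            124
198    199          탄핵안            123
199    200          시의원            122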