Overview

Dataset statistics

Number of variables2
Number of observations100
Missing cells1
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory18.3 B

Variable types

Numeric1
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-2788/C/1/datasetView.do

Alerts

* 한국십진분류표 has unique valuesUnique

Reproduction

Analysis started2024-03-23 02:56:57.021814
Analysis finished2024-03-23 02:56:57.632106
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

* 한국십진분류표
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean495
Minimum0
Maximum990
Zeros1
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-03-23T02:56:57.777487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile49.5
Q1247.5
median495
Q3742.5
95-th percentile940.5
Maximum990
Range990
Interquartile range (IQR)495

Descriptive statistics

Standard deviation290.11492
Coefficient of variation (CV)0.58609075
Kurtosis-1.2
Mean495
Median Absolute Deviation (MAD)250
Skewness0
Sum49500
Variance84166.667
MonotonicityStrictly increasing
2024-03-23T02:56:58.175204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1
 
1.0%
640 1
 
1.0%
740 1
 
1.0%
730 1
 
1.0%
720 1
 
1.0%
710 1
 
1.0%
700 1
 
1.0%
690 1
 
1.0%
680 1
 
1.0%
670 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
0 1
1.0%
10 1
1.0%
20 1
1.0%
30 1
1.0%
40 1
1.0%
50 1
1.0%
60 1
1.0%
70 1
1.0%
80 1
1.0%
90 1
1.0%
ValueCountFrequency (%)
990 1
1.0%
980 1
1.0%
970 1
1.0%
960 1
1.0%
950 1
1.0%
940 1
1.0%
930 1
1.0%
920 1
1.0%
910 1
1.0%
900 1
1.0%
Distinct99
Distinct (%)100.0%
Missing1
Missing (%)1.0%
Memory size932.0 B
2024-03-23T02:56:58.683872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length6.2424242
Min length3

Characters and Unicode

Total characters618
Distinct characters140
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)100.0%

Sample

1st row총    류
2nd row도서학,서지학
3rd row문헌정보학
4th row백과사전
5th row강연집,수필집,연설문집
ValueCountFrequency (%)
18
 
9.8%
6
 
3.3%
6
 
3.3%
6
 
3.3%
기타 5
 
2.7%
4
 
2.2%
일반 3
 
1.6%
2
 
1.1%
2
 
1.1%
2
 
1.1%
Other values (116) 130
70.7%
2024-03-23T02:56:59.469647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
13.8%
  66
 
10.7%
59
 
9.5%
, 25
 
4.0%
16
 
2.6%
13
 
2.1%
13
 
2.1%
12
 
1.9%
11
 
1.8%
11
 
1.8%
Other values (130) 307
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 440
71.2%
Space Separator 151
 
24.4%
Other Punctuation 25
 
4.0%
Close Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
13.4%
16
 
3.6%
13
 
3.0%
13
 
3.0%
12
 
2.7%
11
 
2.5%
11
 
2.5%
11
 
2.5%
8
 
1.8%
8
 
1.8%
Other values (125) 278
63.2%
Space Separator
ValueCountFrequency (%)
85
56.3%
  66
43.7%
Other Punctuation
ValueCountFrequency (%)
, 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 440
71.2%
Common 178
28.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
13.4%
16
 
3.6%
13
 
3.0%
13
 
3.0%
12
 
2.7%
11
 
2.5%
11
 
2.5%
11
 
2.5%
8
 
1.8%
8
 
1.8%
Other values (125) 278
63.2%
Common
ValueCountFrequency (%)
85
47.8%
  66
37.1%
, 25
 
14.0%
) 1
 
0.6%
( 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 440
71.2%
ASCII 112
 
18.1%
None 66
 
10.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
85
75.9%
, 25
 
22.3%
) 1
 
0.9%
( 1
 
0.9%
None
ValueCountFrequency (%)
  66
100.0%
Hangul
ValueCountFrequency (%)
59
 
13.4%
16
 
3.6%
13
 
3.0%
13
 
3.0%
12
 
2.7%
11
 
2.5%
11
 
2.5%
11
 
2.5%
8
 
1.8%
8
 
1.8%
Other values (125) 278
63.2%

Interactions

2024-03-23T02:56:57.249079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T02:56:59.962187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
* 한국십진분류표Unnamed: 1
* 한국십진분류표1.0001.000
Unnamed: 11.0001.000

Missing values

2024-03-23T02:56:57.442587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T02:56:57.584836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

* 한국십진분류표Unnamed: 1
00총    류
110도서학,서지학
220문헌정보학
330백과사전
440강연집,수필집,연설문집
550일반 연속간행물
660일반 학회,단체,협회,기관
770신문,저널리즘
880일반 전집,총서
990향토자료
* 한국십진분류표Unnamed: 1
90900역    사
91910아 시 아
92920유    럽
93930아프리카
94940북아메리카
95950남아메리카
96960오세아니아
97970양극지방
98980지    리
99990전    기