Overview

Dataset statistics

Number of variables12
Number of observations4938
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory477.5 KiB
Average record size in memory99.0 B

Variable types

Numeric3
Text4
Categorical5

Dataset

Description대전광역시 서구 디지털자료 현황입니다.(순번, 등록번호, 서명, 저자, 출판사, 출판년도, 출판월, 별치기호, 청구기호, 자료실, 서가, 자료상태, MARC, 제어번호 입수구분)
URLhttps://www.data.go.kr/data/15104515/fileData.do

Alerts

별치기호 has constant value ""Constant
자료실 has constant value ""Constant
서가 has constant value ""Constant
자료상태 has constant value ""Constant
출판년도 is highly overall correlated with 제어번호High correlation
제어번호 is highly overall correlated with 출판년도 and 1 other fieldsHigh correlation
입수구분 is highly overall correlated with 제어번호High correlation
입수구분 is highly imbalanced (88.7%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:09:19.927195
Analysis finished2023-12-12 08:09:23.085631
Duration3.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct4938
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2550.4018
Minimum1
Maximum5145
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.5 KiB
2023-12-12T17:09:23.205178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile249.85
Q11283.25
median2552.5
Q33806.75
95-th percentile4896.15
Maximum5145
Range5144
Interquartile range (IQR)2523.5

Descriptive statistics

Standard deviation1476.3038
Coefficient of variation (CV)0.57885147
Kurtosis-1.1730884
Mean2550.4018
Median Absolute Deviation (MAD)1261.5
Skewness0.014156104
Sum12593884
Variance2179473
MonotonicityStrictly increasing
2023-12-12T17:09:23.403570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3388 1
 
< 0.1%
3395 1
 
< 0.1%
3394 1
 
< 0.1%
3393 1
 
< 0.1%
3392 1
 
< 0.1%
3391 1
 
< 0.1%
3390 1
 
< 0.1%
3389 1
 
< 0.1%
3387 1
 
< 0.1%
Other values (4928) 4928
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
5145 1
< 0.1%
5144 1
< 0.1%
5143 1
< 0.1%
5142 1
< 0.1%
5141 1
< 0.1%
5140 1
< 0.1%
5139 1
< 0.1%
5138 1
< 0.1%
5137 1
< 0.1%
5136 1
< 0.1%
Distinct4928
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
2023-12-12T17:09:23.774805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length6
Mean length6.1059133
Min length5

Characters and Unicode

Total characters30151
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4918 ?
Unique (%)99.6%

Sample

1st rowNB007053
2nd rowNB007019
3rd rowNB006920
4th rowNB007021
5th rowNB006903
ValueCountFrequency (%)
nb007494 2
 
< 0.1%
nb7535 2
 
< 0.1%
nb7529 2
 
< 0.1%
nb007493 2
 
< 0.1%
nb7525 2
 
< 0.1%
nb007491 2
 
< 0.1%
nb7524 2
 
< 0.1%
nb7528 2
 
< 0.1%
nb7532 2
 
< 0.1%
nb7526 2
 
< 0.1%
Other values (4918) 4918
99.6%
2023-12-12T17:09:24.335168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 4938
16.4%
B 4938
16.4%
0 2526
8.4%
4 2513
8.3%
5 2477
8.2%
6 2469
8.2%
1 2139
7.1%
7 1944
 
6.4%
2 1749
 
5.8%
3 1545
 
5.1%
Other values (2) 2913
9.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20275
67.2%
Uppercase Letter 9876
32.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2526
12.5%
4 2513
12.4%
5 2477
12.2%
6 2469
12.2%
1 2139
10.5%
7 1944
9.6%
2 1749
8.6%
3 1545
7.6%
8 1512
7.5%
9 1401
6.9%
Uppercase Letter
ValueCountFrequency (%)
N 4938
50.0%
B 4938
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20275
67.2%
Latin 9876
32.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2526
12.5%
4 2513
12.4%
5 2477
12.2%
6 2469
12.2%
1 2139
10.5%
7 1944
9.6%
2 1749
8.6%
3 1545
7.6%
8 1512
7.5%
9 1401
6.9%
Latin
ValueCountFrequency (%)
N 4938
50.0%
B 4938
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30151
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 4938
16.4%
B 4938
16.4%
0 2526
8.4%
4 2513
8.3%
5 2477
8.2%
6 2469
8.2%
1 2139
7.1%
7 1944
 
6.4%
2 1749
 
5.8%
3 1545
 
5.1%
Other values (2) 2913
9.7%

서명
Text

Distinct4820
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
2023-12-12T17:09:24.710599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length178
Median length87
Mean length14.311867
Min length1

Characters and Unicode

Total characters70672
Distinct characters1087
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4735 ?
Unique (%)95.9%

Sample

1st row락앤런 [DVD] .[24] ,Multiplication ROCK
2nd row도라도라 익스플로러 [DVD] .1-5 ,우리 처음 만난 날
3rd row가족의 나라 [비디오 녹화자료]
4th row도라도라 익스플로러 [DVD] .2-2 ,학교 가는 길 (My music teacher)
5th row저스틴 [비디오녹화자료]
ValueCountFrequency (%)
1066
 
6.4%
비디오녹화자료 264
 
1.6%
2 194
 
1.2%
dvd 191
 
1.2%
녹화자료 141
 
0.8%
1 118
 
0.7%
비디오 114
 
0.7%
3 113
 
0.7%
84
 
0.5%
삼국지 76
 
0.5%
Other values (7610) 14228
85.8%
2023-12-12T17:09:25.317291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12067
 
17.1%
1184
 
1.7%
1165
 
1.6%
1094
 
1.5%
. 971
 
1.4%
1 926
 
1.3%
: 920
 
1.3%
2 782
 
1.1%
776
 
1.1%
683
 
1.0%
Other values (1077) 50104
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39393
55.7%
Space Separator 12067
 
17.1%
Uppercase Letter 5376
 
7.6%
Lowercase Letter 4763
 
6.7%
Decimal Number 3617
 
5.1%
Other Punctuation 2899
 
4.1%
Close Punctuation 985
 
1.4%
Open Punctuation 982
 
1.4%
Dash Punctuation 383
 
0.5%
Math Symbol 156
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1184
 
3.0%
1165
 
3.0%
1094
 
2.8%
776
 
2.0%
683
 
1.7%
652
 
1.7%
629
 
1.6%
621
 
1.6%
529
 
1.3%
510
 
1.3%
Other values (988) 31550
80.1%
Uppercase Letter
ValueCountFrequency (%)
D 663
 
12.3%
E 509
 
9.5%
V 388
 
7.2%
I 350
 
6.5%
S 344
 
6.4%
A 325
 
6.0%
T 307
 
5.7%
R 287
 
5.3%
O 269
 
5.0%
L 263
 
4.9%
Other values (16) 1671
31.1%
Lowercase Letter
ValueCountFrequency (%)
e 682
14.3%
i 459
9.6%
n 397
 
8.3%
o 389
 
8.2%
s 375
 
7.9%
a 322
 
6.8%
l 294
 
6.2%
t 277
 
5.8%
r 248
 
5.2%
d 217
 
4.6%
Other values (15) 1103
23.2%
Other Punctuation
ValueCountFrequency (%)
. 971
33.5%
: 920
31.7%
, 682
23.5%
! 89
 
3.1%
/ 48
 
1.7%
& 46
 
1.6%
' 45
 
1.6%
? 42
 
1.4%
; 34
 
1.2%
" 16
 
0.6%
Other values (3) 6
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 926
25.6%
2 782
21.6%
3 393
10.9%
0 355
 
9.8%
4 258
 
7.1%
9 222
 
6.1%
5 204
 
5.6%
6 176
 
4.9%
7 160
 
4.4%
8 141
 
3.9%
Letter Number
ValueCountFrequency (%)
27
52.9%
15
29.4%
7
 
13.7%
1
 
2.0%
1
 
2.0%
Math Symbol
ValueCountFrequency (%)
= 110
70.5%
~ 30
 
19.2%
< 8
 
5.1%
> 8
 
5.1%
Close Punctuation
ValueCountFrequency (%)
] 641
65.1%
) 344
34.9%
Open Punctuation
ValueCountFrequency (%)
[ 638
65.0%
( 344
35.0%
Space Separator
ValueCountFrequency (%)
12067
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 383
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39373
55.7%
Common 21089
29.8%
Latin 10190
 
14.4%
Han 20
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1184
 
3.0%
1165
 
3.0%
1094
 
2.8%
776
 
2.0%
683
 
1.7%
652
 
1.7%
629
 
1.6%
621
 
1.6%
529
 
1.3%
510
 
1.3%
Other values (971) 31530
80.1%
Latin
ValueCountFrequency (%)
e 682
 
6.7%
D 663
 
6.5%
E 509
 
5.0%
i 459
 
4.5%
n 397
 
3.9%
o 389
 
3.8%
V 388
 
3.8%
s 375
 
3.7%
I 350
 
3.4%
S 344
 
3.4%
Other values (46) 5634
55.3%
Common
ValueCountFrequency (%)
12067
57.2%
. 971
 
4.6%
1 926
 
4.4%
: 920
 
4.4%
2 782
 
3.7%
, 682
 
3.2%
] 641
 
3.0%
[ 638
 
3.0%
3 393
 
1.9%
- 383
 
1.8%
Other values (23) 2686
 
12.7%
Han
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39361
55.7%
ASCII 31226
44.2%
Number Forms 51
 
0.1%
CJK 17
 
< 0.1%
Compat Jamo 12
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12067
38.6%
. 971
 
3.1%
1 926
 
3.0%
: 920
 
2.9%
2 782
 
2.5%
, 682
 
2.2%
e 682
 
2.2%
D 663
 
2.1%
] 641
 
2.1%
[ 638
 
2.0%
Other values (73) 12254
39.2%
Hangul
ValueCountFrequency (%)
1184
 
3.0%
1165
 
3.0%
1094
 
2.8%
776
 
2.0%
683
 
1.7%
652
 
1.7%
629
 
1.6%
621
 
1.6%
529
 
1.3%
510
 
1.3%
Other values (970) 31518
80.1%
Number Forms
ValueCountFrequency (%)
27
52.9%
15
29.4%
7
 
13.7%
1
 
2.0%
1
 
2.0%
Compat Jamo
ValueCountFrequency (%)
12
100.0%
CJK
ValueCountFrequency (%)
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (4) 4
23.5%
None
ValueCountFrequency (%)
· 2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct794
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
2023-12-12T17:09:25.606654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length7.5188335
Min length1

Characters and Unicode

Total characters37128
Distinct characters346
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique394 ?
Unique (%)8.0%

Sample

1st row아이굳키드
2nd row스크린에듀케이션
3rd row디에스미디어 [제작·판매]
4th row스크린 에듀케이션 [제작·공급]
5th row아트서비스 [제작·판매]
ValueCountFrequency (%)
엔터테인먼트 254
 
3.8%
비트윈(주 180
 
2.7%
트라이스타 148
 
2.2%
145
 
2.2%
콜럼비아 137
 
2.0%
워너브러더스코리아(주 122
 
1.8%
kbs미디어 117
 
1.7%
파라마운트 116
 
1.7%
비트윈 101
 
1.5%
크레지오닷컴 100
 
1.5%
Other values (710) 5319
78.9%
2023-12-12T17:09:26.070922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1834
 
4.9%
1803
 
4.9%
1623
 
4.4%
1351
 
3.6%
952
 
2.6%
939
 
2.5%
838
 
2.3%
827
 
2.2%
) 792
 
2.1%
( 788
 
2.1%
Other values (336) 25381
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28400
76.5%
Uppercase Letter 3471
 
9.3%
Space Separator 1803
 
4.9%
Close Punctuation 1119
 
3.0%
Open Punctuation 1115
 
3.0%
Lowercase Letter 752
 
2.0%
Other Punctuation 246
 
0.7%
Decimal Number 220
 
0.6%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1834
 
6.5%
1623
 
5.7%
1351
 
4.8%
952
 
3.4%
939
 
3.3%
838
 
3.0%
827
 
2.9%
786
 
2.8%
760
 
2.7%
717
 
2.5%
Other values (272) 17773
62.6%
Uppercase Letter
ValueCountFrequency (%)
S 437
12.6%
E 350
10.1%
B 333
 
9.6%
K 318
 
9.2%
D 312
 
9.0%
C 210
 
6.1%
R 176
 
5.1%
M 174
 
5.0%
N 146
 
4.2%
I 142
 
4.1%
Other values (15) 873
25.2%
Lowercase Letter
ValueCountFrequency (%)
i 100
13.3%
a 95
12.6%
e 93
12.4%
n 66
8.8%
o 63
8.4%
d 54
7.2%
t 53
7.0%
m 48
6.4%
r 40
 
5.3%
v 25
 
3.3%
Other values (12) 115
15.3%
Decimal Number
ValueCountFrequency (%)
2 109
49.5%
0 77
35.0%
1 31
 
14.1%
5 1
 
0.5%
8 1
 
0.5%
4 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
· 99
40.2%
, 93
37.8%
& 32
 
13.0%
. 16
 
6.5%
/ 6
 
2.4%
Close Punctuation
ValueCountFrequency (%)
) 792
70.8%
] 327
29.2%
Open Punctuation
ValueCountFrequency (%)
( 788
70.7%
[ 327
29.3%
Space Separator
ValueCountFrequency (%)
1803
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28400
76.5%
Common 4505
 
12.1%
Latin 4223
 
11.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1834
 
6.5%
1623
 
5.7%
1351
 
4.8%
952
 
3.4%
939
 
3.3%
838
 
3.0%
827
 
2.9%
786
 
2.8%
760
 
2.7%
717
 
2.5%
Other values (272) 17773
62.6%
Latin
ValueCountFrequency (%)
S 437
 
10.3%
E 350
 
8.3%
B 333
 
7.9%
K 318
 
7.5%
D 312
 
7.4%
C 210
 
5.0%
R 176
 
4.2%
M 174
 
4.1%
N 146
 
3.5%
I 142
 
3.4%
Other values (37) 1625
38.5%
Common
ValueCountFrequency (%)
1803
40.0%
) 792
17.6%
( 788
17.5%
] 327
 
7.3%
[ 327
 
7.3%
2 109
 
2.4%
· 99
 
2.2%
, 93
 
2.1%
0 77
 
1.7%
& 32
 
0.7%
Other values (7) 58
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28400
76.5%
ASCII 8629
 
23.2%
None 99
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1834
 
6.5%
1623
 
5.7%
1351
 
4.8%
952
 
3.4%
939
 
3.3%
838
 
3.0%
827
 
2.9%
786
 
2.8%
760
 
2.7%
717
 
2.5%
Other values (272) 17773
62.6%
ASCII
ValueCountFrequency (%)
1803
20.9%
) 792
 
9.2%
( 788
 
9.1%
S 437
 
5.1%
E 350
 
4.1%
B 333
 
3.9%
] 327
 
3.8%
[ 327
 
3.8%
K 318
 
3.7%
D 312
 
3.6%
Other values (53) 2842
32.9%
None
ValueCountFrequency (%)
· 99
100.0%

출판년도
Real number (ℝ)

HIGH CORRELATION 

Distinct34
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2006.0638
Minimum1963
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.5 KiB
2023-12-12T17:09:26.221211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1963
5-th percentile2000
Q12002
median2005
Q32008
95-th percentile2017
Maximum2021
Range58
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.0920016
Coefficient of variation (CV)0.0025383049
Kurtosis1.6165709
Mean2006.0638
Median Absolute Deviation (MAD)3
Skewness0.89268595
Sum9905943
Variance25.92848
MonotonicityNot monotonic
2023-12-12T17:09:26.392728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
2001 571
11.6%
2005 564
11.4%
2002 494
10.0%
2003 452
9.2%
2006 420
8.5%
2004 418
8.5%
2007 329
 
6.7%
2008 310
 
6.3%
2000 228
 
4.6%
2009 147
 
3.0%
Other values (24) 1005
20.4%
ValueCountFrequency (%)
1963 1
 
< 0.1%
1987 1
 
< 0.1%
1989 1
 
< 0.1%
1990 2
< 0.1%
1991 2
< 0.1%
1992 1
 
< 0.1%
1993 3
0.1%
1995 1
 
< 0.1%
1996 1
 
< 0.1%
1997 4
0.1%
ValueCountFrequency (%)
2021 28
 
0.6%
2020 69
1.4%
2019 60
1.2%
2018 71
1.4%
2017 84
1.7%
2016 50
1.0%
2015 105
2.1%
2014 91
1.8%
2013 107
2.2%
2012 93
1.9%

별치기호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
DV
4938 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDV
2nd rowDV
3rd rowDV
4th rowDV
5th rowDV

Common Values

ValueCountFrequency (%)
DV 4938
100.0%

Length

2023-12-12T17:09:26.528630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:09:26.639956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
dv 4938
100.0%
Distinct4919
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
2023-12-12T17:09:27.041842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length9.2347104
Min length5

Characters and Unicode

Total characters45601
Distinct characters14
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4902 ?
Unique (%)99.3%

Sample

1st row375.1 141 24
2nd row375.1 107 1-5
3rd row688.23 81
4th row375.1 109 2-2
5th row688.6 495
ValueCountFrequency (%)
688.24 1715
 
17.1%
688.21 690
 
6.9%
688.6 623
 
6.2%
688.7 268
 
2.7%
375.1 206
 
2.1%
688.8 171
 
1.7%
688.22 140
 
1.4%
673.52 125
 
1.2%
911 122
 
1.2%
688.23 86
 
0.9%
Other values (1794) 5876
58.6%
2023-12-12T17:09:27.628828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 9019
19.8%
6 6095
13.4%
5084
11.1%
2 4717
10.3%
. 4526
9.9%
1 4050
8.9%
4 3341
 
7.3%
3 2019
 
4.4%
5 1998
 
4.4%
7 1976
 
4.3%
Other values (4) 2776
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 35908
78.7%
Space Separator 5084
 
11.1%
Other Punctuation 4529
 
9.9%
Dash Punctuation 80
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 9019
25.1%
6 6095
17.0%
2 4717
13.1%
1 4050
11.3%
4 3341
 
9.3%
3 2019
 
5.6%
5 1998
 
5.6%
7 1976
 
5.5%
9 1391
 
3.9%
0 1302
 
3.6%
Other Punctuation
ValueCountFrequency (%)
. 4526
99.9%
, 3
 
0.1%
Space Separator
ValueCountFrequency (%)
5084
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 45601
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 9019
19.8%
6 6095
13.4%
5084
11.1%
2 4717
10.3%
. 4526
9.9%
1 4050
8.9%
4 3341
 
7.3%
3 2019
 
4.4%
5 1998
 
4.4%
7 1976
 
4.3%
Other values (4) 2776
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45601
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 9019
19.8%
6 6095
13.4%
5084
11.1%
2 4717
10.3%
. 4526
9.9%
1 4050
8.9%
4 3341
 
7.3%
3 2019
 
4.4%
5 1998
 
4.4%
7 1976
 
4.3%
Other values (4) 2776
 
6.1%

자료실
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
전자정보자료실
4938 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전자정보자료실
2nd row전자정보자료실
3rd row전자정보자료실
4th row전자정보자료실
5th row전자정보자료실

Common Values

ValueCountFrequency (%)
전자정보자료실 4938
100.0%

Length

2023-12-12T17:09:27.780546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:09:27.877995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전자정보자료실 4938
100.0%

서가
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
멀티미디어자료
4938 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row멀티미디어자료
2nd row멀티미디어자료
3rd row멀티미디어자료
4th row멀티미디어자료
5th row멀티미디어자료

Common Values

ValueCountFrequency (%)
멀티미디어자료 4938
100.0%

Length

2023-12-12T17:09:27.991933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:09:28.108351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
멀티미디어자료 4938
100.0%

자료상태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
이용가능
4938 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이용가능
2nd row이용가능
3rd row이용가능
4th row이용가능
5th row이용가능

Common Values

ValueCountFrequency (%)
이용가능 4938
100.0%

Length

2023-12-12T17:09:28.214207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:09:28.302187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
이용가능 4938
100.0%

제어번호
Real number (ℝ)

HIGH CORRELATION 

Distinct4922
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1206184.2
Minimum851660
Maximum4257243
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.5 KiB
2023-12-12T17:09:28.401820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum851660
5-th percentile855254.85
Q1858408.25
median883830.5
Q3916930.5
95-th percentile3599724.1
Maximum4257243
Range3405583
Interquartile range (IQR)58522.25

Descriptive statistics

Standard deviation879279.23
Coefficient of variation (CV)0.7289759
Kurtosis4.6337621
Mean1206184.2
Median Absolute Deviation (MAD)26964
Skewness2.5028124
Sum5.9561377 × 109
Variance7.7313196 × 1011
MonotonicityNot monotonic
2023-12-12T17:09:28.542555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
940421 4
 
0.1%
3719523 3
 
0.1%
4255979 2
 
< 0.1%
4255975 2
 
< 0.1%
4128335 2
 
< 0.1%
4128333 2
 
< 0.1%
4128330 2
 
< 0.1%
4255985 2
 
< 0.1%
4255982 2
 
< 0.1%
4255974 2
 
< 0.1%
Other values (4912) 4915
99.5%
ValueCountFrequency (%)
851660 1
< 0.1%
854994 1
< 0.1%
854995 1
< 0.1%
854996 1
< 0.1%
854997 1
< 0.1%
854998 1
< 0.1%
854999 1
< 0.1%
855000 1
< 0.1%
855010 1
< 0.1%
855011 1
< 0.1%
ValueCountFrequency (%)
4257243 1
< 0.1%
4257242 1
< 0.1%
4257241 1
< 0.1%
4257240 1
< 0.1%
4257238 1
< 0.1%
4257237 1
< 0.1%
4257236 1
< 0.1%
4257235 1
< 0.1%
4257234 1
< 0.1%
4257233 1
< 0.1%

입수구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.7 KiB
구입
4863 
수증
 
75

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구입
2nd row구입
3rd row구입
4th row구입
5th row구입

Common Values

ValueCountFrequency (%)
구입 4863
98.5%
수증 75
 
1.5%

Length

2023-12-12T17:09:28.668442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:09:28.749905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구입 4863
98.5%
수증 75
 
1.5%

Interactions

2023-12-12T17:09:22.335191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:21.597122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:21.946517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:22.445253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:21.689752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:22.088881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:22.579918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:21.799071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:09:22.206572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:09:28.803615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출판년도제어번호입수구분
순번1.0000.7290.6530.218
출판년도0.7291.0000.7930.204
제어번호0.6530.7931.0000.712
입수구분0.2180.2040.7121.000
2023-12-12T17:09:28.892457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출판년도제어번호입수구분
순번1.000-0.013-0.0300.167
출판년도-0.0131.0000.9120.146
제어번호-0.0300.9121.0000.525
입수구분0.1670.1460.5251.000

Missing values

2023-12-12T17:09:22.770314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:09:22.986179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번등록번호서명출판사출판년도별치기호청구기호자료실서가자료상태제어번호입수구분
01NB007053락앤런 [DVD] .[24] ,Multiplication ROCK아이굳키드2015DV375.1 141 24전자정보자료실멀티미디어자료이용가능3047126구입
12NB007019도라도라 익스플로러 [DVD] .1-5 ,우리 처음 만난 날스크린에듀케이션2009DV375.1 107 1-5전자정보자료실멀티미디어자료이용가능2973720구입
23NB006920가족의 나라 [비디오 녹화자료]디에스미디어 [제작·판매]2013DV688.23 81전자정보자료실멀티미디어자료이용가능2663300구입
34NB007021도라도라 익스플로러 [DVD] .2-2 ,학교 가는 길 (My music teacher)스크린 에듀케이션 [제작·공급]2009DV375.1 109 2-2전자정보자료실멀티미디어자료이용가능2973722구입
45NB006903저스틴 [비디오녹화자료]아트서비스 [제작·판매]2014DV688.6 495전자정보자료실멀티미디어자료이용가능2663127구입
56NB007027도라도라 익스플로러 [DVD] .3-3 ,무지개야 용기를 내! (The shy rainbow)스크린 에듀케이션 [제작·공급]2011DV375.1 115 3-3전자정보자료실멀티미디어자료이용가능2974164구입
67NB006895거룩한 소녀 마리아 [DVD]올라잇픽쳐스2015DV688.8 139전자정보자료실멀티미디어자료이용가능2638530구입
78NB007017도라도라 익스플로러 [DVD] .1-3 ,친구들을 구하자스크린 에듀케이션[제작·공급]2008DV375.1 105 1-3전자정보자료실멀티미디어자료이용가능2973718구입
89NB006904타잔 [비디오녹화자료]아트서비스 [제작·판매]2014DV688.6 496전자정보자료실멀티미디어자료이용가능2663131구입
910NB006906인사동 스캔들 [비디오녹화자료]KD미디어2009DV688.21 569전자정보자료실멀티미디어자료이용가능2663134구입
순번등록번호서명출판사출판년도별치기호청구기호자료실서가자료상태제어번호입수구분
49285136NB7590플립 [DVD]워너브러더스2011DV688.24 1764전자정보자료실멀티미디어자료이용가능4257238구입
49295137NB7549(21세기 청춘 패러독스) 수성못 [비디오녹화자료] = Duck Town비디오여행2019DV688.21 722전자정보자료실멀티미디어자료이용가능4256008수증
49305138NB7552저수지에서 건진 치타 [비디오 녹화자료]인디스토리2008DV688.21 725전자정보자료실멀티미디어자료이용가능4255990수증
49315139NB7555파티 51 [비디오녹화자료]이오스 엔터테인먼트 [제작·판매]2015DV688.21 728전자정보자료실멀티미디어자료이용가능4255995수증
49325140NB7556팔월의 일요일들 [비디오녹화자료] = Sunday in August인디스토리2008DV688.21 729전자정보자료실멀티미디어자료이용가능4255993수증
49335141NB007498삐삐 롱스타킹 1집-2 [DVD]스크린에듀케이션2010DV375.1 210 1-2전자정보자료실멀티미디어자료이용가능4128344구입
49345142NB007457샤크 스쿨 2 : 오션월드[DVD]미디어룩2020DV688.6 662전자정보자료실멀티미디어자료이용가능4125080구입
49355143NB007438드래곤 헌터 / 전체관람 [DVD]미디어포유2019DV688.6 651전자정보자료실멀티미디어자료이용가능4046258구입
49365144NB007442반지의 비밀일기 극장판비디오여행2020DV688.6 655전자정보자료실멀티미디어자료이용가능4046265구입
49375145NB007453열혈형사 [비디오 녹화자료]노바미디어 [제작]2020DV688.21 684전자정보자료실멀티미디어자료이용가능4046286구입