Overview

Dataset statistics

Number of variables4
Number of observations6372
Missing cells112
Missing cells (%)0.4%
Duplicate rows124
Duplicate rows (%)1.9%
Total size in memory199.3 KiB
Average record size in memory32.0 B

Variable types

Text4

Dataset

Description평생학습계좌제 학습과정 운영 기관에서 등록한 교재 관련 정보로서 교재명, 저자명, 출판사명, 출판년도에 관한 네 가지 정보를 제공합니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15090722/fileData.do

Alerts

Dataset has 124 (1.9%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 14:05:53.114112
Analysis finished2023-12-12 14:05:55.002943
Duration1.89 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct5512
Distinct (%)86.5%
Missing1
Missing (%)< 0.1%
Memory size49.9 KiB
2023-12-12T23:05:55.286302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length94
Median length77
Mean length11.719981
Min length1

Characters and Unicode

Total characters74668
Distinct characters980
Distinct categories15 ?
Distinct scripts5 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5079 ?
Unique (%)79.7%

Sample

1st row한국사 마인드맵 지도자과정
2nd row나를 미치게 하는 너
3rd row마흔의 심리학
4th row모모
5th row아픈 영혼, 책을 만나다
ValueCountFrequency (%)
144
 
0.9%
실제 124
 
0.8%
나무 112
 
0.7%
위한 112
 
0.7%
이론과 111
 
0.7%
1 97
 
0.6%
2급 74
 
0.5%
72
 
0.5%
2 70
 
0.5%
중국어 70
 
0.5%
Other values (7176) 14494
93.6%
2023-12-12T23:05:55.852413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9214
 
12.3%
1325
 
1.8%
1087
 
1.5%
984
 
1.3%
876
 
1.2%
862
 
1.2%
857
 
1.1%
2 831
 
1.1%
1 744
 
1.0%
e 744
 
1.0%
Other values (970) 57144
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51009
68.3%
Space Separator 9215
 
12.3%
Lowercase Letter 5777
 
7.7%
Uppercase Letter 3624
 
4.9%
Decimal Number 2828
 
3.8%
Other Punctuation 673
 
0.9%
Close Punctuation 551
 
0.7%
Open Punctuation 551
 
0.7%
Dash Punctuation 298
 
0.4%
Math Symbol 63
 
0.1%
Other values (5) 79
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1325
 
2.6%
1087
 
2.1%
984
 
1.9%
876
 
1.7%
862
 
1.7%
857
 
1.7%
739
 
1.4%
724
 
1.4%
709
 
1.4%
708
 
1.4%
Other values (875) 42138
82.6%
Lowercase Letter
ValueCountFrequency (%)
e 744
12.9%
a 491
 
8.5%
i 481
 
8.3%
n 446
 
7.7%
t 444
 
7.7%
r 431
 
7.5%
o 415
 
7.2%
s 351
 
6.1%
l 241
 
4.2%
h 220
 
3.8%
Other values (16) 1513
26.2%
Uppercase Letter
ValueCountFrequency (%)
S 367
 
10.1%
E 341
 
9.4%
T 330
 
9.1%
I 265
 
7.3%
P 247
 
6.8%
C 242
 
6.7%
A 208
 
5.7%
B 189
 
5.2%
O 175
 
4.8%
N 168
 
4.6%
Other values (16) 1092
30.1%
Other Punctuation
ValueCountFrequency (%)
, 287
42.6%
& 90
 
13.4%
: 72
 
10.7%
/ 64
 
9.5%
. 42
 
6.2%
· 35
 
5.2%
? 21
 
3.1%
! 19
 
2.8%
; 15
 
2.2%
% 14
 
2.1%
Other values (3) 14
 
2.1%
Decimal Number
ValueCountFrequency (%)
2 831
29.4%
1 744
26.3%
0 557
19.7%
3 255
 
9.0%
4 119
 
4.2%
5 105
 
3.7%
7 85
 
3.0%
6 56
 
2.0%
8 52
 
1.8%
9 24
 
0.8%
Math Symbol
ValueCountFrequency (%)
~ 35
55.6%
+ 19
30.2%
| 7
 
11.1%
< 1
 
1.6%
> 1
 
1.6%
Letter Number
ValueCountFrequency (%)
25
42.4%
23
39.0%
6
 
10.2%
5
 
8.5%
Space Separator
ValueCountFrequency (%)
9214
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 479
86.9%
] 72
 
13.1%
Open Punctuation
ValueCountFrequency (%)
( 479
86.9%
[ 72
 
13.1%
Dash Punctuation
ValueCountFrequency (%)
- 298
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%
Final Punctuation
ValueCountFrequency (%)
8
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 50862
68.1%
Common 14199
 
19.0%
Latin 9460
 
12.7%
Han 142
 
0.2%
Hiragana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1325
 
2.6%
1087
 
2.1%
984
 
1.9%
876
 
1.7%
862
 
1.7%
857
 
1.7%
739
 
1.5%
724
 
1.4%
709
 
1.4%
708
 
1.4%
Other values (785) 41991
82.6%
Han
ValueCountFrequency (%)
8
 
5.6%
6
 
4.2%
5
 
3.5%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (75) 100
70.4%
Latin
ValueCountFrequency (%)
e 744
 
7.9%
a 491
 
5.2%
i 481
 
5.1%
n 446
 
4.7%
t 444
 
4.7%
r 431
 
4.6%
o 415
 
4.4%
S 367
 
3.9%
s 351
 
3.7%
E 341
 
3.6%
Other values (46) 4949
52.3%
Common
ValueCountFrequency (%)
9214
64.9%
2 831
 
5.9%
1 744
 
5.2%
0 557
 
3.9%
) 479
 
3.4%
( 479
 
3.4%
- 298
 
2.1%
, 287
 
2.0%
3 255
 
1.8%
4 119
 
0.8%
Other values (29) 936
 
6.6%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 50855
68.1%
ASCII 23554
31.5%
CJK 136
 
0.2%
Number Forms 59
 
0.1%
None 36
 
< 0.1%
Punctuation 8
 
< 0.1%
Compat Jamo 7
 
< 0.1%
CJK Compat Ideographs 6
 
< 0.1%
Hiragana 5
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9214
39.1%
2 831
 
3.5%
1 744
 
3.2%
e 744
 
3.2%
0 557
 
2.4%
a 491
 
2.1%
i 481
 
2.0%
) 479
 
2.0%
( 479
 
2.0%
n 446
 
1.9%
Other values (77) 9088
38.6%
Hangul
ValueCountFrequency (%)
1325
 
2.6%
1087
 
2.1%
984
 
1.9%
876
 
1.7%
862
 
1.7%
857
 
1.7%
739
 
1.5%
724
 
1.4%
709
 
1.4%
708
 
1.4%
Other values (780) 41984
82.6%
None
ValueCountFrequency (%)
· 35
97.2%
  1
 
2.8%
Number Forms
ValueCountFrequency (%)
25
42.4%
23
39.0%
6
 
10.2%
5
 
8.5%
CJK
ValueCountFrequency (%)
8
 
5.9%
6
 
4.4%
5
 
3.7%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (70) 94
69.1%
Punctuation
ValueCountFrequency (%)
8
100.0%
Compat Jamo
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

저자
Text

Distinct3863
Distinct (%)60.9%
Missing30
Missing (%)0.5%
Memory size49.9 KiB
2023-12-12T23:05:56.138327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length48
Mean length6.4880164
Min length1

Characters and Unicode

Total characters41147
Distinct characters681
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3013 ?
Unique (%)47.5%

Sample

1st row양근숙
2nd row레스카터
3rd row황선미
4th row이경수
5th row김영아
ValueCountFrequency (%)
482
 
5.1%
247
 
2.6%
자체제작 101
 
1.1%
교육과학기술부 85
 
0.9%
공저 81
 
0.9%
편집부 74
 
0.8%
교육부 69
 
0.7%
해군교육사령부 66
 
0.7%
강사 58
 
0.6%
kim 56
 
0.6%
Other values (4538) 8087
86.0%
2023-12-12T23:05:56.597425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3145
 
7.6%
1126
 
2.7%
, 1039
 
2.5%
1007
 
2.4%
872
 
2.1%
867
 
2.1%
748
 
1.8%
628
 
1.5%
572
 
1.4%
528
 
1.3%
Other values (671) 30615
74.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31119
75.6%
Lowercase Letter 3153
 
7.7%
Space Separator 3150
 
7.7%
Uppercase Letter 1536
 
3.7%
Other Punctuation 1337
 
3.2%
Decimal Number 320
 
0.8%
Close Punctuation 179
 
0.4%
Dash Punctuation 177
 
0.4%
Open Punctuation 173
 
0.4%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1126
 
3.6%
1007
 
3.2%
872
 
2.8%
867
 
2.8%
748
 
2.4%
628
 
2.0%
572
 
1.8%
528
 
1.7%
519
 
1.7%
494
 
1.6%
Other values (593) 23758
76.3%
Lowercase Letter
ValueCountFrequency (%)
a 434
13.8%
e 396
12.6%
n 305
9.7%
r 299
9.5%
i 265
 
8.4%
o 183
 
5.8%
s 173
 
5.5%
l 168
 
5.3%
c 133
 
4.2%
t 123
 
3.9%
Other values (16) 674
21.4%
Uppercase Letter
ValueCountFrequency (%)
G 108
 
7.0%
M 107
 
7.0%
K 106
 
6.9%
J 106
 
6.9%
S 105
 
6.8%
C 103
 
6.7%
A 100
 
6.5%
R 96
 
6.2%
E 72
 
4.7%
T 66
 
4.3%
Other values (15) 567
36.9%
Decimal Number
ValueCountFrequency (%)
1 79
24.7%
2 70
21.9%
3 42
13.1%
4 34
10.6%
5 30
 
9.4%
6 25
 
7.8%
7 15
 
4.7%
0 10
 
3.1%
9 9
 
2.8%
8 6
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 1039
77.7%
. 162
 
12.1%
/ 52
 
3.9%
& 42
 
3.1%
· 26
 
1.9%
: 9
 
0.7%
; 5
 
0.4%
2
 
0.1%
Space Separator
ValueCountFrequency (%)
3145
99.8%
  5
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 178
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 172
99.4%
[ 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
| 2
66.7%
× 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 177
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31074
75.5%
Common 5339
 
13.0%
Latin 4689
 
11.4%
Han 45
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1126
 
3.6%
1007
 
3.2%
872
 
2.8%
867
 
2.8%
748
 
2.4%
628
 
2.0%
572
 
1.8%
528
 
1.7%
519
 
1.7%
494
 
1.6%
Other values (564) 23713
76.3%
Latin
ValueCountFrequency (%)
a 434
 
9.3%
e 396
 
8.4%
n 305
 
6.5%
r 299
 
6.4%
i 265
 
5.7%
o 183
 
3.9%
s 173
 
3.7%
l 168
 
3.6%
c 133
 
2.8%
t 123
 
2.6%
Other values (41) 2210
47.1%
Han
ValueCountFrequency (%)
4
 
8.9%
3
 
6.7%
3
 
6.7%
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
Other values (19) 20
44.4%
Common
ValueCountFrequency (%)
3145
58.9%
, 1039
 
19.5%
) 178
 
3.3%
- 177
 
3.3%
( 172
 
3.2%
. 162
 
3.0%
1 79
 
1.5%
2 70
 
1.3%
/ 52
 
1.0%
3 42
 
0.8%
Other values (17) 223
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31051
75.5%
ASCII 9994
 
24.3%
CJK 45
 
0.1%
None 34
 
0.1%
Compat Jamo 23
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3145
31.5%
, 1039
 
10.4%
a 434
 
4.3%
e 396
 
4.0%
n 305
 
3.1%
r 299
 
3.0%
i 265
 
2.7%
o 183
 
1.8%
) 178
 
1.8%
- 177
 
1.8%
Other values (64) 3573
35.8%
Hangul
ValueCountFrequency (%)
1126
 
3.6%
1007
 
3.2%
872
 
2.8%
867
 
2.8%
748
 
2.4%
628
 
2.0%
572
 
1.8%
528
 
1.7%
519
 
1.7%
494
 
1.6%
Other values (556) 23690
76.3%
None
ValueCountFrequency (%)
· 26
76.5%
  5
 
14.7%
2
 
5.9%
× 1
 
2.9%
Compat Jamo
ValueCountFrequency (%)
15
65.2%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
CJK
ValueCountFrequency (%)
4
 
8.9%
3
 
6.7%
3
 
6.7%
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
Other values (19) 20
44.4%
Distinct2108
Distinct (%)33.3%
Missing37
Missing (%)0.6%
Memory size49.9 KiB
2023-12-12T23:05:56.899684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length28
Mean length5.4112076
Min length1

Characters and Unicode

Total characters34280
Distinct characters645
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1398 ?
Unique (%)22.1%

Sample

1st row삼인
2nd row사계절
3rd row위즈덤하우스
4th row삼인
5th row창비
ValueCountFrequency (%)
자체제작 441
 
6.2%
244
 
3.4%
국군인쇄창 224
 
3.2%
학지사 191
 
2.7%
도서출판 97
 
1.4%
자체교재 88
 
1.2%
자체 87
 
1.2%
해군인쇄창 80
 
1.1%
제작 76
 
1.1%
다락원 66
 
0.9%
Other values (2177) 5492
77.5%
2023-12-12T23:05:57.385173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1445
 
4.2%
1048
 
3.1%
810
 
2.4%
771
 
2.2%
713
 
2.1%
703
 
2.1%
676
 
2.0%
667
 
1.9%
639
 
1.9%
590
 
1.7%
Other values (635) 26218
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30039
87.6%
Lowercase Letter 1119
 
3.3%
Uppercase Letter 1101
 
3.2%
Space Separator 816
 
2.4%
Close Punctuation 338
 
1.0%
Open Punctuation 331
 
1.0%
Dash Punctuation 252
 
0.7%
Decimal Number 122
 
0.4%
Other Punctuation 108
 
0.3%
Other Symbol 52
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1445
 
4.8%
1048
 
3.5%
771
 
2.6%
713
 
2.4%
703
 
2.3%
676
 
2.3%
667
 
2.2%
639
 
2.1%
590
 
2.0%
564
 
1.9%
Other values (563) 22223
74.0%
Lowercase Letter
ValueCountFrequency (%)
o 134
12.0%
n 117
10.5%
e 107
9.6%
s 101
 
9.0%
a 95
 
8.5%
r 91
 
8.1%
i 72
 
6.4%
g 52
 
4.6%
m 45
 
4.0%
d 45
 
4.0%
Other values (14) 260
23.2%
Uppercase Letter
ValueCountFrequency (%)
C 104
 
9.4%
O 97
 
8.8%
E 85
 
7.7%
P 78
 
7.1%
M 74
 
6.7%
A 71
 
6.4%
S 67
 
6.1%
B 58
 
5.3%
L 52
 
4.7%
R 51
 
4.6%
Other values (14) 364
33.1%
Decimal Number
ValueCountFrequency (%)
1 46
37.7%
2 36
29.5%
3 13
 
10.7%
5 10
 
8.2%
0 10
 
8.2%
9 4
 
3.3%
6 2
 
1.6%
4 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 58
53.7%
, 17
 
15.7%
& 12
 
11.1%
/ 11
 
10.2%
· 6
 
5.6%
: 4
 
3.7%
Space Separator
ValueCountFrequency (%)
810
99.3%
  6
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 332
98.2%
] 6
 
1.8%
Open Punctuation
ValueCountFrequency (%)
( 325
98.2%
[ 6
 
1.8%
Math Symbol
ValueCountFrequency (%)
| 1
50.0%
× 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 252
100.0%
Other Symbol
ValueCountFrequency (%)
52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30086
87.8%
Latin 2220
 
6.5%
Common 1969
 
5.7%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1445
 
4.8%
1048
 
3.5%
771
 
2.6%
713
 
2.4%
703
 
2.3%
676
 
2.2%
667
 
2.2%
639
 
2.1%
590
 
2.0%
564
 
1.9%
Other values (559) 22270
74.0%
Latin
ValueCountFrequency (%)
o 134
 
6.0%
n 117
 
5.3%
e 107
 
4.8%
C 104
 
4.7%
s 101
 
4.5%
O 97
 
4.4%
a 95
 
4.3%
r 91
 
4.1%
E 85
 
3.8%
P 78
 
3.5%
Other values (38) 1211
54.5%
Common
ValueCountFrequency (%)
810
41.1%
) 332
16.9%
( 325
16.5%
- 252
 
12.8%
. 58
 
2.9%
1 46
 
2.3%
2 36
 
1.8%
, 17
 
0.9%
3 13
 
0.7%
& 12
 
0.6%
Other values (13) 68
 
3.5%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30029
87.6%
ASCII 4176
 
12.2%
None 65
 
0.2%
Compat Jamo 5
 
< 0.1%
CJK 4
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1445
 
4.8%
1048
 
3.5%
771
 
2.6%
713
 
2.4%
703
 
2.3%
676
 
2.3%
667
 
2.2%
639
 
2.1%
590
 
2.0%
564
 
1.9%
Other values (554) 22213
74.0%
ASCII
ValueCountFrequency (%)
810
19.4%
) 332
 
8.0%
( 325
 
7.8%
- 252
 
6.0%
o 134
 
3.2%
n 117
 
2.8%
e 107
 
2.6%
C 104
 
2.5%
s 101
 
2.4%
O 97
 
2.3%
Other values (58) 1797
43.0%
None
ValueCountFrequency (%)
52
80.0%
· 6
 
9.2%
  6
 
9.2%
× 1
 
1.5%
Compat Jamo
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct60
Distinct (%)0.9%
Missing44
Missing (%)0.7%
Memory size49.9 KiB
2023-12-12T23:05:57.568177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.8111568
Min length1

Characters and Unicode

Total characters24117
Distinct characters14
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)0.2%

Sample

1st row2010
2nd row2007
3rd row2002
4th row2007
5th row2009
ValueCountFrequency (%)
2010 696
 
11.0%
2014 495
 
7.8%
2011 494
 
7.8%
2012 473
 
7.5%
2013 445
 
7.0%
2009 427
 
6.8%
0 394
 
6.2%
2015 363
 
5.7%
2007 313
 
5.0%
2008 290
 
4.6%
Other values (48) 1933
30.6%
2023-12-12T23:05:57.937789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8658
35.9%
2 6764
28.0%
1 4513
18.7%
9 1018
 
4.2%
4 606
 
2.5%
3 560
 
2.3%
5 531
 
2.2%
8 530
 
2.2%
7 436
 
1.8%
6 387
 
1.6%
Other values (4) 114
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24003
99.5%
Space Separator 88
 
0.4%
Dash Punctuation 24
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8658
36.1%
2 6764
28.2%
1 4513
18.8%
9 1018
 
4.2%
4 606
 
2.5%
3 560
 
2.3%
5 531
 
2.2%
8 530
 
2.2%
7 436
 
1.8%
6 387
 
1.6%
Space Separator
ValueCountFrequency (%)
83
94.3%
  5
 
5.7%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24117
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8658
35.9%
2 6764
28.0%
1 4513
18.7%
9 1018
 
4.2%
4 606
 
2.5%
3 560
 
2.3%
5 531
 
2.2%
8 530
 
2.2%
7 436
 
1.8%
6 387
 
1.6%
Other values (4) 114
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24112
> 99.9%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8658
35.9%
2 6764
28.1%
1 4513
18.7%
9 1018
 
4.2%
4 606
 
2.5%
3 560
 
2.3%
5 531
 
2.2%
8 530
 
2.2%
7 436
 
1.8%
6 387
 
1.6%
Other values (3) 109
 
0.5%
None
ValueCountFrequency (%)
  5
100.0%

Missing values

2023-12-12T23:05:54.603735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:05:54.727639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:05:54.903871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

교재명저자출판사출판년도
0한국사 마인드맵 지도자과정양근숙<NA>2010
1나를 미치게 하는 너레스카터삼인2007
2마흔의 심리학황선미사계절2002
3모모이경수위즈덤하우스2007
4아픈 영혼, 책을 만나다김영아삼인2009
5엄마를 부탁해신경숙창비2008
6우리들의 행복한 시간공지영오픈하우스2010
7유진과유진이금이푸른책들2008
8심성계발을 위한 미술치료의 이론과 실제(사)한국심성교육개발원(사)한국심성교육개발원2008
9세계 최고의 명강사를 꿈꿔라류석우씨앗을 뿌리는 사람2004
교재명저자출판사출판년도
6362한식조리기능사전명숙현능출판사2010
6363한글지도의 이론과 실제김영희(주)아침나라2010
6364역사속유물이야기플라토비플라토비2013
6365소망의 나무 1,2,3교육과학기술부평생교육진흥원2007
6366소망의 나무 교사용 지도서교육과학기술부평생교육진흥원2007
6367우쿨렐레염인정아름출판사1987
6368커피바리스타현광진한수출판사2010
6369테스트테스트테스트1900
6370하모니카정옥선태림스코어2016
6371해피바이엘심재응현대음악2001

Duplicate rows

Most frequently occurring

교재명저자출판사출판년도# duplicates
115프린트물 제공강사홈플러스평생교육스쿨013
113프린트물 제공강사홈플러스 평생교육스쿨012
114프린트물 제공강사홈플러스 평생교육스쿨20147
118프린트물제공강사홈플러스평생교육스쿨04
27동화구연의 이론과 실제이규원동화사랑20023
33미술치료의 이론과 실제김인선 조수경 외한국삼성교육개발원03
39박선생 역사교실박선생박선생창의역사지리교실20183
46산업안전보건교육한국산업안전보건공단-20123
56손뜨개 교재뜨개나무뜨개나무03
59스토리텔링과 책놀이이송은창지사20123