Overview

Dataset statistics

Number of variables: 10
Number of observations: 8347
Missing cells: 2207
Missing cells (%): 2.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 668.5 KiB
Average record size in memory: 82.0 B
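
The percentage and per-record figures above follow directly from the raw counts. The sketch below re-derives them using only numbers quoted in this report. Two things are assumptions: the names of the three text columns with one missing value each are inferred from the column order in the Sample table, and 1 KiB is taken to be 1024 bytes.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 10
n_observations = 8347
total_size_kib = 668.5

# Missing values per column, as listed in the variable sections.
# The three text column names are inferred from the Sample table header.
missing_by_column = {"파일명": 1, "실제파일명": 1, "파일설명": 1, "수정일자": 2204}

missing_cells = sum(missing_by_column.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumes 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(missing_cells)                 # 2207
print(f"{missing_pct:.1f}%")         # 2.6%
print(f"{avg_record_bytes:.1f} B")   # 82.0 B
```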

Variable types

Text: 4
Categorical: 2
Numeric: 2
DateTime: 2

Dataset

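Description: Data on the attachments for the overseas historic sites of the Independence Hall of Korea (독립기념관), including file type, file name, and file description.
Author: Independence Hall of Korea (독립기념관)
URL: https://www.data.go.kr/data/15067829/fileData.do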

Alerts

등록순번 is highly overall correlated with 정렬순번 [High correlation]
정렬순번 is highly overall correlated with 등록순번 [High correlation]
파일타입 is highly imbalanced (93.5%) [Imbalance]
수정일자 has 2204 (26.4%) missing values [Missing]
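
The 93.5% imbalance figure is consistent with a score defined as one minus the normalised Shannon entropy of the class counts. Whether this is the exact formula ydata-profiling uses is an assumption; the sketch below only shows that it reproduces the reported value from the 파일타입 counts given later in this report (8283 jpg, 64 png). The function name is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    This definition is an assumption that happens to match the report's figure.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# 파일타입 counts from this report: jpg = 8283, png = 64.
score = imbalance_score([8283, 64])
print(f"{100 * score:.1f}%")  # 93.5%
```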

Reproduction

Analysis started: 2023-12-12 07:42:30.134833
Analysis finished: 2023-12-12 07:42:32.254865
Duration: 2.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
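
A report like this one can be regenerated with ydata-profiling along the following lines. This is a sketch, not the configuration actually used (that lives in config.json, which is not reproduced here). The CSV path, the output file name, the report title, and the `cp949` encoding are all assumptions; the function name `build_profile` is hypothetical.

```python
from pathlib import Path


def build_profile(csv_path: str, out_html: str = "report.html",
                  encoding: str = "cp949") -> Path:
    """Profile a CSV export of the dataset and write an HTML report.

    The third-party imports are deferred so that this module can be loaded
    even where pandas or ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the two date columns named in this report so they are
    # profiled as dates rather than text.
    df = pd.read_csv(csv_path, encoding=encoding,
                     parse_dates=["등록일자", "수정일자"])

    profile = ProfileReport(df, title="Overseas historic site attachments")
    profile.to_file(out_html)
    return Path(out_html)
```

Calling `build_profile("attachments.csv")` with a local copy of the file from the URL above would write `report.html`; the file name here is only a placeholder.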

Variables

관리번호
Text

Distinct: 1253
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 65.3 KiB

Length

Max length: 12
Median length: 7
Mean length: 7.0511561
Min length: 5

Characters and Unicode

Total characters: 58856
Distinct characters: 35
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
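
The category counts reported for text variables can be approximated with Python's standard `unicodedata` module, which exposes the general category of each code point. The sketch below is not the profiler's own implementation; the helper name is hypothetical, and the two sample values are taken from this report.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in the given strings."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# Two 관리번호 values that appear in this report.
counts = category_counts(["1-01-13-0001", "UZ00007"])
print(counts["Nd"], counts["Pd"], counts["Lu"])  # 14 3 2

# The middle dot seen in some values is classed as "Other Punctuation".
print(unicodedata.category("·"))  # Po
```

`Nd`, `Pd`, `Lu`, and `Po` are the two-letter codes for Decimal Number, Dash Punctuation, Uppercase Letter, and Other Punctuation, the labels used in the tables below.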

Unique

Unique: 140
Unique (%): 1.7%

Sample

1st row: 1-01-13-0001
2nd row: 1-01-13-0001
3rd row: 1-01-13-0001
4th row: 1-01-13-0001
5th row: 1-01-13-0002
Value | Count | Frequency (%)
cn002·8 | 60 | 0.7%
cn00384 | 40 | 0.5%
1-01-13-0004 | 38 | 0.5%
cn00167 | 38 | 0.5%
cn00328 | 37 | 0.4%
cn00289 | 34 | 0.4%
cn00306 | 34 | 0.4%
cn00284 | 32 | 0.4%
ru00060 | 30 | 0.4%
cn00109 | 29 | 0.3%
Other values (1242) | 7975 | 95.5%

Most occurring characters

ValueCountFrequency (%)
0 22114
37.6%
1 4912
 
8.3%
N 3161
 
5.4%
C 3141
 
5.3%
2 2927
 
5.0%
3 2808
 
4.8%
S 2417
 
4.1%
- 2235
 
3.8%
4 1810
 
3.1%
U 1556
 
2.6%
Other values (25) 11775
20.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 41355
70.3%
Uppercase Letter 15204
 
25.8%
Dash Punctuation 2235
 
3.8%
Other Punctuation 60
 
0.1%
Space Separator 2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 3161
20.8%
C 3141
20.7%
S 2417
15.9%
U 1556
10.2%
M 1040
 
6.8%
H 883
 
5.8%
R 726
 
4.8%
X 313
 
2.1%
E 285
 
1.9%
P 280
 
1.8%
Other values (12) 1402
9.2%
Decimal Number
ValueCountFrequency (%)
0 22114
53.5%
1 4912
 
11.9%
2 2927
 
7.1%
3 2808
 
6.8%
4 1810
 
4.4%
5 1425
 
3.4%
9 1382
 
3.3%
8 1360
 
3.3%
6 1353
 
3.3%
7 1264
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 2235
100.0%
Other Punctuation
ValueCountFrequency (%)
· 60
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 43652
74.2%
Latin 15204
 
25.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 3161
20.8%
C 3141
20.7%
S 2417
15.9%
U 1556
10.2%
M 1040
 
6.8%
H 883
 
5.8%
R 726
 
4.8%
X 313
 
2.1%
E 285
 
1.9%
P 280
 
1.8%
Other values (12) 1402
9.2%
Common
ValueCountFrequency (%)
0 22114
50.7%
1 4912
 
11.3%
2 2927
 
6.7%
3 2808
 
6.4%
- 2235
 
5.1%
4 1810
 
4.1%
5 1425
 
3.3%
9 1382
 
3.2%
8 1360
 
3.1%
6 1353
 
3.1%
Other values (3) 1326
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58796
99.9%
None 60
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22114
37.6%
1 4912
 
8.4%
N 3161
 
5.4%
C 3141
 
5.3%
2 2927
 
5.0%
3 2808
 
4.8%
S 2417
 
4.1%
- 2235
 
3.8%
4 1810
 
3.1%
U 1556
 
2.6%
Other values (24) 11715
19.9%
None
ValueCountFrequency (%)
· 60
100.0%

관리구분
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 65.3 KiB
PLACE: 6697
TEXTBOOK: 1650

Length

Max length: 8
Median length: 5
Mean length: 5.5930274
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: PLACE
2nd row: PLACE
3rd row: PLACE
4th row: PLACE
5th row: PLACE

Common Values

Value | Count | Frequency (%)
PLACE | 6697 | 80.2%
TEXTBOOK | 1650 | 19.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
place | 6697 | 80.2%
textbook | 1650 | 19.8%

등록순번
Real number (ℝ)

HIGH CORRELATION 

Distinct: 48
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.1271115
Minimum: 0
Maximum: 47
Zeros: 4
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 73.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
Median: 5
Q3: 9
95-th percentile: 22
Maximum: 47
Range: 47
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 6.9645996
Coefficient of variation (CV): 0.97719806
Kurtosis: 4.8795326
Mean: 7.1271115
Median Absolute Deviation (MAD): 3
Skewness: 2.0217158
Sum: 59490
Variance: 48.505648
Monotonicity: Not monotonic
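
Several of the figures for 등록순번 are derived from others in the same block. As a quick consistency check, the sketch below recomputes them from the values reported above; small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the 등록순번 statistics in this report.
n_observations = 8347
total = 59490
std = 6.9645996
q1, q3 = 2, 9
minimum, maximum = 0, 47

mean = total / n_observations   # reported: 7.1271115
cv = std / mean                 # coefficient of variation, reported: 0.97719806
variance = std ** 2             # reported: 48.505648
iqr = q3 - q1                   # reported: 7
value_range = maximum - minimum # reported: 47

print(round(mean, 4), round(cv, 4), round(variance, 3), iqr, value_range)
```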
Histogram with fixed size bins (bins=48)
Value | Count | Frequency (%)
1 | 1185 | 14.2%
2 | 1050 | 12.6%
3 | 923 | 11.1%
4 | 791 | 9.5%
5 | 657 | 7.9%
6 | 544 | 6.5%
7 | 449 | 5.4%
8 | 380 | 4.6%
9 | 321 | 3.8%
10 | 267 | 3.2%
Other values (38) | 1780 | 21.3%

Value | Count | Frequency (%)
0 | 4 | < 0.1%
1 | 1185 | 14.2%
2 | 1050 | 12.6%
3 | 923 | 11.1%
4 | 791 | 9.5%
5 | 657 | 7.9%
6 | 544 | 6.5%
7 | 449 | 5.4%
8 | 380 | 4.6%
9 | 321 | 3.8%

Value | Count | Frequency (%)
47 | 2 | < 0.1%
46 | 2 | < 0.1%
45 | 2 | < 0.1%
44 | 3 | < 0.1%
43 | 3 | < 0.1%
42 | 4 | < 0.1%
41 | 4 | < 0.1%
40 | 5 | 0.1%
39 | 7 | 0.1%
38 | 9 | 0.1%

파일타입
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 65.3 KiB
jpg: 8283
png: 64

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: jpg
2nd row: jpg
3rd row: jpg
4th row: jpg
5th row: jpg

Common Values

Value | Count | Frequency (%)
jpg | 8283 | 99.2%
png | 64 | 0.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
jpg | 8283 | 99.2%
png | 64 | 0.8%

파일명
Text

Distinct: 8314
Distinct (%): 99.6%
Missing: 1
Missing (%): < 0.1%
Memory size: 65.3 KiB

Length

Max length: 51
Median length: 51
Mean length: 33.679367
Min length: 13

Characters and Unicode

Total characters: 281088
Distinct characters: 56
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 8307
Unique (%): 99.5%

Sample

1st row: 20180115100111_cb75efa2·8f99b85d5a8a75dd4c84ec1.jpg
2nd row: 20180115100111_8b7bd58f0ac82f1b366adb7fa8dc4766.jpg
3rd row: 20180115100111_5a609bc81cca45c53ea506a172f424c1.jpg
4th row: 20180115100112_cf3836c50efd4b194c50168d79915c8b.jpg
5th row: 201802071002·8_bfc6a6321a56f9978cd58ce7637b0425.jpg
Value | Count | Frequency (%)
hs105-001.jpg | 10 | 0.1%
es004-001.jpg | 9 | 0.1%
cn002·8-001.jpg | 7 | 0.1%
cn002·8-002.jpg | 5 | 0.1%
cn002·8-003.jpg | 4 | < 0.1%
cn002·8-005.jpg | 2 | < 0.1%
cn002·8-004.jpg | 2 | < 0.1%
20180104170134_737b28197cb6fc0c2a6dcc6f52996289.jpg | 1 | < 0.1%
kz00011-003.jpg | 1 | < 0.1%
kz00011-004.jpg | 1 | < 0.1%
Other values (8304) | 8304 | 99.5%

Most occurring characters

ValueCountFrequency (%)
0 42029
15.0%
1 27494
 
9.8%
2 21532
 
7.7%
3 14763
 
5.3%
5 12502
 
4.4%
4 12403
 
4.4%
7 12312
 
4.4%
8 12223
 
4.3%
6 12077
 
4.3%
9 10813
 
3.8%
Other values (46) 102940
36.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 178148
63.4%
Lowercase Letter 81021
28.8%
Other Punctuation 9110
 
3.2%
Uppercase Letter 4484
 
1.6%
Connector Punctuation 4383
 
1.6%
Dash Punctuation 3942
 
1.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 8939
11.0%
d 8814
10.9%
b 8794
10.9%
f 8771
10.8%
c 8769
10.8%
a 8757
10.8%
p 8359
10.3%
g 8352
10.3%
j 8291
10.2%
s 1652
 
2.0%
Other values (11) 1523
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
N 890
19.8%
C 878
19.6%
U 732
16.3%
S 542
12.1%
M 186
 
4.1%
P 159
 
3.5%
R 153
 
3.4%
J 150
 
3.3%
X 131
 
2.9%
I 123
 
2.7%
Other values (11) 540
12.0%
Decimal Number
ValueCountFrequency (%)
0 42029
23.6%
1 27494
15.4%
2 21532
12.1%
3 14763
 
8.3%
5 12502
 
7.0%
4 12403
 
7.0%
7 12312
 
6.9%
8 12223
 
6.9%
6 12077
 
6.8%
9 10813
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 8346
91.6%
· 764
 
8.4%
Connector Punctuation
ValueCountFrequency (%)
_ 4383
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3942
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 195583
69.6%
Latin 85505
30.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 8939
10.5%
d 8814
10.3%
b 8794
10.3%
f 8771
10.3%
c 8769
10.3%
a 8757
10.2%
p 8359
9.8%
g 8352
9.8%
j 8291
9.7%
s 1652
 
1.9%
Other values (32) 6007
7.0%
Common
ValueCountFrequency (%)
0 42029
21.5%
1 27494
14.1%
2 21532
11.0%
3 14763
 
7.5%
5 12502
 
6.4%
4 12403
 
6.3%
7 12312
 
6.3%
8 12223
 
6.2%
6 12077
 
6.2%
9 10813
 
5.5%
Other values (4) 17435
8.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 280324
99.7%
None 764
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 42029
15.0%
1 27494
 
9.8%
2 21532
 
7.7%
3 14763
 
5.3%
5 12502
 
4.5%
4 12403
 
4.4%
7 12312
 
4.4%
8 12223
 
4.4%
6 12077
 
4.3%
9 10813
 
3.9%
Other values (45) 102176
36.4%
None
ValueCountFrequency (%)
· 764
100.0%

실제파일명
Text

Distinct: 8104
Distinct (%): 97.1%
Missing: 1
Missing (%): < 0.1%
Memory size: 65.3 KiB

Length

Max length: 95
Median length: 90
Mean length: 21.788042
Min length: 5

Characters and Unicode

Total characters: 181843
Distinct characters: 573
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 6

Unique

Unique: 8013
Unique (%): 96.0%

Sample

1st row: 03 - 신규식거주지 남창로 100-5호 (1)(로고추가).jpg
2nd row: 03 - 신규식거주지 남창로 100-5호 (2)(로고추가).jpg
3rd row: 03 - 신규식거주지 남창로 100-5호 (3)(로고추가).jpg
4th row: 05 - 신규식 거주지 및 중국 신해혁명 주역들의 거주지 골목전경(로고추가).jpg
5th row: 10 - 팔선교 YMCA중화기독교 청년회관 (1-1)(로고추가).jpg
ValueCountFrequency (%)
1170
 
5.8%
실태조사 285
 
1.4%
17(1227 285
 
1.4%
러시아 198
 
1.0%
거주지 198
 
1.0%
대한민국임시정부 169
 
0.8%
161
 
0.8%
2)(로고추가).jpg 160
 
0.8%
1)(로고추가).jpg 153
 
0.8%
용정 144
 
0.7%
Other values (7444) 17319
85.6%

Most occurring characters

ValueCountFrequency (%)
0 25234
 
13.9%
11913
 
6.6%
. 9114
 
5.0%
p 8065
 
4.4%
g 8065
 
4.4%
j 7992
 
4.4%
- 7891
 
4.3%
1 7865
 
4.3%
( 6339
 
3.5%
) 6336
 
3.5%
Other values (563) 83029
45.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 55702
30.6%
Other Letter 46417
25.5%
Lowercase Letter 27963
15.4%
Space Separator 11913
 
6.6%
Other Punctuation 9369
 
5.2%
Uppercase Letter 9243
 
5.1%
Dash Punctuation 7891
 
4.3%
Open Punctuation 6342
 
3.5%
Close Punctuation 6339
 
3.5%
Connector Punctuation 647
 
0.4%
Other values (3) 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3242
 
7.0%
3194
 
6.9%
3149
 
6.8%
3120
 
6.7%
1142
 
2.5%
911
 
2.0%
763
 
1.6%
674
 
1.5%
669
 
1.4%
632
 
1.4%
Other values (490) 28921
62.3%
Uppercase Letter
ValueCountFrequency (%)
C 1702
18.4%
N 1492
16.1%
U 992
10.7%
S 785
8.5%
M 637
 
6.9%
D 605
 
6.5%
P 557
 
6.0%
J 538
 
5.8%
G 467
 
5.1%
R 381
 
4.1%
Other values (15) 1087
11.8%
Lowercase Letter
ValueCountFrequency (%)
p 8065
28.8%
g 8065
28.8%
j 7992
28.6%
s 1677
 
6.0%
h 812
 
2.9%
m 644
 
2.3%
e 271
 
1.0%
n 115
 
0.4%
c 39
 
0.1%
t 39
 
0.1%
Other values (14) 244
 
0.9%
Decimal Number
ValueCountFrequency (%)
0 25234
45.3%
1 7865
 
14.1%
2 5327
 
9.6%
3 3319
 
6.0%
5 2732
 
4.9%
4 2708
 
4.9%
7 2416
 
4.3%
8 2110
 
3.8%
9 2063
 
3.7%
6 1928
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 9114
97.3%
, 154
 
1.6%
· 99
 
1.1%
' 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 6339
> 99.9%
3
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 6336
> 99.9%
3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
11913
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7891
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 647
100.0%
Final Punctuation
ValueCountFrequency (%)
8
100.0%
Initial Punctuation
ValueCountFrequency (%)
8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 98220
54.0%
Hangul 46400
25.5%
Latin 37206
 
20.5%
Han 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3242
 
7.0%
3194
 
6.9%
3149
 
6.8%
3120
 
6.7%
1142
 
2.5%
911
 
2.0%
763
 
1.6%
674
 
1.5%
669
 
1.4%
632
 
1.4%
Other values (474) 28904
62.3%
Latin
ValueCountFrequency (%)
p 8065
21.7%
g 8065
21.7%
j 7992
21.5%
C 1702
 
4.6%
s 1677
 
4.5%
N 1492
 
4.0%
U 992
 
2.7%
h 812
 
2.2%
S 785
 
2.1%
m 644
 
1.7%
Other values (39) 4980
13.4%
Common
ValueCountFrequency (%)
0 25234
25.7%
11913
12.1%
. 9114
 
9.3%
- 7891
 
8.0%
1 7865
 
8.0%
( 6339
 
6.5%
) 6336
 
6.5%
2 5327
 
5.4%
3 3319
 
3.4%
5 2732
 
2.8%
Other values (14) 12150
12.4%
Han
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 135305
74.4%
Hangul 46398
 
25.5%
None 105
 
0.1%
CJK 17
 
< 0.1%
Punctuation 16
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25234
18.6%
11913
 
8.8%
. 9114
 
6.7%
p 8065
 
6.0%
g 8065
 
6.0%
j 7992
 
5.9%
- 7891
 
5.8%
1 7865
 
5.8%
( 6339
 
4.7%
) 6336
 
4.7%
Other values (58) 36491
27.0%
Hangul
ValueCountFrequency (%)
3242
 
7.0%
3194
 
6.9%
3149
 
6.8%
3120
 
6.7%
1142
 
2.5%
911
 
2.0%
763
 
1.6%
674
 
1.5%
669
 
1.4%
632
 
1.4%
Other values (472) 28902
62.3%
None
ValueCountFrequency (%)
· 99
94.3%
3
 
2.9%
3
 
2.9%
Punctuation
ValueCountFrequency (%)
8
50.0%
8
50.0%
CJK
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

파일설명
Text

Distinct: 4929
Distinct (%): 59.1%
Missing: 1
Missing (%): < 0.1%
Memory size: 65.3 KiB

Length

Max length: 940
Median length: 142
Mean length: 22.330098
Min length: 1

Characters and Unicode

Total characters: 186367
Distinct characters: 998
Distinct categories: 13
Distinct scripts: 5
Distinct blocks: 8

Unique

Unique: 3505
Unique (%): 42.0%

Sample

1st row: 신규식 거주지(2016)
2nd row: 신규식 거주지(2016)
3rd row: 신규식 거주지(2016)
4th row: 신규식 거주지 및 중국 신해혁명 주역들의 거주지 골목전경(2016)
5th row: 팔선교 중화기독교 청년회관 건물 원경(2016)
ValueCountFrequency (%)
전경 1183
 
3.0%
2019 1032
 
2.7%
건물 899
 
2.3%
내부 556
 
1.4%
405
 
1.0%
입구 310
 
0.8%
있었던 284
 
0.7%
있는 279
 
0.7%
거주지 275
 
0.7%
일대 248
 
0.6%
Other values (7506) 33434
85.9%

Most occurring characters

ValueCountFrequency (%)
30645
 
16.4%
0 6934
 
3.7%
( 6100
 
3.3%
) 6096
 
3.3%
2 6071
 
3.3%
1 4412
 
2.4%
2757
 
1.5%
2735
 
1.5%
2733
 
1.5%
2317
 
1.2%
Other values (988) 115567
62.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117178
62.9%
Space Separator 30645
 
16.4%
Decimal Number 23171
 
12.4%
Open Punctuation 6152
 
3.3%
Close Punctuation 6148
 
3.3%
Lowercase Letter 1192
 
0.6%
Other Punctuation 1164
 
0.6%
Uppercase Letter 421
 
0.2%
Dash Punctuation 194
 
0.1%
Final Punctuation 54
 
< 0.1%
Other values (3) 48
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2757
 
2.4%
2735
 
2.3%
2733
 
2.3%
2317
 
2.0%
2079
 
1.8%
1908
 
1.6%
1814
 
1.5%
1723
 
1.5%
1638
 
1.4%
1527
 
1.3%
Other values (887) 95947
81.9%
Uppercase Letter
ValueCountFrequency (%)
I 52
12.4%
C 46
 
10.9%
S 42
 
10.0%
A 34
 
8.1%
D 31
 
7.4%
M 22
 
5.2%
H 18
 
4.3%
L 18
 
4.3%
P 17
 
4.0%
Y 14
 
3.3%
Other values (25) 127
30.2%
Lowercase Letter
ValueCountFrequency (%)
e 161
13.5%
a 142
11.9%
n 106
8.9%
o 98
8.2%
l 96
8.1%
i 86
 
7.2%
t 81
 
6.8%
r 71
 
6.0%
s 63
 
5.3%
u 52
 
4.4%
Other values (16) 236
19.8%
Other Punctuation
ValueCountFrequency (%)
' 299
25.7%
. 261
22.4%
, 216
18.6%
· 191
16.4%
: 67
 
5.8%
* 64
 
5.5%
" 30
 
2.6%
/ 13
 
1.1%
\ 12
 
1.0%
& 4
 
0.3%
Other values (3) 7
 
0.6%
Decimal Number
ValueCountFrequency (%)
0 6934
29.9%
2 6071
26.2%
1 4412
19.0%
9 1778
 
7.7%
3 939
 
4.1%
7 741
 
3.2%
8 738
 
3.2%
5 658
 
2.8%
6 495
 
2.1%
4 405
 
1.7%
Open Punctuation
ValueCountFrequency (%)
( 6100
99.2%
31
 
0.5%
20
 
0.3%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 6096
99.2%
31
 
0.5%
20
 
0.3%
] 1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
50
92.6%
4
 
7.4%
Initial Punctuation
ValueCountFrequency (%)
25
86.2%
4
 
13.8%
Math Symbol
ValueCountFrequency (%)
~ 16
94.1%
1
 
5.9%
Space Separator
ValueCountFrequency (%)
30645
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 194
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 116876
62.7%
Common 67576
36.3%
Latin 1600
 
0.9%
Han 302
 
0.2%
Cyrillic 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2757
 
2.4%
2735
 
2.3%
2733
 
2.3%
2317
 
2.0%
2079
 
1.8%
1908
 
1.6%
1814
 
1.6%
1723
 
1.5%
1638
 
1.4%
1527
 
1.3%
Other values (751) 95645
81.8%
Han
ValueCountFrequency (%)
16
 
5.3%
15
 
5.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
Other values (126) 211
69.9%
Latin
ValueCountFrequency (%)
e 161
 
10.1%
a 142
 
8.9%
n 106
 
6.6%
o 98
 
6.1%
l 96
 
6.0%
i 86
 
5.4%
t 81
 
5.1%
r 71
 
4.4%
s 63
 
3.9%
I 52
 
3.2%
Other values (41) 644
40.2%
Common
ValueCountFrequency (%)
30645
45.3%
0 6934
 
10.3%
( 6100
 
9.0%
) 6096
 
9.0%
2 6071
 
9.0%
1 4412
 
6.5%
9 1778
 
2.6%
3 939
 
1.4%
7 741
 
1.1%
8 738
 
1.1%
Other values (30) 3122
 
4.6%
Cyrillic
ValueCountFrequency (%)
А 3
23.1%
Н 2
15.4%
Х 1
 
7.7%
В 1
 
7.7%
Л 1
 
7.7%
Е 1
 
7.7%
К 1
 
7.7%
С 1
 
7.7%
Д 1
 
7.7%
Р 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 116876
62.7%
ASCII 68797
36.9%
CJK 299
 
0.2%
None 295
 
0.2%
Punctuation 83
 
< 0.1%
Cyrillic 13
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30645
44.5%
0 6934
 
10.1%
( 6100
 
8.9%
) 6096
 
8.9%
2 6071
 
8.8%
1 4412
 
6.4%
9 1778
 
2.6%
3 939
 
1.4%
7 741
 
1.1%
8 738
 
1.1%
Other values (70) 4343
 
6.3%
Hangul
ValueCountFrequency (%)
2757
 
2.4%
2735
 
2.3%
2733
 
2.3%
2317
 
2.0%
2079
 
1.8%
1908
 
1.6%
1814
 
1.6%
1723
 
1.5%
1638
 
1.4%
1527
 
1.3%
Other values (751) 95645
81.8%
None
ValueCountFrequency (%)
· 191
64.7%
31
 
10.5%
31
 
10.5%
20
 
6.8%
20
 
6.8%
2
 
0.7%
Punctuation
ValueCountFrequency (%)
50
60.2%
25
30.1%
4
 
4.8%
4
 
4.8%
CJK
ValueCountFrequency (%)
16
 
5.4%
15
 
5.0%
9
 
3.0%
8
 
2.7%
8
 
2.7%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
Other values (124) 208
69.6%
Cyrillic
ValueCountFrequency (%)
А 3
23.1%
Н 2
15.4%
Х 1
 
7.7%
В 1
 
7.7%
Л 1
 
7.7%
Е 1
 
7.7%
К 1
 
7.7%
С 1
 
7.7%
Д 1
 
7.7%
Р 1
 
7.7%
CJK Compat Ideographs
ValueCountFrequency (%)
2
66.7%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%

정렬순번
Real number (ℝ)

HIGH CORRELATION 

Distinct: 49
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.2639272
Minimum: 0
Maximum: 99
Zeros: 4
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 73.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
Median: 4
Q3: 8
95-th percentile: 17
Maximum: 99
Range: 99
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 7.3295437
Coefficient of variation (CV): 1.1701196
Kurtosis: 64.663046
Mean: 6.2639272
Median Absolute Deviation (MAD): 3
Skewness: 6.0295844
Sum: 52285
Variance: 53.72221
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=49)
Value | Count | Frequency (%)
1 | 1527 | 18.3%
2 | 1028 | 12.3%
3 | 915 | 11.0%
4 | 802 | 9.6%
5 | 673 | 8.1%
6 | 575 | 6.9%
7 | 481 | 5.8%
8 | 402 | 4.8%
9 | 336 | 4.0%
10 | 265 | 3.2%
Other values (39) | 1343 | 16.1%

Value | Count | Frequency (%)
0 | 4 | < 0.1%
1 | 1527 | 18.3%
2 | 1028 | 12.3%
3 | 915 | 11.0%
4 | 802 | 9.6%
5 | 673 | 8.1%
6 | 575 | 6.9%
7 | 481 | 5.8%
8 | 402 | 4.8%
9 | 336 | 4.0%

Value | Count | Frequency (%)
99 | 20 | 0.2%
97 | 1 | < 0.1%
46 | 1 | < 0.1%
45 | 1 | < 0.1%
44 | 1 | < 0.1%
43 | 1 | < 0.1%
42 | 1 | < 0.1%
41 | 2 | < 0.1%
40 | 4 | < 0.1%
39 | 4 | < 0.1%

등록일자
Date

Distinct: 988
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 65.3 KiB
Minimum: 2013-12-27 10:59:00
Maximum: 2020-06-04 17:20:00
Histogram with fixed size bins (bins=50)

수정일자
Date

MISSING 

Distinct: 800
Distinct (%): 13.0%
Missing: 2204
Missing (%): 26.4%
Memory size: 65.3 KiB
Minimum: 2014-08-06 16:50:00
Maximum: 2020-08-21 17:22:00
Histogram with fixed size bins (bins=50)

Interactions

(Pairwise scatter plots of the numeric variables; figures not preserved.)

Correlations

 | 관리구분 | 등록순번 | 파일타입 | 정렬순번
관리구분 | 1.000 | 0.199 | 0.064 | 0.132
등록순번 | 0.199 | 1.000 | 0.000 | 0.412
파일타입 | 0.064 | 0.000 | 1.000 | 0.259
정렬순번 | 0.132 | 0.412 | 0.259 | 1.000

 | 파일타입 | 관리구분
파일타입 | 1.000 | 0.040
관리구분 | 0.040 | 1.000

 | 등록순번 | 정렬순번 | 관리구분 | 파일타입
등록순번 | 1.000 | 0.616 | 0.153 | 0.000
정렬순번 | 0.616 | 1.000 | 0.095 | 0.187
관리구분 | 0.153 | 0.095 | 1.000 | 0.040
파일타입 | 0.000 | 0.187 | 0.040 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

(index) | 관리번호 | 관리구분 | 등록순번 | 파일타입 | 파일명 | 실제파일명 | 파일설명 | 정렬순번 | 등록일자 | 수정일자
0 | 1-01-13-0001 | PLACE | 1 | jpg | 20180115100111_cb75efa2·8f99b85d5a8a75dd4c84ec1.jpg | 03 - 신규식거주지 남창로 100-5호 (1)(로고추가).jpg | 신규식 거주지(2016) | 1 | 2018-01-15 10:34 | 2019-12-31 14:05
1 | 1-01-13-0001 | PLACE | 2 | jpg | 20180115100111_8b7bd58f0ac82f1b366adb7fa8dc4766.jpg | 03 - 신규식거주지 남창로 100-5호 (2)(로고추가).jpg | 신규식 거주지(2016) | 2 | 2018-01-15 10:34 | 2019-12-31 14:05
2 | 1-01-13-0001 | PLACE | 3 | jpg | 20180115100111_5a609bc81cca45c53ea506a172f424c1.jpg | 03 - 신규식거주지 남창로 100-5호 (3)(로고추가).jpg | 신규식 거주지(2016) | 3 | 2018-01-15 10:34 | 2019-12-31 14:05
3 | 1-01-13-0001 | PLACE | 4 | jpg | 20180115100112_cf3836c50efd4b194c50168d79915c8b.jpg | 05 - 신규식 거주지 및 중국 신해혁명 주역들의 거주지 골목전경(로고추가).jpg | 신규식 거주지 및 중국 신해혁명 주역들의 거주지 골목전경(2016) | 4 | 2018-01-15 10:34 | 2019-12-31 14:05
4 | 1-01-13-0002 | PLACE | 1 | jpg | 201802071002·8_bfc6a6321a56f9978cd58ce7637b0425.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (1-1)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 원경(2016) | 1 | 2018-02-07 10:12 | 2019-12-31 14:06
5 | 1-01-13-0002 | PLACE | 2 | jpg | 201802071002·8_70e82bc352fb862·8bd4dc313c3804af.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (1-2)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 원경(2016) | 2 | 2018-02-07 10:12 | 2019-12-31 14:06
6 | 1-01-13-0002 | PLACE | 3 | jpg | 201802071002·8_c158e937c6f14b024605591118e6c324.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (3)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 입구(2016) | 3 | 2018-02-07 10:12 | 2019-12-31 14:06
7 | 1-01-13-0002 | PLACE | 4 | jpg | 201802071002·8_8e41ec5f610f46af18741804d439f8cf.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (4)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 입구 우측(2016) | 4 | 2018-02-07 10:12 | 2019-12-31 14:06
8 | 1-01-13-0002 | PLACE | 5 | jpg | 201802071002·8_778a2a90034bd9d577e1e10735f20d40.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (5)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 입구 좌측(2016) | 5 | 2018-02-07 10:12 | 2019-12-31 14:06
9 | 1-01-13-0002 | PLACE | 6 | jpg | 201802071002·8_92aa29a44d64344104f21ef146d35d33.jpg | 10 - 팔선교 YMCA중화기독교 청년회관 (6)(로고추가).jpg | 팔선교 중화기독교 청년회관 건물 내부(2016) | 6 | 2018-02-07 10:12 | 2019-12-31 14:06

(index) | 관리번호 | 관리구분 | 등록순번 | 파일타입 | 파일명 | 실제파일명 | 파일설명 | 정렬순번 | 등록일자 | 수정일자
8337 | UZ00007 | PLACE | 2 | jpg | UZ00007-002.jpg | UZ00007-002.jpg | 구 뽈리따또젤 꼴호즈 내의 푸룬제 제19호 학교 | 14 | 2013-12-27 10:59 | 2018-03-09 14:21
8338 | UZ00007 | PLACE | 3 | jpg | UZ00007-003.jpg | UZ00007-003.jpg | 구 뽈리따또젤 꼴호즈 내의 경찰서 건물(2008) | 3 | 2013-12-27 10:59 | 2018-03-09 14:21
8339 | UZ00007 | PLACE | 4 | jpg | UZ00007-004.jpg | UZ00007-004.jpg | 한인 혁명가들의 사진과 약력 등이 전시되어 있는 노력자 베테랑 건물(2008) | 4 | 2013-12-27 10:59 | 2018-03-09 14:21
8340 | UZ00007 | PLACE | 5 | jpg | UZ00007-005.jpg | UZ00007-005.jpg | 노력자 베테랑 건물 입구(2008) | 5 | 2013-12-27 10:59 | 2018-03-09 14:21
8341 | UZ00007 | PLACE | 6 | jpg | UZ00007-006.jpg | UZ00007-006.jpg | 구 뽈리따또젤 꼴호즈 문화궁전(2008) | 6 | 2013-12-27 10:59 | 2018-03-09 14:21
8342 | UZ00007 | PLACE | 7 | jpg | UZ00007-007.jpg | UZ00007-007.jpg | 구 뽈리따또젤 꼴호즈 문화궁전 | 7 | 2013-12-27 10:59 | 2018-03-09 14:21
8343 | UZ00007 | PLACE | 8 | jpg | UZ00007-008.jpg | UZ00007-008.jpg | 구 뽈리따또젤 꼴호즈 전경 | 8 | 2013-12-27 10:59 | 2018-03-09 14:21
8344 | UZ00007 | PLACE | 9 | jpg | UZ00007-009.jpg | UZ00007-009.jpg | 구 뽈리따또젤 꼴호즈 전경 | 9 | 2013-12-27 10:59 | 2018-03-09 14:21
8345 | UZ00007 | PLACE | 10 | jpg | UZ00007-010.jpg | UZ00007-010.jpg | 구 뽈리따또젤 꼴호즈 전경 | 10 | 2013-12-27 10:59 | 2018-03-09 14:21
8346 | UZ00007 | PLACE | 12 | jpg | 20171019131008_225e4b89080b5c95e22d758112bec64b.jpg | 779_타쉬켄트시온고꼴호즈문화궁전IMG_1383(로고추가).jpg | 구 뽈리따또젤 꼴호즈 내의 푸룬제 제19호 학교(2008) | 12 | 2017-10-19 13:38 | 2018-03-09 14:21