Overview

Dataset statistics

Number of variables4
Number of observations1032
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.4 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description현충시설(국외사적지)와 관련하여 현충시설정보(비석, 탑, 동상, 기념장소 등) 를 목록으로 제공 * 제공항목 : 시설의 명칭, 설치 지역 등의 정보를 제공
Author국가보훈부
URLhttps://www.data.go.kr/data/15054476/fileData.do

Alerts

순번 is highly overall correlated with 국가High correlation
국가 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:32:07.796783
Analysis finished2023-12-12 12:32:08.688034
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1032
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean516.5
Minimum1
Maximum1032
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.2 KiB
2023-12-12T21:32:08.801034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile52.55
Q1258.75
median516.5
Q3774.25
95-th percentile980.45
Maximum1032
Range1031
Interquartile range (IQR)515.5

Descriptive statistics

Standard deviation298.05704
Coefficient of variation (CV)0.57707075
Kurtosis-1.2
Mean516.5
Median Absolute Deviation (MAD)258
Skewness0
Sum533028
Variance88838
MonotonicityStrictly increasing
2023-12-12T21:32:09.002009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
711 1
 
0.1%
681 1
 
0.1%
682 1
 
0.1%
683 1
 
0.1%
684 1
 
0.1%
685 1
 
0.1%
686 1
 
0.1%
687 1
 
0.1%
688 1
 
0.1%
Other values (1022) 1022
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1032 1
0.1%
1031 1
0.1%
1030 1
0.1%
1029 1
0.1%
1028 1
0.1%
1027 1
0.1%
1026 1
0.1%
1025 1
0.1%
1024 1
0.1%
1023 1
0.1%
Distinct968
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2023-12-12T21:32:09.331883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length13.535853
Min length3

Characters and Unicode

Total characters13969
Distinct characters504
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique921 ?
Unique (%)89.2%

Sample

1st row광저우 황포군관학교
2nd row광저우 중산대학
3rd row황포군관학교 입학예비생 교육 장소
4th row광저우 신규식 외교활동지
5th row광저우 신규식 외교활동지
ValueCountFrequency (%)
331
 
9.9%
거주지 87
 
2.6%
활동지 56
 
1.7%
본부 52
 
1.6%
38
 
1.1%
대한민국임시정부 36
 
1.1%
개최지 30
 
0.9%
상하이 29
 
0.9%
한국광복군 29
 
0.9%
호놀룰루 28
 
0.8%
Other values (1193) 2616
78.5%
2023-12-12T21:32:10.233638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2692
 
19.3%
490
 
3.5%
334
 
2.4%
304
 
2.2%
230
 
1.6%
212
 
1.5%
202
 
1.4%
197
 
1.4%
192
 
1.4%
176
 
1.3%
Other values (494) 8940
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11058
79.2%
Space Separator 2692
 
19.3%
Decimal Number 97
 
0.7%
Other Punctuation 63
 
0.5%
Uppercase Letter 43
 
0.3%
Close Punctuation 7
 
0.1%
Open Punctuation 7
 
0.1%
Initial Punctuation 1
 
< 0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
490
 
4.4%
334
 
3.0%
304
 
2.7%
230
 
2.1%
212
 
1.9%
202
 
1.8%
197
 
1.8%
192
 
1.7%
176
 
1.6%
162
 
1.5%
Other values (468) 8559
77.4%
Decimal Number
ValueCountFrequency (%)
3 29
29.9%
1 28
28.9%
2 19
19.6%
0 5
 
5.2%
8 5
 
5.2%
4 4
 
4.1%
5 4
 
4.1%
7 3
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
C 15
34.9%
D 13
30.2%
S 6
 
14.0%
O 3
 
7.0%
A 2
 
4.7%
M 2
 
4.7%
Y 2
 
4.7%
Other Punctuation
ValueCountFrequency (%)
. 42
66.7%
' 10
 
15.9%
· 10
 
15.9%
& 1
 
1.6%
Close Punctuation
ValueCountFrequency (%)
) 6
85.7%
1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 6
85.7%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
2692
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11058
79.2%
Common 2868
 
20.5%
Latin 43
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
490
 
4.4%
334
 
3.0%
304
 
2.7%
230
 
2.1%
212
 
1.9%
202
 
1.8%
197
 
1.8%
192
 
1.7%
176
 
1.6%
162
 
1.5%
Other values (468) 8559
77.4%
Common
ValueCountFrequency (%)
2692
93.9%
. 42
 
1.5%
3 29
 
1.0%
1 28
 
1.0%
2 19
 
0.7%
' 10
 
0.3%
· 10
 
0.3%
) 6
 
0.2%
( 6
 
0.2%
0 5
 
0.2%
Other values (9) 21
 
0.7%
Latin
ValueCountFrequency (%)
C 15
34.9%
D 13
30.2%
S 6
 
14.0%
O 3
 
7.0%
A 2
 
4.7%
M 2
 
4.7%
Y 2
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11055
79.1%
ASCII 2897
 
20.7%
None 12
 
0.1%
Compat Jamo 3
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2692
92.9%
. 42
 
1.4%
3 29
 
1.0%
1 28
 
1.0%
2 19
 
0.7%
C 15
 
0.5%
D 13
 
0.4%
' 10
 
0.3%
S 6
 
0.2%
) 6
 
0.2%
Other values (11) 37
 
1.3%
Hangul
ValueCountFrequency (%)
490
 
4.4%
334
 
3.0%
304
 
2.7%
230
 
2.1%
212
 
1.9%
202
 
1.8%
197
 
1.8%
192
 
1.7%
176
 
1.6%
162
 
1.5%
Other values (467) 8556
77.4%
None
ValueCountFrequency (%)
· 10
83.3%
1
 
8.3%
1
 
8.3%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct388
Distinct (%)37.6%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2023-12-12T21:32:10.738567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length6
Mean length5.7199612
Min length2

Characters and Unicode

Total characters5903
Distinct characters392
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique366 ?
Unique (%)35.5%

Sample

1st row(내용없음)
2nd row신항서로
3rd row(내용없음)
4th row동교장
5th row손문 관저 터
ValueCountFrequency (%)
내용없음 619
51.5%
18
 
1.5%
스트리트 16
 
1.3%
애비뉴 9
 
0.7%
호텔 6
 
0.5%
공동묘지 5
 
0.4%
깔레 4
 
0.3%
삼원포 4
 
0.3%
기념관 4
 
0.3%
거리 4
 
0.3%
Other values (464) 512
42.6%
2023-12-12T21:32:11.470172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
624
 
10.6%
622
 
10.5%
620
 
10.5%
( 619
 
10.5%
619
 
10.5%
) 619
 
10.5%
169
 
2.9%
54
 
0.9%
49
 
0.8%
43
 
0.7%
Other values (382) 1865
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4429
75.0%
Open Punctuation 619
 
10.5%
Close Punctuation 619
 
10.5%
Space Separator 169
 
2.9%
Decimal Number 50
 
0.8%
Uppercase Letter 12
 
0.2%
Dash Punctuation 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
624
 
14.1%
622
 
14.0%
620
 
14.0%
619
 
14.0%
54
 
1.2%
49
 
1.1%
43
 
1.0%
40
 
0.9%
39
 
0.9%
38
 
0.9%
Other values (358) 1681
38.0%
Decimal Number
ValueCountFrequency (%)
1 13
26.0%
9 5
 
10.0%
4 5
 
10.0%
6 5
 
10.0%
3 5
 
10.0%
5 4
 
8.0%
0 4
 
8.0%
2 4
 
8.0%
7 3
 
6.0%
8 2
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
S 4
33.3%
A 2
16.7%
O 2
16.7%
C 1
 
8.3%
M 1
 
8.3%
Y 1
 
8.3%
H 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 619
100.0%
Close Punctuation
ValueCountFrequency (%)
) 619
100.0%
Space Separator
ValueCountFrequency (%)
169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
> 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4429
75.0%
Common 1462
 
24.8%
Latin 12
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
624
 
14.1%
622
 
14.0%
620
 
14.0%
619
 
14.0%
54
 
1.2%
49
 
1.1%
43
 
1.0%
40
 
0.9%
39
 
0.9%
38
 
0.9%
Other values (358) 1681
38.0%
Common
ValueCountFrequency (%)
( 619
42.3%
) 619
42.3%
169
 
11.6%
1 13
 
0.9%
9 5
 
0.3%
4 5
 
0.3%
6 5
 
0.3%
3 5
 
0.3%
5 4
 
0.3%
0 4
 
0.3%
Other values (7) 14
 
1.0%
Latin
ValueCountFrequency (%)
S 4
33.3%
A 2
16.7%
O 2
16.7%
C 1
 
8.3%
M 1
 
8.3%
Y 1
 
8.3%
H 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4428
75.0%
ASCII 1473
 
25.0%
Compat Jamo 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
624
 
14.1%
622
 
14.0%
620
 
14.0%
619
 
14.0%
54
 
1.2%
49
 
1.1%
43
 
1.0%
40
 
0.9%
39
 
0.9%
38
 
0.9%
Other values (357) 1680
37.9%
ASCII
ValueCountFrequency (%)
( 619
42.0%
) 619
42.0%
169
 
11.5%
1 13
 
0.9%
9 5
 
0.3%
4 5
 
0.3%
6 5
 
0.3%
3 5
 
0.3%
5 4
 
0.3%
0 4
 
0.3%
Other values (13) 25
 
1.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

국가
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
중국
482 
미국
159 
러시아
122 
일본
69 
멕시코
53 
Other values (21)
147 

Length

Max length6
Median length2
Mean length2.3895349
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row중국
2nd row중국
3rd row중국
4th row중국
5th row중국

Common Values

ValueCountFrequency (%)
중국 482
46.7%
미국 159
 
15.4%
러시아 122
 
11.8%
일본 69
 
6.7%
멕시코 53
 
5.1%
카자흐스탄 25
 
2.4%
인도네시아 16
 
1.6%
인도 13
 
1.3%
쿠바 13
 
1.3%
프랑스 12
 
1.2%
Other values (16) 68
 
6.6%

Length

2023-12-12T21:32:11.716609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중국 483
46.8%
미국 159
 
15.4%
러시아 123
 
11.9%
일본 69
 
6.7%
멕시코 53
 
5.1%
카자흐스탄 25
 
2.4%
인도네시아 16
 
1.6%
인도 13
 
1.3%
쿠바 13
 
1.3%
프랑스 12
 
1.2%
Other values (14) 66
 
6.4%

Interactions

2023-12-12T21:32:08.333375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:32:11.862498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번국가
순번1.0000.905
국가0.9051.000
2023-12-12T21:32:11.955549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번국가
순번1.0000.617
국가0.6171.000

Missing values

2023-12-12T21:32:08.516490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:32:08.633792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사적지명사적지 기타사항국가
01광저우 황포군관학교(내용없음)중국
12광저우 중산대학신항서로중국
23황포군관학교 입학예비생 교육 장소(내용없음)중국
34광저우 신규식 외교활동지동교장중국
45광저우 신규식 외교활동지손문 관저 터중국
56대한민국임시정부 광저우 청사동산백원중국
67한국독립당 광동지부 터(내용없음)중국
78광저우 한인 비행사 교육지광동항공군사학교 터중국
89광저우 한교협회 터(내용없음)중국
910광저우 중산대학문명서로중국
순번사적지명사적지 기타사항국가
10221023파리 대한제국 공사관뒤몽 뒤르빌프랑스
10231024파리 대한제국 공사관아일라우프랑스
10241025파리 한국친우회 창립지사회 박물관프랑스
10251026파리 펠리시앙 샬레 거주지(내용없음)프랑스
10261027파리 고려통신사 사무실서영해 거주지프랑스
10271028파리 한국친우회 사무국사동발 거주지프랑스
10281029파리 조소앙 외교활동지엘리사-룩상부르 호텔프랑스
10291030파리 김법린 거주지에두아르 리스트 병원프랑스
10301031대한민국 임시정부 파리위원부 사무소(내용없음)프랑스
10311032파리 김규식 거주지(내용없음)프랑스