Overview

Dataset statistics

Number of variables: 3
Number of observations: 666
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.4 KiB
Average record size in memory: 25.2 B
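
The average record size follows from the total size and the number of observations. A minimal check, assuming 1 KiB = 1024 bytes and that the reported total is rounded to one decimal place:

```python
# Figures copied from the dataset statistics above.
total_kib = 16.4        # total size in memory, in KiB (rounded in the report)
n_observations = 666

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result agrees with the reported 25.2 B to the displayed precision.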

Variable types

Numeric: 1
Text: 1
Categorical: 1

Dataset

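Description: List of holdings of the Forest Culture Exhibition Hall (산림문화전시관) at Daea Arboretum (대아수목원), Jeonbuk Special Self-Governing Province (전북특별자치도), including item name, exhibit location, etc. This dataset can no longer be generated by the publishing institution.
Author: 전북특별자치도 (Jeonbuk Special Self-Governing Province)
URL: https://www.data.go.kr/data/15055680/fileData.do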

Alerts

연번 (serial number) is highly overall correlated with 전 시 장 소 (exhibit location) [High correlation]
전 시 장 소 (exhibit location) is highly overall correlated with 연번 (serial number) [High correlation]
연번 (serial number) has unique values [Unique]

Reproduction

Analysis started: 2024-03-14 15:39:50.801489
Analysis finished: 2024-03-14 15:39:51.849005
Duration: 1.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 666
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 333.5
Minimum: 1
Maximum: 666
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 34.25
Q1: 167.25
Median: 333.5
Q3: 499.75
95-th percentile: 632.75
Maximum: 666
Range: 665
Interquartile range (IQR): 332.5
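
The quantile figures above are consistent with linear interpolation over a serial number running from 1 to 666. A minimal sketch using only the standard library, assuming the column holds exactly the integers 1..666 (suggested by the distinct count, minimum, and maximum, but not stated outright in the report):

```python
from statistics import median, quantiles

# Assumption: 연번 is exactly the integers 1..666.
serial = list(range(1, 667))

# method="inclusive" interpolates linearly between the observed minimum and maximum.
q1, q2, q3 = quantiles(serial, n=4, method="inclusive")
ventiles = quantiles(serial, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

print(p5, q1, median(serial), q3, p95)
print("range:", max(serial) - min(serial), "IQR:", q3 - q1)
```

Under that assumption the script yields the same percentiles, range, and IQR as listed above.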

Descriptive statistics

Standard deviation: 192.40192
Coefficient of variation (CV): 0.57691731
Kurtosis: -1.2
Mean: 333.5
Median Absolute Deviation (MAD): 166.5
Skewness: 0
Sum: 222111
Variance: 37018.5
Monotonicity: Strictly increasing
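
Several of these descriptive statistics can be cross-checked with the standard library. The sketch below again assumes the column is exactly the integers 1..666; the variance matches the report only with the sample (n - 1) denominator, which is what `statistics.variance` computes:

```python
from statistics import mean, median, stdev, variance

serial = list(range(1, 667))  # assumption: 연번 is 1..666

mu = mean(serial)
var = variance(serial)        # sample variance (n - 1 denominator)
sd = stdev(serial)            # sample standard deviation
cv = sd / mu                  # coefficient of variation

# Median absolute deviation around the median.
med = median(serial)
mad = median(abs(x - med) for x in serial)

# Strict monotonicity: every value is smaller than its successor.
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(sum(serial), mu, var, round(sd, 5), round(cv, 8), mad, strictly_increasing)
```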
Histogram with fixed size bins (bins=50)
Common Values

Value | Count | Frequency (%)
1 | 1 | 0.2%
439 | 1 | 0.2%
441 | 1 | 0.2%
442 | 1 | 0.2%
443 | 1 | 0.2%
444 | 1 | 0.2%
445 | 1 | 0.2%
446 | 1 | 0.2%
447 | 1 | 0.2%
448 | 1 | 0.2%
Other values (656) | 656 | 98.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
666 | 1 | 0.2%
665 | 1 | 0.2%
664 | 1 | 0.2%
663 | 1 | 0.2%
662 | 1 | 0.2%
661 | 1 | 0.2%
660 | 1 | 0.2%
659 | 1 | 0.2%
658 | 1 | 0.2%
657 | 1 | 0.2%

작 품 명 (item name)
Text

Distinct: 656
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 5.3 KiB

Length

Max length: 23
Median length: 16
Mean length: 7.5795796
Min length: 1

Characters and Unicode

Total characters: 5048
Distinct characters: 450
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
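
As an illustration of such character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this column. It is not necessarily how the profiler computes its figures, and it covers only categories, not scripts or blocks:

```python
import unicodedata
from collections import Counter

# Sample value taken from the report's sample rows for this column.
text = "라디에타소나무(목재,나이테)"

# unicodedata.category returns the two-letter general category code, e.g.
# "Lo" (Other Letter), "Ps" (Open Punctuation), "Pe" (Close Punctuation),
# "Po" (Other Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in text)
print(category_counts)
```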

Unique

Unique: 647
Unique (%): 97.1%

Sample

1st row: 느티나무 뿌리공예품
2nd row: 라디에타소나무(목재,나이테)
3rd row: 왕벚나무(열매)
4th row: 해 당 화(뿌리)
5th row: 패랭이꽃(전초)
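
Value | Count | Frequency (%)
[missing] | 9 | 1.0%
송이버섯 | 9 | 1.0%
[missing] | 8 | 0.9%
[missing] | 6 | 0.7%
[missing] | 6 | 0.7%
성장과장 | 6 | 0.7%
[missing] | 5 | 0.6%
[missing] | 5 | 0.6%
[missing] | 4 | 0.5%
[missing] | 4 | 0.5%
Other values (749) | 809 | 92.9%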

Most occurring characters

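Value | Count | Frequency (%)
( | 382 | 7.6%
) | 382 | 7.6%
(space) | 308 | 6.1%
[missing] | 201 | 4.0%
[missing] | 174 | 3.4%
[missing] | 139 | 2.8%
[missing] | 134 | 2.7%
[missing] | 126 | 2.5%
[missing] | 112 | 2.2%
[missing] | 105 | 2.1%
Other values (440) | 2985 | 59.1%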

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3925 | 77.8%
Open Punctuation | 382 | 7.6%
Close Punctuation | 382 | 7.6%
Space Separator | 308 | 6.1%
Other Punctuation | 32 | 0.6%
Decimal Number | 16 | 0.3%
Uppercase Letter | 3 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[missing] | 201 | 5.1%
[missing] | 174 | 4.4%
[missing] | 139 | 3.5%
[missing] | 134 | 3.4%
[missing] | 126 | 3.2%
[missing] | 112 | 2.9%
[missing] | 105 | 2.7%
[missing] | 91 | 2.3%
[missing] | 70 | 1.8%
[missing] | 69 | 1.8%
Other values (424) | 2704 | 68.9%

Decimal Number
Value | Count | Frequency (%)
1 | 3 | 18.8%
4 | 3 | 18.8%
2 | 2 | 12.5%
3 | 2 | 12.5%
9 | 2 | 12.5%
5 | 1 | 6.2%
6 | 1 | 6.2%
7 | 1 | 6.2%
8 | 1 | 6.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 1 | 33.3%
E | 1 | 33.3%
T | 1 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 382 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 382 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 308 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 32 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3924 | 77.7%
Common | 1120 | 22.2%
Latin | 3 | 0.1%
Han | 1 | < 0.1%

Most frequent character per script

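Hangul
Value | Count | Frequency (%)
[missing] | 201 | 5.1%
[missing] | 174 | 4.4%
[missing] | 139 | 3.5%
[missing] | 134 | 3.4%
[missing] | 126 | 3.2%
[missing] | 112 | 2.9%
[missing] | 105 | 2.7%
[missing] | 91 | 2.3%
[missing] | 70 | 1.8%
[missing] | 69 | 1.8%
Other values (423) | 2703 | 68.9%

Common
Value | Count | Frequency (%)
( | 382 | 34.1%
) | 382 | 34.1%
(space) | 308 | 27.5%
, | 32 | 2.9%
1 | 3 | 0.3%
4 | 3 | 0.3%
2 | 2 | 0.2%
3 | 2 | 0.2%
9 | 2 | 0.2%
5 | 1 | 0.1%
Other values (3) | 3 | 0.3%

Latin
Value | Count | Frequency (%)
S | 1 | 33.3%
E | 1 | 33.3%
T | 1 | 33.3%

Han
Value | Count | Frequency (%)
[missing] | 1 | 100.0%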

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3924 | 77.7%
ASCII | 1123 | 22.2%
CJK | 1 | < 0.1%

Most frequent character per block

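ASCII
Value | Count | Frequency (%)
( | 382 | 34.0%
) | 382 | 34.0%
(space) | 308 | 27.4%
, | 32 | 2.8%
1 | 3 | 0.3%
4 | 3 | 0.3%
2 | 2 | 0.2%
3 | 2 | 0.2%
9 | 2 | 0.2%
5 | 1 | 0.1%
Other values (6) | 6 | 0.5%

Hangul
Value | Count | Frequency (%)
[missing] | 201 | 5.1%
[missing] | 174 | 4.4%
[missing] | 139 | 3.5%
[missing] | 134 | 3.4%
[missing] | 126 | 3.2%
[missing] | 112 | 2.9%
[missing] | 105 | 2.7%
[missing] | 91 | 2.3%
[missing] | 70 | 1.8%
[missing] | 69 | 1.8%
Other values (423) | 2703 | 68.9%

CJK
Value | Count | Frequency (%)
[missing] | 1 | 100.0%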

전 시 장 소 (exhibit location)
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.3 KiB

버섯 전시관(2층): 144
약용식물자원실(1층): 114
속의 곤충들(2층): 94
전시관 로비 안쪽(2층): 84
우리나라의 산림(2층): 60
Other values (9): 170

Length

Max length: 15
Median length: 14
Mean length: 11.382883
Min length: 6

Unique

Unique: 2
Unique (%): 0.3%

Sample

1st row: 로비(1층)
2nd row: 로비(1층)
3rd row: 로비(1층)
4th row: 로비(1층)
5th row: 로비(1층)

Common Values

Value | Count | Frequency (%)
버섯 전시관(2층) | 144 | 21.6%
약용식물자원실(1층) | 114 | 17.1%
속의 곤충들(2층) | 94 | 14.1%
전시관 로비 안쪽(2층) | 84 | 12.6%
우리나라의 산림(2층) | 60 | 9.0%
산림의 생성과 진화(2층) | 35 | 5.3%
병해충의 패해와 구제(2층) | 31 | 4.7%
산림의 보존(2층) | 29 | 4.4%
숲속 짐승과 새들(2층) | 25 | 3.8%
임산물의 생산과 이용(2층) | 25 | 3.8%
Other values (4) | 25 | 3.8%
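
The frequency column is each count divided by the total number of observations. A short sketch, with the counts copied from the table above (labels omitted for brevity), that reproduces the percentages and confirms the counts add up to 666:

```python
# Counts copied from the "Common Values" table; the last entry is "Other values (4)".
counts = [144, 114, 94, 84, 60, 35, 31, 29, 25, 25, 25]

total = sum(counts)
# Percentage of all observations, rounded to one decimal place as in the report.
percentages = [round(100 * c / total, 1) for c in counts]
print(total, percentages)
```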

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
버섯 | 144 | 10.3%
전시관(2층 | 144 | 10.3%
약용식물자원실(1층 | 114 | 8.1%
속의 | 94 | 6.7%
곤충들(2층 | 94 | 6.7%
전시관 | 84 | 6.0%
로비 | 84 | 6.0%
안쪽(2층 | 84 | 6.0%
산림의 | 64 | 4.6%
우리나라의 | 60 | 4.3%
Other values (20) | 435 | 31.0%

Interactions

[Figure: interaction plot]

Correlations

Variable | 연번 | 전 시 장 소
연번 | 1.000 | 0.932
전 시 장 소 | 0.932 | 1.000

Variable | 연번 | 전 시 장 소
연번 | 1.000 | 0.738
전 시 장 소 | 0.738 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Row index | 연번 | 작 품 명 | 전 시 장 소
0 | 1 | 느티나무 뿌리공예품 | 로비(1층)
1 | 2 | 라디에타소나무(목재,나이테) | 로비(1층)
2 | 3 | 왕벚나무(열매) | 로비(1층)
3 | 4 | 해 당 화(뿌리) | 로비(1층)
4 | 5 | 패랭이꽃(전초) | 로비(1층)
5 | 6 | 자귀나무(수피) | 로비(1층)
6 | 7 | 익 모 초(잎) | 로비(1층)
7 | 8 | 할 미 꽃(전초) | 로비(1층)
8 | 9 | 규화목(고생대) | 로비(1층)
9 | 10 | 규화목(중생대) | 로비(1층)

Last rows

Row index | 연번 | 작 품 명 | 전 시 장 소
656 | 657 | 노각나무(목재) | 전시관 로비 안쪽(2층)
657 | 658 | 자귀나무(목재) | 전시관 로비 안쪽(2층)
658 | 659 | 물박달나무(목재) | 전시관 로비 안쪽(2층)
659 | 660 | 소태나무(목재) | 전시관 로비 안쪽(2층)
660 | 661 | 감나무(목재) | 전시관 로비 안쪽(2층)
661 | 662 | 고욤나무(목재) | 전시관 로비 안쪽(2층)
662 | 663 | 때죽나무(목재) | 전시관 로비 안쪽(2층)
663 | 664 | 팽나무(목재) | 전시관 로비 안쪽(2층)
664 | 665 | 다릅나무(목재) | 전시관 로비 안쪽(2층)
665 | 666 | 산벚나무(목재) | 전시관 로비 안쪽(2층)