Overview

Dataset statistics

Number of variables6
Number of observations1828
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory87.6 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description한국수력원자력(주)의 산업재산권(특허, 실용신안, 디자인, 상표)의 연번,권리,발명명칭,국가,등록일,등록번호 현황자료입니다.
URLhttps://www.data.go.kr/data/15060699/fileData.do

Alerts

권리 is highly imbalanced (72.6%)Imbalance
국가 is highly imbalanced (68.1%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:23:26.836539
Analysis finished2023-12-12 14:23:27.683430
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1828
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean914.5
Minimum1
Maximum1828
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.2 KiB
2023-12-12T23:23:27.775295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile92.35
Q1457.75
median914.5
Q31371.25
95-th percentile1736.65
Maximum1828
Range1827
Interquartile range (IQR)913.5

Descriptive statistics

Standard deviation527.84246
Coefficient of variation (CV)0.57719242
Kurtosis-1.2
Mean914.5
Median Absolute Deviation (MAD)457
Skewness0
Sum1671706
Variance278617.67
MonotonicityStrictly increasing
2023-12-12T23:23:27.932075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1230 1
 
0.1%
1228 1
 
0.1%
1227 1
 
0.1%
1226 1
 
0.1%
1225 1
 
0.1%
1224 1
 
0.1%
1223 1
 
0.1%
1222 1
 
0.1%
1221 1
 
0.1%
Other values (1818) 1818
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1828 1
0.1%
1827 1
0.1%
1826 1
0.1%
1825 1
0.1%
1824 1
0.1%
1823 1
0.1%
1822 1
0.1%
1821 1
0.1%
1820 1
0.1%
1819 1
0.1%

권리
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
특허
1620 
상표
 
138
디자인
 
63
실용신안
 
6
`
 
1

Length

Max length4
Median length2
Mean length2.0404814
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row특허
2nd row특허
3rd row특허
4th row특허
5th row특허

Common Values

ValueCountFrequency (%)
특허 1620
88.6%
상표 138
 
7.5%
디자인 63
 
3.4%
실용신안 6
 
0.3%
` 1
 
0.1%

Length

2023-12-12T23:23:28.092306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:23:28.235317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특허 1620
88.6%
상표 138
 
7.5%
디자인 63
 
3.4%
실용신안 6
 
0.3%
1
 
0.1%
Distinct1486
Distinct (%)81.3%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
2023-12-12T23:23:28.651800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length61
Mean length26.291028
Min length3

Characters and Unicode

Total characters48060
Distinct characters648
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1341 ?
Unique (%)73.4%

Sample

1st row양수의 위치 에너지를 이용한 양수발전 효율 측정 및 이를 이용한 가이드 베인 개도량 제어 방법
2nd row태양광패널의 설치 위치 및 설치 각도 추천 방법
3rd row판쉘형 열 교환기를 포함하는 일체형 원자로
4th row초음파 검사 탐촉자
5th row해체원전 격납건물에 저장된 사용후핵연료 냉각방법
ValueCountFrequency (%)
730
 
6.2%
방법 477
 
4.1%
시스템 288
 
2.5%
이용한 254
 
2.2%
장치 242
 
2.1%
145
 
1.2%
이를 140
 
1.2%
원자력 118
 
1.0%
위한 117
 
1.0%
원자로 104
 
0.9%
Other values (3582) 9106
77.7%
2023-12-12T23:23:29.592722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9893
 
20.6%
1124
 
2.3%
871
 
1.8%
835
 
1.7%
796
 
1.7%
783
 
1.6%
748
 
1.6%
732
 
1.5%
717
 
1.5%
707
 
1.5%
Other values (638) 30854
64.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35788
74.5%
Space Separator 9893
 
20.6%
Uppercase Letter 945
 
2.0%
Decimal Number 534
 
1.1%
Lowercase Letter 241
 
0.5%
Close Punctuation 236
 
0.5%
Open Punctuation 236
 
0.5%
Other Punctuation 71
 
0.1%
Dash Punctuation 60
 
0.1%
Math Symbol 54
 
0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1124
 
3.1%
871
 
2.4%
835
 
2.3%
796
 
2.2%
783
 
2.2%
748
 
2.1%
732
 
2.0%
717
 
2.0%
707
 
2.0%
696
 
1.9%
Other values (556) 27779
77.6%
Uppercase Letter
ValueCountFrequency (%)
P 148
15.7%
A 146
15.4%
R 139
14.7%
S 70
 
7.4%
M 60
 
6.3%
E 51
 
5.4%
C 48
 
5.1%
I 36
 
3.8%
N 33
 
3.5%
O 30
 
3.2%
Other values (22) 184
19.5%
Lowercase Letter
ValueCountFrequency (%)
e 33
13.7%
o 26
 
10.8%
r 19
 
7.9%
a 17
 
7.1%
y 15
 
6.2%
i 14
 
5.8%
u 14
 
5.8%
t 14
 
5.8%
s 11
 
4.6%
n 10
 
4.1%
Other values (14) 68
28.2%
Decimal Number
ValueCountFrequency (%)
0 117
21.9%
1 113
21.2%
4 101
18.9%
2 97
18.2%
3 62
11.6%
5 16
 
3.0%
9 13
 
2.4%
7 7
 
1.3%
8 5
 
0.9%
6 2
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 34
47.9%
/ 24
33.8%
. 6
 
8.5%
: 4
 
5.6%
· 2
 
2.8%
& 1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 125
53.0%
] 111
47.0%
Open Punctuation
ValueCountFrequency (%)
( 125
53.0%
[ 111
47.0%
Space Separator
ValueCountFrequency (%)
9893
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Math Symbol
ValueCountFrequency (%)
+ 54
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35788
74.5%
Common 11086
 
23.1%
Latin 1186
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1124
 
3.1%
871
 
2.4%
835
 
2.3%
796
 
2.2%
783
 
2.2%
748
 
2.1%
732
 
2.0%
717
 
2.0%
707
 
2.0%
696
 
1.9%
Other values (556) 27779
77.6%
Latin
ValueCountFrequency (%)
P 148
 
12.5%
A 146
 
12.3%
R 139
 
11.7%
S 70
 
5.9%
M 60
 
5.1%
E 51
 
4.3%
C 48
 
4.0%
I 36
 
3.0%
e 33
 
2.8%
N 33
 
2.8%
Other values (46) 422
35.6%
Common
ValueCountFrequency (%)
9893
89.2%
) 125
 
1.1%
( 125
 
1.1%
0 117
 
1.1%
1 113
 
1.0%
] 111
 
1.0%
[ 111
 
1.0%
4 101
 
0.9%
2 97
 
0.9%
3 62
 
0.6%
Other values (16) 231
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35786
74.5%
ASCII 12257
 
25.5%
None 15
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9893
80.7%
P 148
 
1.2%
A 146
 
1.2%
R 139
 
1.1%
) 125
 
1.0%
( 125
 
1.0%
0 117
 
1.0%
1 113
 
0.9%
] 111
 
0.9%
[ 111
 
0.9%
Other values (60) 1229
 
10.0%
Hangul
ValueCountFrequency (%)
1124
 
3.1%
871
 
2.4%
835
 
2.3%
796
 
2.2%
783
 
2.2%
748
 
2.1%
732
 
2.0%
717
 
2.0%
707
 
2.0%
696
 
1.9%
Other values (555) 27777
77.6%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 2
13.3%
2
13.3%
2
13.3%
1
6.7%
1
6.7%
­ 1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
Other values (2) 2
13.3%

국가
Categorical

IMBALANCE 

Distinct25
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
국내
1428 
미국
 
119
일본
 
70
중국
 
49
유럽
 
37
Other values (20)
 
125

Length

Max length8
Median length2
Mean length2.0552516
Min length2

Unique

Unique5 ?
Unique (%)0.3%

Sample

1st row국내
2nd row국내
3rd row국내
4th row국내
5th row국내

Common Values

ValueCountFrequency (%)
국내 1428
78.1%
미국 119
 
6.5%
일본 70
 
3.8%
중국 49
 
2.7%
유럽 37
 
2.0%
프랑스 28
 
1.5%
영국 23
 
1.3%
CZ 11
 
0.6%
FI 11
 
0.6%
CH 7
 
0.4%
Other values (15) 45
 
2.5%

Length

2023-12-12T23:23:29.755665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
국내 1428
78.1%
미국 119
 
6.5%
일본 70
 
3.8%
중국 49
 
2.7%
유럽 37
 
2.0%
프랑스 28
 
1.5%
영국 23
 
1.3%
cz 11
 
0.6%
fi 11
 
0.6%
ch 7
 
0.4%
Other values (15) 45
 
2.5%
Distinct1137
Distinct (%)62.2%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
Minimum1987-01-19 00:00:00
Maximum2023-06-23 00:00:00
2023-12-12T23:23:29.883898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:23:30.021132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1824
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
2023-12-12T23:23:30.280822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length10
Mean length9.8391685
Min length4

Characters and Unicode

Total characters17986
Distinct characters45
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1820 ?
Unique (%)99.6%

Sample

1st row10-2548669
2nd row10-2548454
3rd row10-2547984
4th row10-2547983
5th row10-2547982
ValueCountFrequency (%)
영국 20
 
1.0%
프랑스 16
 
0.8%
cz 12
 
0.6%
2167750 11
 
0.6%
fi 9
 
0.5%
2 6
 
0.3%
2669896 6
 
0.3%
3291242 6
 
0.3%
ch 6
 
0.3%
3054304 6
 
0.3%
Other values (1766) 1824
94.9%
2023-12-12T23:23:30.686178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3293
18.3%
0 2939
16.3%
2 1653
9.2%
- 1428
7.9%
4 1234
 
6.9%
5 1233
 
6.9%
3 1197
 
6.7%
8 1151
 
6.4%
6 1128
 
6.3%
7 1118
 
6.2%
Other values (35) 1612
9.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15960
88.7%
Dash Punctuation 1428
 
7.9%
Other Punctuation 232
 
1.3%
Uppercase Letter 172
 
1.0%
Other Letter 97
 
0.5%
Space Separator 94
 
0.5%
Format 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
Z 57
33.1%
L 49
28.5%
C 18
 
10.5%
F 9
 
5.2%
I 9
 
5.2%
H 7
 
4.1%
P 5
 
2.9%
D 4
 
2.3%
E 4
 
2.3%
X 3
 
1.7%
Other values (6) 7
 
4.1%
Other Letter
ValueCountFrequency (%)
21
21.6%
21
21.6%
16
16.5%
16
16.5%
16
16.5%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Other values (2) 2
 
2.1%
Decimal Number
ValueCountFrequency (%)
1 3293
20.6%
0 2939
18.4%
2 1653
10.4%
4 1234
 
7.7%
5 1233
 
7.7%
3 1197
 
7.5%
8 1151
 
7.2%
6 1128
 
7.1%
7 1118
 
7.0%
9 1014
 
6.4%
Other Punctuation
ValueCountFrequency (%)
, 183
78.9%
. 46
 
19.8%
/ 3
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 1428
100.0%
Space Separator
ValueCountFrequency (%)
94
100.0%
Format
ValueCountFrequency (%)
­ 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17717
98.5%
Latin 172
 
1.0%
Hangul 97
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3293
18.6%
0 2939
16.6%
2 1653
9.3%
- 1428
8.1%
4 1234
 
7.0%
5 1233
 
7.0%
3 1197
 
6.8%
8 1151
 
6.5%
6 1128
 
6.4%
7 1118
 
6.3%
Other values (7) 1343
7.6%
Latin
ValueCountFrequency (%)
Z 57
33.1%
L 49
28.5%
C 18
 
10.5%
F 9
 
5.2%
I 9
 
5.2%
H 7
 
4.1%
P 5
 
2.9%
D 4
 
2.3%
E 4
 
2.3%
X 3
 
1.7%
Other values (6) 7
 
4.1%
Hangul
ValueCountFrequency (%)
21
21.6%
21
21.6%
16
16.5%
16
16.5%
16
16.5%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Other values (2) 2
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17887
99.4%
Hangul 97
 
0.5%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3293
18.4%
0 2939
16.4%
2 1653
9.2%
- 1428
8.0%
4 1234
 
6.9%
5 1233
 
6.9%
3 1197
 
6.7%
8 1151
 
6.4%
6 1128
 
6.3%
7 1118
 
6.3%
Other values (22) 1513
8.5%
Hangul
ValueCountFrequency (%)
21
21.6%
21
21.6%
16
16.5%
16
16.5%
16
16.5%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Other values (2) 2
 
2.1%
None
ValueCountFrequency (%)
­ 2
100.0%

Interactions

2023-12-12T23:23:27.384286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:23:30.780531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번권리국가
연번1.0000.3130.258
권리0.3131.0000.501
국가0.2580.5011.000
2023-12-12T23:23:30.862017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가권리
국가1.0000.268
권리0.2681.000
2023-12-12T23:23:30.954571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번권리국가
연번1.0000.1350.097
권리0.1351.0000.268
국가0.0970.2681.000

Missing values

2023-12-12T23:23:27.519201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:23:27.638827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번권리발명명칭국가등록일등록번호
01특허양수의 위치 에너지를 이용한 양수발전 효율 측정 및 이를 이용한 가이드 베인 개도량 제어 방법국내2023-06-2310-2548669
12특허태양광패널의 설치 위치 및 설치 각도 추천 방법국내2023-06-2210-2548454
23특허판쉘형 열 교환기를 포함하는 일체형 원자로국내2023-06-2110-2547984
34특허초음파 검사 탐촉자국내2023-06-2110-2547983
45특허해체원전 격납건물에 저장된 사용후핵연료 냉각방법국내2023-06-2110-2547982
56특허원자력 발전소의 수소 생산 시스템국내2023-06-2010-2547415
67특허작동 한계가 향상된 진동형 히트 파이프국내2023-06-2010-2547414
78특허1차계통 수화학 모사장치국내2023-06-2010-2547243
89특허핵연료 집합체의 경사면 활용 록킹 구조를 갖는 헬멧형 안내관 삽입체국내2023-06-1910-2546534
910특허취수구의 이물질 제거 시스템국내2023-06-1610-2545925
연번권리발명명칭국가등록일등록번호
18181819상표PIAT국내2004-12-0140-0601361
18191820상표국내EM국내2004-08-3041-0105074
18201821상표KALANS국내2004-07-2940-0588925
18211822특허우라늄 박판의 제조방법 및 장치와 이에 제조된 우라늄 박판남아프리카공화국2004-07-282003/09615
18221823상표신형경수로1400국내2003-03-2045-0007165
18231824상표APR1400(APR1400기술개발)국내2003-03-2045-0007166
18241825상표한국수력원자력주식회사국내2002-10-2245-0006482
18251826상표Korea Hydro & Nucle아르헨티나 Power Co., Ltd국내2002-10-2245-0006483
18261827상표KHNP국내2002-10-2245-0006484
18271828상표도안(한전 등 전력그룹사 마크-흑백)국내1987-01-1941-0006883