Overview

Dataset statistics

Number of variables: 8
Number of observations: 338
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.6 KiB
Average record size in memory: 65.4 B
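
The derived figures in this block follow from the raw counts. A minimal sketch of the arithmetic, using only the values reported above (the variable names are illustrative, and the reported total size is itself rounded):

```python
# Values copied from the dataset statistics block.
total_size_kib = 21.6
n_observations = 338
n_variables = 8
missing_cells = 0

# Average record size: total memory divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

# Missing cells (%): missing cells over all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_observations * n_variables)

print(f"{avg_record_bytes:.1f} B")  # 65.4 B
print(f"{missing_pct:.1f}%")        # 0.0%
```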

Variable types

Numeric: 1
Text: 3
DateTime: 3
Categorical: 1

Dataset

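Description: Industrial property right filing information for Korea Midland Power Co., Ltd. (한국중부발전(주)). The columns are "구분" (sequence number), "등록번호" (registration number), "등록일자" (registration date), "출원번호" (application number), "출원일자" (application date), "존속일자" (expiry date), "발명명칭" (title of invention), and "소유형태" (ownership type).
Author: Korea Midland Power Co., Ltd. (한국중부발전(주))
URL: https://www.data.go.kr/data/15004241/fileData.do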

Alerts

구분 has unique values (Unique)
출원번호 has unique values (Unique)

Reproduction

Analysis started: 2024-04-21 00:58:26.116037
Analysis finished: 2024-04-21 00:58:27.673059
Duration: 1.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
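
A report like this one can be regenerated along the following lines. This is a sketch, not the configuration that was actually used: the file paths, the report title, and the default `cp949` encoding are assumptions, and the third-party packages `pandas` and `ydata-profiling` must be installed.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile the CSV at csv_path and write an HTML report to html_path."""
    # Third-party imports are local so this module loads without them.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the three date columns so they are profiled as DateTime.
    df = pd.read_csv(
        csv_path,
        encoding=encoding,  # assumption: adjust to the file's real encoding
        parse_dates=["등록일자", "출원일자", "존속일자"],
    )
    # The title is a hypothetical label, not taken from the original report.
    report = ProfileReport(df, title="KOMIPO industrial property rights")
    report.to_file(html_path)
    return report
```

Calling `build_report("patents.csv")` with a local copy of the dataset (the file name is hypothetical) would write `report.html` to the working directory.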

Variables

구분
Real number (ℝ)

UNIQUE 

Distinct: 338
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 169.5
Minimum: 1
Maximum: 338
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 1
5th percentile: 17.85
Q1: 85.25
Median: 169.5
Q3: 253.75
95th percentile: 321.15
Maximum: 338
Range: 337
Interquartile range (IQR): 168.5

Descriptive statistics

Standard deviation: 97.716426
Coefficient of variation (CV): 0.57649809
Kurtosis: -1.2
Mean: 169.5
Median Absolute Deviation (MAD): 84.5
Skewness: 0
Sum: 57291
Variance: 9548.5
Monotonicity: Strictly increasing
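
Because 구분 is strictly increasing from 1 to 338 with 338 distinct values, it is the integer sequence 1..338, and the statistics above can be checked directly. A minimal sketch using only the Python standard library; the choice of sample variance and of the "inclusive" quantile method is an assumption, made because it reproduces the reported figures:

```python
import statistics

values = list(range(1, 339))  # 구분 as reported: the integers 1..338

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
# "inclusive" interpolates linearly between the minimum and the maximum.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(sum(values), mean, variance)  # 57291 169.5 9548.5
print(round(std, 6), round(cv, 8))  # 97.716426 0.57649809
print(q1, q3, q3 - q1, mad)         # 85.25 253.75 168.5 84.5
```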
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.3%
224 | 1 | 0.3%
232 | 1 | 0.3%
231 | 1 | 0.3%
230 | 1 | 0.3%
229 | 1 | 0.3%
228 | 1 | 0.3%
227 | 1 | 0.3%
226 | 1 | 0.3%
225 | 1 | 0.3%
Other values (328) | 328 | 97.0%
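
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values
Value | Count | Frequency (%)
338 | 1 | 0.3%
337 | 1 | 0.3%
336 | 1 | 0.3%
335 | 1 | 0.3%
334 | 1 | 0.3%
333 | 1 | 0.3%
332 | 1 | 0.3%
331 | 1 | 0.3%
330 | 1 | 0.3%
329 | 1 | 0.3%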

등록번호
Text

Distinct: 337
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 16
Median length: 16
Mean length: 16
Min length: 16

Characters and Unicode

Total characters: 5408
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
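
As an illustration of these character properties, the general category of each character can be read with Python's standard `unicodedata` module. This is a sketch on the first sample value of this variable, not the profiler's own implementation; the short codes `Nd` and `Pd` correspond to the category names "Decimal Number" and "Dash Punctuation" used in this report.

```python
import unicodedata
from collections import Counter

sample = "10-0455565-00-00"  # first sample value of this variable

# General category per character: "Nd" = Decimal Number, "Pd" = Dash Punctuation.
categories = Counter(unicodedata.category(ch) for ch in sample)

print(len(sample))       # 16
print(dict(categories))  # {'Nd': 13, 'Pd': 3}
```

With 338 values of 16 characters each, this pattern gives 338 × 16 = 5408 characters in total and 338 × 3 = 1014 dashes, matching the counts reported for this variable.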

Unique

Unique: 336
Unique (%): 99.4%
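
The report distinguishes "Distinct" (the number of different values) from "Unique" (the number of values that occur exactly once). A minimal sketch of the difference on a toy column shaped like this one, with 338 values of which one is repeated once; the helper name and the `id-…` values are illustrative:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy column: 337 different identifiers, with "id-0" appearing twice.
column = [f"id-{i}" for i in range(337)] + ["id-0"]
print(distinct_and_unique(column))  # (337, 336)
```

A single duplicated value therefore lowers Distinct by one and Unique by two relative to the row count.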

Sample

1st row: 10-0455565-00-00
2nd row: 10-0537061-00-00
3rd row: 10-0537063-00-00
4th row: 10-0561256-00-00
5th row: 10-0570281-00-00
Value | Count | Frequency (%)
10-1932857-00-00 | 2 | 0.6%
10-1895832-00-00 | 1 | 0.3%
10-1896630-00-00 | 1 | 0.3%
10-1957362-00-00 | 1 | 0.3%
10-1955851-00-00 | 1 | 0.3%
10-1942529-00-00 | 1 | 0.3%
10-1935101-00-00 | 1 | 0.3%
10-1928016-00-00 | 1 | 0.3%
10-1904201-00-00 | 1 | 0.3%
10-1901827-00-00 | 1 | 0.3%
Other values (327) | 327 | 96.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 1940 | 35.9%
- | 1014 | 18.8%
1 | 760 | 14.1%
2 | 334 | 6.2%
5 | 204 | 3.8%
8 | 203 | 3.8%
6 | 198 | 3.7%
4 | 196 | 3.6%
7 | 191 | 3.5%
3 | 185 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4394 | 81.2%
Dash Punctuation | 1014 | 18.8%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1940 | 44.2%
1 | 760 | 17.3%
2 | 334 | 7.6%
5 | 204 | 4.6%
8 | 203 | 4.6%
6 | 198 | 4.5%
4 | 196 | 4.5%
7 | 191 | 4.3%
3 | 185 | 4.2%
9 | 183 | 4.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 1014 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 5408 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 1940 | 35.9%
- | 1014 | 18.8%
1 | 760 | 14.1%
2 | 334 | 6.2%
5 | 204 | 3.8%
8 | 203 | 3.8%
6 | 198 | 3.7%
4 | 196 | 3.6%
7 | 191 | 3.5%
3 | 185 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5408 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1940 | 35.9%
- | 1014 | 18.8%
1 | 760 | 14.1%
2 | 334 | 6.2%
5 | 204 | 3.8%
8 | 203 | 3.8%
6 | 198 | 3.7%
4 | 196 | 3.6%
7 | 191 | 3.5%
3 | 185 | 3.4%

등록일자
DateTime

Distinct: 304
Distinct (%): 89.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
Minimum: 2004-10-25 00:00:00
Maximum: 2023-12-11 00:00:00
Histogram with fixed size bins (bins=50)

출원번호
Text

UNIQUE 

Distinct: 338
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 15
Median length: 15
Mean length: 14.997041
Min length: 14

Characters and Unicode

Total characters: 5069
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 338
Unique (%): 100.0%

Sample

1st row: 10-2004-0059886
2nd row: 10-2003-0097653
3rd row: 10-2003-0097654
4th row: 10-2004-0115110
5th row: 10-2004-0115109
Value | Count | Frequency (%)
10-2004-0059886 | 1 | 0.3%
10-2018-0123899 | 1 | 0.3%
10-2012-0109119 | 1 | 0.3%
10-2016-0129891 | 1 | 0.3%
10-2017-0111357 | 1 | 0.3%
10-2016-0169280 | 1 | 0.3%
10-2011-0098436 | 1 | 0.3%
10-2016-0109390 | 1 | 0.3%
10-2017-0117548 | 1 | 0.3%
10-2016-0172525 | 1 | 0.3%
Other values (328) | 328 | 97.0%

Most occurring characters

Value | Count | Frequency (%)
0 | 1525 | 30.1%
1 | 906 | 17.9%
- | 676 | 13.3%
2 | 626 | 12.3%
9 | 241 | 4.8%
7 | 199 | 3.9%
5 | 192 | 3.8%
4 | 189 | 3.7%
8 | 182 | 3.6%
3 | 170 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4393 | 86.7%
Dash Punctuation | 676 | 13.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1525 | 34.7%
1 | 906 | 20.6%
2 | 626 | 14.2%
9 | 241 | 5.5%
7 | 199 | 4.5%
5 | 192 | 4.4%
4 | 189 | 4.3%
8 | 182 | 4.1%
3 | 170 | 3.9%
6 | 163 | 3.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 676 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 5069 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 1525 | 30.1%
1 | 906 | 17.9%
- | 676 | 13.3%
2 | 626 | 12.3%
9 | 241 | 4.8%
7 | 199 | 3.9%
5 | 192 | 3.8%
4 | 189 | 3.7%
8 | 182 | 3.6%
3 | 170 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5069 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1525 | 30.1%
1 | 906 | 17.9%
- | 676 | 13.3%
2 | 626 | 12.3%
9 | 241 | 4.8%
7 | 199 | 3.9%
5 | 192 | 3.8%
4 | 189 | 3.7%
8 | 182 | 3.6%
3 | 170 | 3.4%

출원일자
DateTime

Distinct: 246
Distinct (%): 72.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
Minimum: 2003-12-26 00:00:00
Maximum: 2023-08-18 00:00:00
Histogram with fixed size bins (bins=50)

존속일자
DateTime

Distinct: 246
Distinct (%): 72.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
Minimum: 2023-12-26 00:00:00
Maximum: 2043-08-18 00:00:00
Histogram with fixed size bins (bins=50)
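
The minimum and maximum of this block (2023-12-26 and 2043-08-18) fall exactly 20 years after those of the preceding DateTime block (2003-12-26 and 2023-08-18), and both blocks have 246 distinct values. This suggests that 존속일자 is the 출원일자 plus 20 years, although the report does not state that rule. A minimal sketch that tests this assumption on the first ten rows of the Sample section; the helper `add_years` is illustrative:

```python
from datetime import date

# (구분, 출원일자, 존속일자) copied from the first ten rows of the Sample section.
rows = [
    (1, "2004-07-29", "2024-07-29"),
    (2, "2003-12-26", "2023-12-26"),
    (3, "2003-12-26", "2023-12-26"),
    (4, "2004-12-29", "2024-12-29"),
    (5, "2004-12-29", "2024-12-29"),
    (6, "2006-01-17", "2026-01-17"),
    (7, "2006-05-15", "2026-05-15"),
    (8, "2005-01-03", "2025-01-03"),
    (9, "2005-10-31", "2025-10-30"),
    (10, "2006-02-08", "2026-02-08"),
]

def add_years(d, years):
    """Shift a date by whole years, mapping 29 Feb to 28 Feb when needed."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        return d.replace(year=d.year + years, day=28)

# Collect the 구분 numbers whose expiry date is not application date + 20 years.
mismatches = [
    no
    for no, filed, expires in rows
    if add_years(date.fromisoformat(filed), 20) != date.fromisoformat(expires)
]
print(mismatches)  # [9]
```

Nine of the ten sample rows fit the assumed rule. Row 9 is one day short (2025-10-30 against 2005-10-31), so the rule is best treated as a consistency check rather than a guarantee.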

발명명칭
Text

Distinct: 328
Distinct (%): 97.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 71
Median length: 49
Mean length: 21.535503
Min length: 5

Characters and Unicode

Total characters: 7279
Distinct characters: 415
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 322
Unique (%): 95.3%

Sample

1st row: 전력수급시스템 및 예측방법
2nd row: 발전용 보일러 튜브 미끄럼 정렬금구
3rd row: 발전용 보일러 벅스테이 미끄럼 지지금구
4th row: 가스터빈 배기덕트 신축이음장치 단열매트
5th row: 시뮬레이터 통신회로 고장위치 탐지방법
ValueCountFrequency (%)
99
 
5.4%
장치 55
 
3.0%
시스템 51
 
2.8%
방법 43
 
2.3%
이용한 38
 
2.1%
24
 
1.3%
제조방법 23
 
1.2%
보일러 22
 
1.2%
이를 21
 
1.1%
발전소 16
 
0.9%
Other values (986) 1456
78.8%

Most occurring characters

ValueCountFrequency (%)
1511
 
20.8%
163
 
2.2%
162
 
2.2%
162
 
2.2%
155
 
2.1%
134
 
1.8%
130
 
1.8%
130
 
1.8%
130
 
1.8%
108
 
1.5%
Other values (405) 4494
61.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5705 | 78.4%
Space Separator | 1511 | 20.8%
Uppercase Letter | 26 | 0.4%
Other Punctuation | 16 | 0.2%
Lowercase Letter | 10 | 0.1%
Decimal Number | 6 | 0.1%
Dash Punctuation | 5 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
163
 
2.9%
162
 
2.8%
162
 
2.8%
155
 
2.7%
134
 
2.3%
130
 
2.3%
130
 
2.3%
130
 
2.3%
108
 
1.9%
105
 
1.8%
Other values (368) 4326
75.8%
Uppercase Letter
ValueCountFrequency (%)
S 2
 
7.7%
R 2
 
7.7%
2
 
7.7%
V 1
 
3.8%
1
 
3.8%
G 1
 
3.8%
H 1
 
3.8%
C 1
 
3.8%
D 1
 
3.8%
1
 
3.8%
Other values (13) 13
50.0%
Lowercase Letter
ValueCountFrequency (%)
2
20.0%
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Decimal Number
Value | Count | Frequency (%)
3 | 3 | 50.0%
2 | 2 | 33.3%
1 | 1 | 16.7%

Other Punctuation
Value | Count | Frequency (%)
, | 13 | 81.2%
/ | 3 | 18.8%

Space Separator
Value | Count | Frequency (%)
(space) | 1511 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5705 | 78.4%
Common | 1538 | 21.1%
Latin | 36 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
163
 
2.9%
162
 
2.8%
162
 
2.8%
155
 
2.7%
134
 
2.3%
130
 
2.3%
130
 
2.3%
130
 
2.3%
108
 
1.9%
105
 
1.8%
Other values (368) 4326
75.8%
Latin
ValueCountFrequency (%)
2
 
5.6%
2
 
5.6%
S 2
 
5.6%
R 2
 
5.6%
2
 
5.6%
2
 
5.6%
V 1
 
2.8%
1
 
2.8%
G 1
 
2.8%
H 1
 
2.8%
Other values (20) 20
55.6%
Common
ValueCountFrequency (%)
1511
98.2%
, 13
 
0.8%
- 5
 
0.3%
3 3
 
0.2%
/ 3
 
0.2%
2 2
 
0.1%
1 1
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5705 | 78.4%
ASCII | 1550 | 21.3%
None | 24 | 0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1511
97.5%
, 13
 
0.8%
- 5
 
0.3%
3 3
 
0.2%
/ 3
 
0.2%
2 2
 
0.1%
S 2
 
0.1%
R 2
 
0.1%
V 1
 
0.1%
1 1
 
0.1%
Other values (7) 7
 
0.5%
Hangul
ValueCountFrequency (%)
163
 
2.9%
162
 
2.8%
162
 
2.8%
155
 
2.7%
134
 
2.3%
130
 
2.3%
130
 
2.3%
130
 
2.3%
108
 
1.9%
105
 
1.8%
Other values (368) 4326
75.8%
None
ValueCountFrequency (%)
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (10) 10
41.7%

소유형태
Categorical

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
공동 (joint): 235
단독 (sole): 103

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 단독
2nd row: 단독
3rd row: 단독
4th row: 단독
5th row: 단독

Common Values

Value | Count | Frequency (%)
공동 | 235 | 69.5%
단독 | 103 | 30.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
공동 | 235 | 69.5%
단독 | 103 | 30.5%

Interactions


Correlations

Variable | 구분 | 소유형태
구분 | 1.000 | 0.359
소유형태 | 0.359 | 1.000

Variable | 구분 | 소유형태
구분 | 1.000 | 0.272
소유형태 | 0.272 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 구분 | 등록번호 | 등록일자 | 출원번호 | 출원일자 | 존속일자 | 발명명칭 | 소유형태
0 | 1 | 10-0455565-00-00 | 2004-10-25 | 10-2004-0059886 | 2004-07-29 | 2024-07-29 | 전력수급시스템 및 예측방법 | 단독
1 | 2 | 10-0537061-00-00 | 2005-12-09 | 10-2003-0097653 | 2003-12-26 | 2023-12-26 | 발전용 보일러 튜브 미끄럼 정렬금구 | 단독
2 | 3 | 10-0537063-00-00 | 2005-12-09 | 10-2003-0097654 | 2003-12-26 | 2023-12-26 | 발전용 보일러 벅스테이 미끄럼 지지금구 | 단독
3 | 4 | 10-0561256-00-00 | 2006-03-08 | 10-2004-0115110 | 2004-12-29 | 2024-12-29 | 가스터빈 배기덕트 신축이음장치 단열매트 | 단독
4 | 5 | 10-0570281-00-00 | 2006-04-05 | 10-2004-0115109 | 2004-12-29 | 2024-12-29 | 시뮬레이터 통신회로 고장위치 탐지방법 | 단독
5 | 6 | 10-0572729-00-00 | 2006-04-13 | 10-2006-0005050 | 2006-01-17 | 2026-01-17 | 난연성 및 불연성 FRP 미스트 엘리미네이터 | 공동
6 | 7 | 10-0608208-00-00 | 2006-07-26 | 10-2006-0043589 | 2006-05-15 | 2026-05-15 | 스위치 모듈과 센서를 구비한 서지 보호장치 | 공동
7 | 8 | 10-0620546-00-00 | 2006-08-29 | 10-2005-0000025 | 2005-01-03 | 2025-01-03 | 고강도 건식 재생용 이산화탄소 흡수제 | 공동
8 | 9 | 10-0707066-00-00 | 2007-04-05 | 10-2005-0103494 | 2005-10-31 | 2025-10-30 | 레이저에 의한 수중 입자성 물질 검출 장치 | 단독
9 | 10 | 10-0710409-00-00 | 2007-04-16 | 10-2006-0012204 | 2006-02-08 | 2026-02-08 | 스프레이 노즐 | 단독

Last rows
Index | 구분 | 등록번호 | 등록일자 | 출원번호 | 출원일자 | 존속일자 | 발명명칭 | 소유형태
328 | 329 | 10-2585185-00-00 | 2023-09-26 | 10-2021-0123830 | 2021-09-16 | 2041-09-16 | 화력발전소 탈질설비의 우레아기화기 막힘 조기 감지 시스템 | 단독
329 | 330 | 10-2585634-00-00 | 2023-09-27 | 10-2021-0094916 | 2021-07-20 | 2041-07-20 | 화재 감지 방법 및 시스템 | 공동
330 | 331 | 10-2590661-00-00 | 2023-10-13 | 10-2021-0063599 | 2021-05-17 | 2041-05-17 | 감시 센서를 통한 동기 조상기 또는 발전기의 엔드 영역 문제 경감 시스템 및 방법 | 공동
331 | 332 | 10-2591802-00-00 | 2023-10-17 | 10-2023-0023840 | 2023-02-22 | 2043-02-22 | 슈퍼 히팅 스팀 컨디셔닝 밸브 | 공동
332 | 333 | 10-2595475-00-00 | 2023-10-25 | 10-2021-0107942 | 2021-08-17 | 2041-08-17 | 물분해소재, 물분해소재용 조성물 및 이를 이용한 물분해소재의 제조방법 | 공동
333 | 334 | 10-2596985-00-00 | 2023-11-23 | 10-2021-0075054 | 2021-06-09 | 2041-06-09 | 사일로형 반출설비의 안전장치 | 공동
334 | 335 | 10-2601898-00-00 | 2023-11-09 | 10-2020-0180030 | 2020-12-21 | 2040-12-21 | 터빈 제어 검증 시스템, 및 터빈 제어 검증 장치 | 공동
335 | 336 | 10-2611068-00-00 | 2023-12-04 | 10-2023-0108641 | 2023-08-18 | 2043-08-18 | 온도센서 일체형 차압식 유량계 | 공동
336 | 337 | 10-2612564-00-00 | 2023-12-06 | 10-2023-0091623 | 2023-07-14 | 2043-07-14 | 가스 터빈 | 공동
337 | 338 | 10-2613767-00-00 | 2023-12-11 | 10-2021-0048536 | 2021-04-14 | 2041-04-14 | 용해성 유해 가스 분리 장치 | 공동