Overview

Dataset statistics

Number of variables6
Number of observations43
Missing cells43
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory54.1 B

Variable types

Numeric2
Categorical1
Text2
Unsupported1

Dataset

Description한국원자력의학원 방사선비상진료 계측장비 현황(장비명, 보유대수 등 방사선비상진료 계측장비에 관한 정보) 입니다.
Author한국원자력의학원
URLhttps://www.data.go.kr/data/15112180/fileData.do

Alerts

구분 has constant value ""Constant
번호 is highly overall correlated with 보유대수High correlation
보유대수 is highly overall correlated with 번호High correlation
Unnamed: 5 has 43 (100.0%) missing valuesMissing
번호 has unique valuesUnique
장비명 has unique valuesUnique
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 10:11:39.306553
Analysis finished2023-12-12 10:11:40.242971
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.395349
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T19:11:40.326905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q112.5
median24
Q334.5
95-th percentile42.9
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.178517
Coefficient of variation (CV)0.56329645
Kurtosis-1.2066323
Mean23.395349
Median Absolute Deviation (MAD)11
Skewness-0.040815461
Sum1006
Variance173.67331
MonotonicityStrictly increasing
2023-12-12T19:11:40.513902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
33 1
 
2.3%
34 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
11 1
2.3%
ValueCountFrequency (%)
45 1
2.3%
44 1
2.3%
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
계측장비
43 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row계측장비
2nd row계측장비
3rd row계측장비
4th row계측장비
5th row계측장비

Common Values

ValueCountFrequency (%)
계측장비 43
100.0%

Length

2023-12-12T19:11:40.660993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:40.785619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
계측장비 43
100.0%

장비명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T19:11:41.020953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length38
Mean length32.023256
Min length19

Characters and Unicode

Total characters1377
Distinct characters129
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st rowGAMMA DETECTOR (EBERLINE/FHT6020)
2nd rowRMS System(ThermoFisher/FHZ691-10)
3rd row손발오염감시기(Thermo/HFC)
4th rowLOW B.G ALPHA/BETA COUNT SYSTEM(CANBERRA/S5-XLB)
5th row오염감시기(CANBERRA/ARGOS-5AB)
ValueCountFrequency (%)
survey 7
 
6.5%
hpge 4
 
3.7%
alpha/beta 3
 
2.8%
system 3
 
2.8%
gamma 2
 
1.9%
portable 2
 
1.9%
meter 2
 
1.9%
body 2
 
1.9%
detector 2
 
1.9%
wbc 2
 
1.9%
Other values (77) 79
73.1%
2023-12-12T19:11:41.471079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 87
 
6.3%
R 75
 
5.4%
66
 
4.8%
A 59
 
4.3%
T 54
 
3.9%
/ 45
 
3.3%
M 44
 
3.2%
e 42
 
3.1%
S 41
 
3.0%
) 40
 
2.9%
Other values (119) 824
59.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 696
50.5%
Lowercase Letter 229
 
16.6%
Other Letter 155
 
11.3%
Decimal Number 87
 
6.3%
Space Separator 66
 
4.8%
Other Punctuation 47
 
3.4%
Close Punctuation 40
 
2.9%
Open Punctuation 39
 
2.8%
Dash Punctuation 18
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.7%
10
 
6.5%
8
 
5.2%
8
 
5.2%
7
 
4.5%
6
 
3.9%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (59) 88
56.8%
Uppercase Letter
ValueCountFrequency (%)
E 87
12.5%
R 75
 
10.8%
A 59
 
8.5%
T 54
 
7.8%
M 44
 
6.3%
S 41
 
5.9%
C 39
 
5.6%
N 34
 
4.9%
O 31
 
4.5%
B 30
 
4.3%
Other values (15) 202
29.0%
Lowercase Letter
ValueCountFrequency (%)
e 42
18.3%
r 27
11.8%
o 23
10.0%
t 18
7.9%
a 14
 
6.1%
m 14
 
6.1%
s 13
 
5.7%
n 12
 
5.2%
c 11
 
4.8%
h 11
 
4.8%
Other values (8) 44
19.2%
Decimal Number
ValueCountFrequency (%)
0 25
28.7%
2 20
23.0%
1 11
12.6%
4 8
 
9.2%
6 6
 
6.9%
3 5
 
5.7%
5 4
 
4.6%
8 3
 
3.4%
7 3
 
3.4%
9 2
 
2.3%
Other Punctuation
ValueCountFrequency (%)
/ 45
95.7%
. 1
 
2.1%
, 1
 
2.1%
Space Separator
ValueCountFrequency (%)
66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 925
67.2%
Common 297
 
21.6%
Hangul 155
 
11.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
7.7%
10
 
6.5%
8
 
5.2%
8
 
5.2%
7
 
4.5%
6
 
3.9%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (59) 88
56.8%
Latin
ValueCountFrequency (%)
E 87
 
9.4%
R 75
 
8.1%
A 59
 
6.4%
T 54
 
5.8%
M 44
 
4.8%
e 42
 
4.5%
S 41
 
4.4%
C 39
 
4.2%
N 34
 
3.7%
O 31
 
3.4%
Other values (33) 419
45.3%
Common
ValueCountFrequency (%)
66
22.2%
/ 45
15.2%
) 40
13.5%
( 39
13.1%
0 25
 
8.4%
2 20
 
6.7%
- 18
 
6.1%
1 11
 
3.7%
4 8
 
2.7%
6 6
 
2.0%
Other values (7) 19
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1222
88.7%
Hangul 155
 
11.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 87
 
7.1%
R 75
 
6.1%
66
 
5.4%
A 59
 
4.8%
T 54
 
4.4%
/ 45
 
3.7%
M 44
 
3.6%
e 42
 
3.4%
S 41
 
3.4%
) 40
 
3.3%
Other values (50) 669
54.7%
Hangul
ValueCountFrequency (%)
12
 
7.7%
10
 
6.5%
8
 
5.2%
8
 
5.2%
7
 
4.5%
6
 
3.9%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (59) 88
56.8%

보유대수
Real number (ℝ)

HIGH CORRELATION 

Distinct14
Distinct (%)32.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5116279
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T19:11:41.667377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q36.5
95-th percentile30.8
Maximum51
Range50
Interquartile range (IQR)5.5

Descriptive statistics

Standard deviation11.853013
Coefficient of variation (CV)1.8202841
Kurtosis7.9338534
Mean6.5116279
Median Absolute Deviation (MAD)0
Skewness2.825544
Sum280
Variance140.49391
MonotonicityNot monotonic
2023-12-12T19:11:41.780405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
1 25
58.1%
2 4
 
9.3%
20 2
 
4.7%
3 2
 
4.7%
7 1
 
2.3%
8 1
 
2.3%
17 1
 
2.3%
9 1
 
2.3%
50 1
 
2.3%
11 1
 
2.3%
Other values (4) 4
 
9.3%
ValueCountFrequency (%)
1 25
58.1%
2 4
 
9.3%
3 2
 
4.7%
6 1
 
2.3%
7 1
 
2.3%
8 1
 
2.3%
9 1
 
2.3%
10 1
 
2.3%
11 1
 
2.3%
17 1
 
2.3%
ValueCountFrequency (%)
51 1
2.3%
50 1
2.3%
32 1
2.3%
20 2
4.7%
17 1
2.3%
11 1
2.3%
10 1
2.3%
9 1
2.3%
8 1
2.3%
7 1
2.3%
Distinct32
Distinct (%)74.4%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T19:11:42.013496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length19
Mean length15.255814
Min length4

Characters and Unicode

Total characters656
Distinct characters97
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)58.1%

Sample

1st row방사선치료병동의 감마선량률 측정
2nd row분류실/수술실 등 방사선상해환자 이동구역 공간선량률 감시
3rd row방사선비상진료요원의 손발오염 감시
4th row알파/베타 핵종 방사능 측정
5th row전신외부오염 측정
ValueCountFrequency (%)
측정 18
 
12.0%
12
 
8.0%
누적선량 7
 
4.7%
내부오염 6
 
4.0%
감마선량률 5
 
3.3%
감시 5
 
3.3%
분석 5
 
3.3%
감마선 4
 
2.7%
방사선상해환자 4
 
2.7%
핵종 4
 
2.7%
Other values (52) 80
53.3%
2023-12-12T19:11:42.496567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
 
16.5%
36
 
5.5%
22
 
3.4%
21
 
3.2%
20
 
3.0%
19
 
2.9%
19
 
2.9%
17
 
2.6%
17
 
2.6%
15
 
2.3%
Other values (87) 362
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 531
80.9%
Space Separator 108
 
16.5%
Other Punctuation 7
 
1.1%
Close Punctuation 4
 
0.6%
Open Punctuation 4
 
0.6%
Uppercase Letter 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
6.8%
22
 
4.1%
21
 
4.0%
20
 
3.8%
19
 
3.6%
19
 
3.6%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
Other values (81) 330
62.1%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
M 1
50.0%
Space Separator
ValueCountFrequency (%)
108
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 531
80.9%
Common 123
 
18.8%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
6.8%
22
 
4.1%
21
 
4.0%
20
 
3.8%
19
 
3.6%
19
 
3.6%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
Other values (81) 330
62.1%
Common
ValueCountFrequency (%)
108
87.8%
/ 7
 
5.7%
) 4
 
3.3%
( 4
 
3.3%
Latin
ValueCountFrequency (%)
G 1
50.0%
M 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 531
80.9%
ASCII 125
 
19.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
108
86.4%
/ 7
 
5.6%
) 4
 
3.2%
( 4
 
3.2%
G 1
 
0.8%
M 1
 
0.8%
Hangul
ValueCountFrequency (%)
36
 
6.8%
22
 
4.1%
21
 
4.0%
20
 
3.8%
19
 
3.6%
19
 
3.6%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
Other values (81) 330
62.1%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing43
Missing (%)100.0%
Memory size519.0 B

Interactions

2023-12-12T19:11:39.789138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:39.548560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:39.884538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:39.631298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:11:42.607168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호장비명보유대수사용용도
번호1.0001.0000.1700.873
장비명1.0001.0001.0001.000
보유대수0.1701.0001.0000.000
사용용도0.8731.0000.0001.000
2023-12-12T19:11:42.722191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호보유대수
번호1.0000.502
보유대수0.5021.000

Missing values

2023-12-12T19:11:40.055292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:11:40.193003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호구분장비명보유대수사용용도Unnamed: 5
01계측장비GAMMA DETECTOR (EBERLINE/FHT6020)7방사선치료병동의 감마선량률 측정<NA>
12계측장비RMS System(ThermoFisher/FHZ691-10)1분류실/수술실 등 방사선상해환자 이동구역 공간선량률 감시<NA>
23계측장비손발오염감시기(Thermo/HFC)1방사선비상진료요원의 손발오염 감시<NA>
34계측장비LOW B.G ALPHA/BETA COUNT SYSTEM(CANBERRA/S5-XLB)1알파/베타 핵종 방사능 측정<NA>
45계측장비오염감시기(CANBERRA/ARGOS-5AB)1전신외부오염 측정<NA>
57계측장비HPGE GAMMA-RAY SPECTROSCOPY SYSTEM1감마선 방출 핵종 분석<NA>
68계측장비MOBILE SKID FOR THE WBC SYSTEM1방사선상해환자 및 방사선치료환자의 내부오염 핵종분석(의자형)<NA>
79계측장비방사능계수기(SPECTECH/ST360)8GM계수기를 이용한 감마선 계수<NA>
810계측장비저준위 액체섬광 계수기(PERKINELMER/1220)1생체시료에서의 전알파/베타 방사능 분석<NA>
911계측장비NaI GAMMER SPECTROSCOPY SYSTEM1감마선 방출 핵종 분석<NA>
번호구분장비명보유대수사용용도Unnamed: 5
3336계측장비SURVEY METER (EBERLINE/FH40G)10감마선량률 및 누적선량 측정<NA>
3437계측장비SURVEY METER(LUDLUM/44-6)6감마선량률 및 누적선량 측정<NA>
3538계측장비SURVEY METER(CANBERRA/B-81)2감마선량률 및 누적선량 측정<NA>
3639계측장비EPD(CANBERRA/MRAD101)20개인피폭 방사선량률 및 누적선량 측정<NA>
3740계측장비개인피폭선량계(THERMO/EPD-MK2)32개인피폭 방사선량률 및 누적선량 측정<NA>
3841계측장비전자개인선량계(Thermo/TruDose)51개인피폭 방사선량률 및 누적선량 측정<NA>
3942계측장비PORTABLE WHOLE BODY GAMMA MONITOR (CANBERRA/MINISENTRY)3현장기반의 외부오염확인 및 분류<NA>
4043계측장비이동식 외부오염감시기(네오시스/NRPM-2VH)1현장기반의 외부오염확인 및 분류<NA>
4144계측장비핵종분석기(THERMO/INTERCEPTER)1현장에서의 핵종분석<NA>
4245계측장비휴대용 내부오염감시기(ThermoFisher/MG3)3현장 내부피폭 감시<NA>