Overview

Dataset statistics

Number of variables4
Number of observations1147
Missing cells0
Missing cells (%)0.0%
Duplicate rows28
Duplicate rows (%)2.4%
Total size in memory36.0 KiB
Average record size in memory32.1 B

Variable types

Text2
DateTime2

Dataset

Description충청남도 천안시 내 의료기관의 진단용방사선발생장치 현황(기관명, 장비모델명, 장비검사일자 등)의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15030999/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 28 (2.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 21:15:58.094168
Analysis finished2023-12-12 21:15:58.517805
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct347
Distinct (%)30.3%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2023-12-13T06:15:58.768286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length47
Mean length10.353967
Min length2

Characters and Unicode

Total characters11876
Distinct characters78
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique184 ?
Unique (%)16.0%

Sample

1st rowHorizon Wi
2nd rowAccuRay D6
3rd rowOSCAR Prime
4th rowN1
5th rowFOREXIA-H3
ValueCountFrequency (%)
dexxum 62
 
3.4%
t 62
 
3.4%
vex-p300 51
 
2.8%
3d 50
 
2.8%
zen-2090 43
 
2.4%
pht-35lhs 41
 
2.3%
accuray 35
 
1.9%
max-gls 33
 
1.8%
pro 32
 
1.8%
point 31
 
1.7%
Other values (442) 1369
75.7%
2023-12-13T06:15:59.203995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 753
 
6.3%
- 745
 
6.3%
662
 
5.6%
X 558
 
4.7%
E 467
 
3.9%
S 431
 
3.6%
P 410
 
3.5%
R 404
 
3.4%
D 377
 
3.2%
T 365
 
3.1%
Other values (68) 6704
56.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 5567
46.9%
Lowercase Letter 2837
23.9%
Decimal Number 1982
 
16.7%
Dash Punctuation 745
 
6.3%
Space Separator 662
 
5.6%
Other Punctuation 24
 
0.2%
Letter Number 20
 
0.2%
Other Letter 18
 
0.2%
Close Punctuation 8
 
0.1%
Open Punctuation 8
 
0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
X 558
 
10.0%
E 467
 
8.4%
S 431
 
7.7%
P 410
 
7.4%
R 404
 
7.3%
D 377
 
6.8%
T 365
 
6.6%
A 346
 
6.2%
M 257
 
4.6%
O 232
 
4.2%
Other values (16) 1720
30.9%
Lowercase Letter
ValueCountFrequency (%)
o 314
11.1%
i 308
10.9%
a 273
 
9.6%
r 212
 
7.5%
t 196
 
6.9%
n 192
 
6.8%
e 191
 
6.7%
u 156
 
5.5%
m 148
 
5.2%
s 141
 
5.0%
Other values (15) 706
24.9%
Decimal Number
ValueCountFrequency (%)
0 753
38.0%
5 357
18.0%
3 250
 
12.6%
2 231
 
11.7%
6 100
 
5.0%
1 96
 
4.8%
9 77
 
3.9%
4 65
 
3.3%
8 33
 
1.7%
7 20
 
1.0%
Letter Number
ValueCountFrequency (%)
18
90.0%
1
 
5.0%
1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
/ 16
66.7%
. 7
29.2%
, 1
 
4.2%
Other Letter
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%
Close Punctuation
ValueCountFrequency (%)
) 7
87.5%
] 1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 7
87.5%
[ 1
 
12.5%
Dash Punctuation
ValueCountFrequency (%)
- 745
100.0%
Space Separator
ValueCountFrequency (%)
662
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8424
70.9%
Common 3434
28.9%
Hangul 18
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
X 558
 
6.6%
E 467
 
5.5%
S 431
 
5.1%
P 410
 
4.9%
R 404
 
4.8%
D 377
 
4.5%
T 365
 
4.3%
A 346
 
4.1%
o 314
 
3.7%
i 308
 
3.7%
Other values (44) 4444
52.8%
Common
ValueCountFrequency (%)
0 753
21.9%
- 745
21.7%
662
19.3%
5 357
10.4%
3 250
 
7.3%
2 231
 
6.7%
6 100
 
2.9%
1 96
 
2.8%
9 77
 
2.2%
4 65
 
1.9%
Other values (11) 98
 
2.9%
Hangul
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11836
99.7%
Number Forms 20
 
0.2%
Hangul 18
 
0.2%
Letterlike Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 753
 
6.4%
- 745
 
6.3%
662
 
5.6%
X 558
 
4.7%
E 467
 
3.9%
S 431
 
3.6%
P 410
 
3.5%
R 404
 
3.4%
D 377
 
3.2%
T 365
 
3.1%
Other values (61) 6664
56.3%
Number Forms
ValueCountFrequency (%)
18
90.0%
1
 
5.0%
1
 
5.0%
Hangul
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Distinct456
Distinct (%)39.8%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2023-12-13T06:15:59.720614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length8.7306016
Min length3

Characters and Unicode

Total characters10014
Distinct characters320
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)9.9%

Sample

1st row천안시서북구보건소
2nd row올리본의원
3rd row올리본의원
4th row남서울치과의원
5th row굿모닝비뇨기과의원
ValueCountFrequency (%)
단국대학교의과대학부속병원 42
 
3.6%
학교법인동은학원순천향대학교부속천안병원 38
 
3.3%
의료법인영서의료재단천안충무병원 25
 
2.2%
단국대학교치과대학치과병원 21
 
1.8%
충청남도천안의료원 16
 
1.4%
천안우리병원 13
 
1.1%
하나메디칼의원 11
 
0.9%
연세우일치과병원 10
 
0.9%
화인메트로병원 10
 
0.9%
서울대정병원 9
 
0.8%
Other values (451) 964
83.2%
2023-12-13T06:16:00.149730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1188
 
11.9%
961
 
9.6%
901
 
9.0%
531
 
5.3%
349
 
3.5%
291
 
2.9%
223
 
2.2%
186
 
1.9%
179
 
1.8%
165
 
1.6%
Other values (310) 5040
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9929
99.2%
Uppercase Letter 40
 
0.4%
Decimal Number 27
 
0.3%
Space Separator 12
 
0.1%
Dash Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1188
 
12.0%
961
 
9.7%
901
 
9.1%
531
 
5.3%
349
 
3.5%
291
 
2.9%
223
 
2.2%
186
 
1.9%
179
 
1.8%
165
 
1.7%
Other values (294) 4955
49.9%
Uppercase Letter
ValueCountFrequency (%)
S 16
40.0%
J 7
17.5%
G 7
17.5%
N 5
 
12.5%
M 2
 
5.0%
Y 2
 
5.0%
W 1
 
2.5%
Decimal Number
ValueCountFrequency (%)
2 6
22.2%
1 6
22.2%
5 5
18.5%
6 5
18.5%
3 5
18.5%
Space Separator
ValueCountFrequency (%)
12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Other Punctuation
ValueCountFrequency (%)
: 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9929
99.2%
Common 43
 
0.4%
Latin 42
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1188
 
12.0%
961
 
9.7%
901
 
9.1%
531
 
5.3%
349
 
3.5%
291
 
2.9%
223
 
2.2%
186
 
1.9%
179
 
1.8%
165
 
1.7%
Other values (294) 4955
49.9%
Latin
ValueCountFrequency (%)
S 16
38.1%
J 7
16.7%
G 7
16.7%
N 5
 
11.9%
M 2
 
4.8%
Y 2
 
4.8%
e 2
 
4.8%
W 1
 
2.4%
Common
ValueCountFrequency (%)
12
27.9%
2 6
14.0%
1 6
14.0%
5 5
11.6%
6 5
11.6%
3 5
11.6%
- 2
 
4.7%
: 2
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9929
99.2%
ASCII 85
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1188
 
12.0%
961
 
9.7%
901
 
9.1%
531
 
5.3%
349
 
3.5%
291
 
2.9%
223
 
2.2%
186
 
1.9%
179
 
1.8%
165
 
1.7%
Other values (294) 4955
49.9%
ASCII
ValueCountFrequency (%)
S 16
18.8%
12
14.1%
J 7
8.2%
G 7
8.2%
2 6
 
7.1%
1 6
 
7.1%
5 5
 
5.9%
6 5
 
5.9%
3 5
 
5.9%
N 5
 
5.9%
Other values (6) 11
12.9%
Distinct496
Distinct (%)43.2%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
Minimum2020-07-15 00:00:00
Maximum2023-08-11 00:00:00
2023-12-13T06:16:00.280228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:16:00.396307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
Minimum2023-08-23 00:00:00
Maximum2023-08-23 00:00:00
2023-12-13T06:16:00.482059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:16:00.562016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-13T06:15:58.390583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:15:58.483687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

장치모델명의료기관명장비검사일자데이터기준일자
0Horizon Wi천안시서북구보건소2023-08-012023-08-23
1AccuRay D6올리본의원2023-06-262023-08-23
2OSCAR Prime올리본의원2023-06-222023-08-23
3N1남서울치과의원2023-06-222023-08-23
4FOREXIA-H3굿모닝비뇨기과의원2023-06-052023-08-23
5Rifle-F굿모닝비뇨기과의원2023-06-052023-08-23
6VEX-S300C내이로치과의원2023-05-252023-08-23
7ZEN-2090 Turbo서울프라임병원2023-05-172023-08-23
8SIG-40-525불당내과의원2023-05-122023-08-23
9DEXXUM T Quantum불당내과의원2023-05-122023-08-23
장치모델명의료기관명장비검사일자데이터기준일자
1137MAX-GLS고려치과병원2022-12-082023-08-23
1138MAX-GLS중앙치과의원2022-07-062023-08-23
1139R-100-100김병기정형외과의원2023-04-132023-08-23
1140MAX-GLS안치과의원2023-03-082023-08-23
1141BRS-E이석훈내과의원2023-05-302023-08-23
1142MAX-GLS하얀플란트치과의원2023-03-132023-08-23
1143PANORAMAX단국대학교치과대학치과병원2021-12-172023-08-23
1144MAX-GLS문치과병원2021-06-232023-08-23
1145DY-525R-TBS-Ⅰ천안세인트요양병원2023-04-272023-08-23
1146KXO-12M-CB단국대학교치과대학치과병원2021-12-162023-08-23

Duplicate rows

Most frequently occurring

장치모델명의료기관명장비검사일자데이터기준일자# duplicates
5KODAK 2200 Intraoral X-ray System단국대학교치과대학치과병원2021-12-132023-08-233
8PROSTAR연세나무병원2023-03-082023-08-233
11Point-X더보스톤치과병원2023-03-062023-08-233
20Xcam우리가함께하는치과의원2022-12-062023-08-233
23ZEN-2090 Pro서울대정병원2023-03-092023-08-233
0ARMES 35본정형외과병원2023-04-042023-08-232
1DIOX-602아홉가지약속치과의원2020-08-182023-08-232
2ECO smart나은필병원2021-10-282023-08-232
3ELMO-MX8학교법인동은학원순천향대학교부속천안병원2022-05-252023-08-232
4GXR-C40SD천안우리병원2023-08-032023-08-232