Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Numeric3
Categorical3
DateTime1

Dataset

Description한국보건의료인국가시험에서 최초 발급한 면허 교부 현황(일련번호,연도,직종,회차,면허승인일,면허수령지역, 성별)을 제공합니다.
URLhttps://www.data.go.kr/data/15061940/fileData.do

Alerts

일련번호 is highly overall correlated with 연도High correlation
연도 is highly overall correlated with 일련번호 and 1 other fieldsHigh correlation
회차 is highly overall correlated with 연도 and 2 other fieldsHigh correlation
직종 is highly overall correlated with 회차 and 1 other fieldsHigh correlation
성별 is highly overall correlated with 회차 and 1 other fieldsHigh correlation
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:49:34.982422
Analysis finished2023-12-12 09:49:36.811864
Duration1.83 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48012.672
Minimum10
Maximum95287
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:49:36.912693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile5230.6
Q123945.5
median48383
Q371729.75
95-th percentile90408.7
Maximum95287
Range95277
Interquartile range (IQR)47784.25

Descriptive statistics

Standard deviation27461.869
Coefficient of variation (CV)0.57197127
Kurtosis-1.2069892
Mean48012.672
Median Absolute Deviation (MAD)23819.5
Skewness-0.014855543
Sum4.8012672 × 108
Variance7.5415424 × 108
MonotonicityNot monotonic
2023-12-12T18:49:37.056457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
56347 1
 
< 0.1%
31541 1
 
< 0.1%
45353 1
 
< 0.1%
9822 1
 
< 0.1%
26007 1
 
< 0.1%
89028 1
 
< 0.1%
73164 1
 
< 0.1%
32858 1
 
< 0.1%
19100 1
 
< 0.1%
24283 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
10 1
< 0.1%
24 1
< 0.1%
30 1
< 0.1%
36 1
< 0.1%
37 1
< 0.1%
45 1
< 0.1%
54 1
< 0.1%
67 1
< 0.1%
72 1
< 0.1%
77 1
< 0.1%
ValueCountFrequency (%)
95287 1
< 0.1%
95278 1
< 0.1%
95269 1
< 0.1%
95253 1
< 0.1%
95226 1
< 0.1%
95201 1
< 0.1%
95197 1
< 0.1%
95184 1
< 0.1%
95178 1
< 0.1%
95164 1
< 0.1%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010.1732
Minimum2003
Maximum2012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:49:37.175559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2003
5-th percentile2009
Q12010
median2010
Q32011
95-th percentile2011
Maximum2012
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.78515184
Coefficient of variation (CV)0.00039058915
Kurtosis2.2166754
Mean2010.1732
Median Absolute Deviation (MAD)1
Skewness-0.36058707
Sum20101732
Variance0.61646341
MonotonicityNot monotonic
2023-12-12T18:49:37.633391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2010 4859
48.6%
2011 3049
30.5%
2009 1751
 
17.5%
2012 290
 
2.9%
2008 30
 
0.3%
2007 9
 
0.1%
2006 6
 
0.1%
2004 3
 
< 0.1%
2005 2
 
< 0.1%
2003 1
 
< 0.1%
ValueCountFrequency (%)
2003 1
 
< 0.1%
2004 3
 
< 0.1%
2005 2
 
< 0.1%
2006 6
 
0.1%
2007 9
 
0.1%
2008 30
 
0.3%
2009 1751
 
17.5%
2010 4859
48.6%
2011 3049
30.5%
2012 290
 
2.9%
ValueCountFrequency (%)
2012 290
 
2.9%
2011 3049
30.5%
2010 4859
48.6%
2009 1751
 
17.5%
2008 30
 
0.3%
2007 9
 
0.1%
2006 6
 
0.1%
2005 2
 
< 0.1%
2004 3
 
< 0.1%
2003 1
 
< 0.1%

직종
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
간호사
2770 
위생사
1048 
영양사
1009 
치과위생사
812 
물리치료사
722 
Other values (16)
3639 

Length

Max length9
Median length3
Mean length3.8827
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row치과의사
2nd row안경사
3rd row방사선사
4th row약사(4년제)
5th row영양사

Common Values

ValueCountFrequency (%)
간호사 2770
27.7%
위생사 1048
 
10.5%
영양사 1009
 
10.1%
치과위생사 812
 
8.1%
물리치료사 722
 
7.2%
의사 686
 
6.9%
방사선사 383
 
3.8%
안경사 367
 
3.7%
임상병리사 331
 
3.3%
치과기공사 330
 
3.3%
Other values (11) 1542
15.4%

Length

2023-12-12T18:49:37.785225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
간호사 2770
27.7%
위생사 1048
 
10.5%
영양사 1009
 
10.1%
치과위생사 812
 
8.1%
물리치료사 722
 
7.2%
의사 686
 
6.9%
방사선사 383
 
3.8%
안경사 367
 
3.7%
임상병리사 331
 
3.3%
치과기공사 330
 
3.3%
Other values (11) 1542
15.4%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct43
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.6705
Minimum10
Maximum76
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:49:37.942942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile22
Q133
median38
Q351
95-th percentile74
Maximum76
Range66
Interquartile range (IQR)18

Descriptive statistics

Standard deviation13.976339
Coefficient of variation (CV)0.32754103
Kurtosis0.11731161
Mean42.6705
Median Absolute Deviation (MAD)11
Skewness0.52839982
Sum426705
Variance195.33806
MonotonicityNot monotonic
2023-12-12T18:49:38.122436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
51 1323
13.2%
38 1260
12.6%
50 1228
12.3%
37 1221
12.2%
33 746
 
7.5%
31 453
 
4.5%
34 431
 
4.3%
32 407
 
4.1%
39 343
 
3.4%
75 339
 
3.4%
Other values (33) 2249
22.5%
ValueCountFrequency (%)
10 10
 
0.1%
11 31
 
0.3%
12 14
 
0.1%
14 3
 
< 0.1%
15 120
1.2%
16 158
1.6%
17 73
0.7%
18 1
 
< 0.1%
20 1
 
< 0.1%
21 5
 
0.1%
ValueCountFrequency (%)
76 38
 
0.4%
75 339
3.4%
74 319
3.2%
67 10
 
0.1%
66 76
 
0.8%
65 90
 
0.9%
64 7
 
0.1%
63 93
 
0.9%
62 229
2.3%
61 133
 
1.3%
Distinct248
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2010-01-04 00:00:00
Maximum2012-02-21 00:00:00
2023-12-12T18:49:38.312665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:38.492387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울특별시
1816 
경기도
1751 
부산광역시
745 
경상남도
726 
대구광역시
691 
Other values (16)
4271 

Length

Max length38
Median length32
Mean length4.3291
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row광주광역시
2nd row서울특별시
3rd row부산광역시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 1816
18.2%
경기도 1751
17.5%
부산광역시 745
7.4%
경상남도 726
 
7.3%
대구광역시 691
 
6.9%
경상북도 665
 
6.7%
광주광역시 556
 
5.6%
전라북도 544
 
5.4%
인천광역시 435
 
4.3%
전라남도 427
 
4.3%
Other values (11) 1644
16.4%

Length

2023-12-12T18:49:38.652010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 1816
18.1%
경기도 1751
17.5%
부산광역시 745
7.4%
경상남도 726
 
7.2%
대구광역시 691
 
6.9%
경상북도 665
 
6.6%
광주광역시 556
 
5.5%
전라북도 544
 
5.4%
인천광역시 435
 
4.3%
전라남도 427
 
4.3%
Other values (30) 1663
16.6%

성별
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
7430 
2570 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
7430
74.3%
2570
 
25.7%

Length

2023-12-12T18:49:38.785677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:49:38.879643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7430
74.3%
2570
 
25.7%

Interactions

2023-12-12T18:49:36.232625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:35.589894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:35.887221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:36.349326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:35.681754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:36.002638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:36.446342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:35.778917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:49:36.125821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:49:38.955933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호연도직종회차면허수령지역성별
일련번호1.0000.5970.4960.4700.2790.081
연도0.5971.0000.5240.4190.1290.029
직종0.4960.5241.0000.9810.3300.651
회차0.4700.4190.9811.0000.2620.645
면허수령지역0.2790.1290.3300.2621.0000.079
성별0.0810.0290.6510.6450.0791.000
2023-12-12T18:49:39.098957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
직종성별면허수령지역
직종1.0000.5790.077
성별0.5791.0000.070
면허수령지역0.0770.0701.000
2023-12-12T18:49:39.240153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호연도회차직종면허수령지역성별
일련번호1.0000.6880.0760.2070.1060.062
연도0.6881.0000.5090.2180.0450.028
회차0.0760.5091.0000.8870.0990.501
직종0.2070.2180.8871.0000.0770.579
면허수령지역0.1060.0450.0990.0771.0000.070
성별0.0620.0280.5010.5790.0701.000

Missing values

2023-12-12T18:49:36.603428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:49:36.750772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호연도직종회차면허승인일면허수령지역성별
56346563472011치과의사632011-02-23광주광역시
73308733092010안경사232011-03-08서울특별시
69835698362010방사선사382011-03-04부산광역시
22354223552010약사(4년제)612010-03-04서울특별시
79522795232011영양사342011-03-22서울특별시
46932469332010치과기공사382011-02-11광주광역시
10295102962009치과위생사372010-02-23전라북도
71183711842011한의사662011-03-07경상북도
62832628332011간호사512011-02-25서울특별시
67284672852010치과위생사382011-03-03부산광역시
일련번호연도직종회차면허승인일면허수령지역성별
179017912010의사742010-02-01경상남도
91458914592011물리치료사392012-02-17경상남도
10520105212010간호사502010-02-23제주특별자치시
11896118972010간호사502010-02-24대구광역시
38333383342010영양사332010-04-05경기도
19569195702010한의사652010-03-03경상남도
52644526452010물리치료사382011-02-21경기도
11364113652010간호사502010-02-24경상남도
901690172010간호사502010-02-23경기도
50561505622011간호사512011-02-17경상북도