Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 682 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 28.8 KiB |
Average record size in memory | 43.2 B |
Variable types
Numeric | 3 |
---|---|
Text | 1 |
Boolean | 1 |
Dataset
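Description | Answer-choice table from the survey data that the National Cancer Center (국립암센터) released through its Cancer Patient Medical Expense Support Information System, covering data through September 2019. It contains survey information and related details. |
---|---|
Author | National Cancer Center (국립암센터) |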
URL | https://www.data.go.kr/data/15049636/fileData.do |
Alerts
여부 is highly imbalanced (67.2%) | Imbalance |
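The 67.2% in this alert can be reproduced from the class counts of 여부 (641 False, 41 True). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, divided by the maximum entropy for that number of classes. The function name `imbalance_score` is illustrative and does not come from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max, where H is the base-2 Shannon entropy of the counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    Assumption: this mirrors the score behind the report's imbalance alert.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts of 여부 taken from the report: 641 False, 41 True.
score = imbalance_score([641, 41])
print(f"{score:.1%}")  # 67.2%
```

A perfectly balanced column, for example `imbalance_score([50, 50])`, scores 0.0 under this definition.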
Reproduction
Analysis started | 2023-12-12 14:58:24.976878 |
---|---|
Analysis finished | 2023-12-12 14:58:26.704122 |
Duration | 1.73 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
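The reported duration follows directly from the two timestamps above. As a quick check, a minimal sketch using only the standard library (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

# Timestamp layout assumed from the "Analysis started/finished" rows.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 14:58:24.976878", FMT)
finished = datetime.strptime("2023-12-12 14:58:26.704122", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.73 seconds
```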
설문번호 (survey number)
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.3973607 |
Minimum | 6 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.1 KiB |
Quantile statistics
Minimum | 6 |
---|---|
5-th percentile | 7 |
Q1 | 7 |
median | 8 |
Q3 | 8 |
95-th percentile | 13 |
Maximum | 13 |
Range | 7 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.8174045 |
---|---|
Coefficient of variation (CV) | 0.21642568 |
Kurtosis | 1.0216346 |
Mean | 8.3973607 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.5193699 |
Sum | 5727 |
Variance | 3.3029593 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
8 | 287 | 42.1% |
7 | 246 | 36.1% |
13 | 45 | 6.6% |
12 | 45 | 6.6% |
10 | 20 | 2.9% |
9 | 20 | 2.9% |
11 | 18 | 2.6% |
6 | 1 | 0.1% |
Minimum 10 values
Value | Count | Frequency (%) |
6 | 1 | 0.1% |
7 | 246 | 36.1% |
8 | 287 | 42.1% |
9 | 20 | 2.9% |
10 | 20 | 2.9% |
11 | 18 | 2.6% |
12 | 45 | 6.6% |
13 | 45 | 6.6% |
Maximum 10 values
Value | Count | Frequency (%) |
13 | 45 | 6.6% |
12 | 45 | 6.6% |
11 | 18 | 2.6% |
10 | 20 | 2.9% |
9 | 20 | 2.9% |
8 | 287 | 42.1% |
7 | 246 | 36.1% |
6 | 1 | 0.1% |
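The descriptive statistics for 설문번호 can be recomputed from its frequency table alone. The sketch below expands the value counts back into 682 observations and uses the sample variance (n - 1 denominator); that choice is an assumption, made because it appears to agree with the reported variance of 3.3029593. Results should match the report up to rounding.

```python
import statistics

# Value -> count for 설문번호, copied from the frequency table.
counts = {6: 1, 7: 246, 8: 287, 9: 20, 10: 20, 11: 18, 12: 45, 13: 45}

# Expand the table back into one value per observation.
values = [v for v, c in counts.items() for _ in range(c)]

n = len(values)                         # number of observations
total = sum(values)                     # "Sum" in the report
mean = total / n
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation

print(n, total)
print(round(mean, 7), round(variance, 7), round(std, 7), round(cv, 8))
```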
예시번호 (answer-option number)
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.8768328 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 4 |
95-th percentile | 6 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.698083 |
---|---|
Coefficient of variation (CV) | 0.59026128 |
Kurtosis | 1.9704371 |
Mean | 2.8768328 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.2112025 |
Sum | 1962 |
Variance | 2.883486 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1 | 166 | 24.3% |
2 | 151 | 22.1% |
3 | 146 | 21.4% |
4 | 139 | 20.4% |
5 | 40 | 5.9% |
7 | 10 | 1.5% |
9 | 10 | 1.5% |
8 | 10 | 1.5% |
6 | 10 | 1.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 166 | 24.3% |
2 | 151 | 22.1% |
3 | 146 | 21.4% |
4 | 139 | 20.4% |
5 | 40 | 5.9% |
6 | 10 | 1.5% |
7 | 10 | 1.5% |
8 | 10 | 1.5% |
9 | 10 | 1.5% |
Maximum 10 values
Value | Count | Frequency (%) |
9 | 10 | 1.5% |
8 | 10 | 1.5% |
7 | 10 | 1.5% |
6 | 10 | 1.5% |
5 | 40 | 5.9% |
4 | 139 | 20.4% |
3 | 146 | 21.4% |
2 | 151 | 22.1% |
1 | 166 | 24.3% |
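The quantile statistics for 예시번호 can likewise be checked against its value counts. This sketch assumes linear interpolation between order statistics, which `statistics.quantiles` provides through `method="inclusive"`; the report does not state which interpolation it uses, so treat that as an assumption.

```python
import statistics

# Value -> count for 예시번호, copied from the frequency table.
counts = {1: 166, 2: 151, 3: 146, 4: 139, 5: 40, 6: 10, 7: 10, 8: 10, 9: 10}
values = sorted(v for v, c in counts.items() for _ in range(c))

# Quartiles: cut points at 25%, 50% and 75%.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Twentieths: the first and last cut points are the 5th and 95th percentiles.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(p5, q1, median, q3, p95, iqr, mad)
```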
질문번호 (question number)
Real number (ℝ)
Distinct | 78 |
---|---|
Distinct (%) | 11.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 30.97654 |
Minimum | 1 |
---|---|
Maximum | 78 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 6 |
median | 29 |
Q3 | 50 |
95-th percentile | 69 |
Maximum | 78 |
Range | 77 |
Interquartile range (IQR) | 44 |
Descriptive statistics
Standard deviation | 23.04333 |
---|---|
Coefficient of variation (CV) | 0.74389619 |
Kurtosis | -1.2168972 |
Mean | 30.97654 |
Median Absolute Deviation (MAD) | 22 |
Skewness | 0.27605092 |
Sum | 21126 |
Variance | 530.99504 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
4 | 40 | 5.9% |
5 | 40 | 5.9% |
3 | 39 | 5.7% |
2 | 37 | 5.4% |
55 | 10 | 1.5% |
56 | 10 | 1.5% |
57 | 10 | 1.5% |
53 | 10 | 1.5% |
1 | 9 | 1.3% |
64 | 9 | 1.3% |
Other values (68) | 468 | 68.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 9 | 1.3% |
2 | 37 | 5.4% |
3 | 39 | 5.7% |
4 | 40 | 5.9% |
5 | 40 | 5.9% |
6 | 9 | 1.3% |
7 | 9 | 1.3% |
8 | 3 | 0.4% |
9 | 5 | 0.7% |
10 | 5 | 0.7% |
Maximum 10 values
Value | Count | Frequency (%) |
78 | 1 | 0.1% |
77 | 5 | 0.7% |
76 | 4 | 0.6% |
75 | 3 | 0.4% |
74 | 4 | 0.6% |
73 | 4 | 0.6% |
72 | 4 | 0.6% |
71 | 4 | 0.6% |
70 | 4 | 0.6% |
69 | 4 | 0.6% |
답변 (answer)
Text
Distinct | 127 |
---|---|
Distinct (%) | 18.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 KiB |
Words
Value | Count | Frequency (%) |
매우 | 195 | 15.0% |
그렇다 | 181 | 13.9% |
아니다 | 153 | 11.7% |
않다 | 42 | 3.2% |
그렇지 | 29 | 2.2% |
필요 | 24 | 1.8% |
불필요 | 24 | 1.8% |
주관식 | 23 | 1.8% |
어렵다 | 21 | 1.6% |
전혀 | 21 | 1.6% |
Other values (241) | 591 | 45.3% |
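The counts in this table are per word rather than per answer, which is why 매우 and 아니다 appear separately even though "매우 아니다" is a single answer. A minimal sketch of that kind of tally, assuming plain whitespace tokenization (the report's exact tokenizer is not shown) and using the ten 답변 values from the first sample rows as input:

```python
from collections import Counter

# The ten 답변 values shown in the report's first sample rows.
answers = [
    "간호직", "전문대졸", "주관식", "7급", "그렇다",
    "아니다", "그렇다", "매우 아니다", "아니다", "그렇다",
]

# Split each answer on whitespace and count every resulting word.
word_counts = Counter(word for answer in answers for word in answer.split())
total_words = sum(word_counts.values())

# Frequency is each word's share of all words, not of all answers.
for word, count in word_counts.most_common(3):
    print(word, count, f"{count / total_words:.1%}")
```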
Most occurring characters
Value | Count | Frequency (%) |
(space) | 622 | 14.3% |
다 | 441 | 10.1% |
그 | 221 | 5.1% |
렇 | 221 | 5.1% |
매 | 200 | 4.6% |
우 | 200 | 4.6% |
아 | 155 | 3.6% |
니 | 154 | 3.5% |
지 | 67 | 1.5% |
0 | 64 | 1.5% |
Other values (213) | 2010 | 46.2% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3127 | 71.8% |
Space Separator | 622 | 14.3% |
Decimal Number | 279 | 6.4% |
Close Punctuation | 105 | 2.4% |
Open Punctuation | 104 | 2.4% |
Other Punctuation | 61 | 1.4% |
Connector Punctuation | 22 | 0.5% |
Math Symbol | 13 | 0.3% |
Other Number | 10 | 0.2% |
Uppercase Letter | 9 | 0.2% |
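The category names in this table correspond to Unicode general categories (Hangul syllables are "Other Letter", circled digits such as ③ are "Other Number", and so on). The sketch below shows how such a tally could be produced with Python's `unicodedata` module; the mapping table and the function name `category_counts` are illustrative, and the input is one 답변 value taken from the report's sample rows.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general categories mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Zs": "Space Separator", "Nd": "Decimal Number",
    "Pe": "Close Punctuation", "Ps": "Open Punctuation", "Po": "Other Punctuation",
    "Pc": "Connector Punctuation", "Sm": "Math Symbol", "No": "Other Number",
    "Lu": "Uppercase Letter", "Pd": "Dash Punctuation",
}

def category_counts(text):
    """Count characters in text by Unicode general category (long names)."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

sample = "10 (매우 그렇다)"  # one 답변 value from the report's sample rows
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```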
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
다 | 441 | 14.1% |
그 | 221 | 7.1% |
렇 | 221 | 7.1% |
매 | 200 | 6.4% |
우 | 200 | 6.4% |
아 | 155 | 5.0% |
니 | 154 | 4.9% |
지 | 67 | 2.1% |
않 | 52 | 1.7% |
요 | 49 | 1.6% |
Other values (179) | 1367 | 43.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 64 | 22.9% |
1 | 40 | 14.3% |
5 | 36 | 12.9% |
2 | 25 | 9.0% |
7 | 22 | 7.9% |
3 | 20 | 7.2% |
9 | 20 | 7.2% |
4 | 18 | 6.5% |
8 | 17 | 6.1% |
6 | 17 | 6.1% |
Other Number
Value | Count | Frequency (%) |
③ | 1 | 10.0% |
④ | 1 | 10.0% |
⑤ | 1 | 10.0% |
⑩ | 1 | 10.0% |
⑥ | 1 | 10.0% |
⑦ | 1 | 10.0% |
⑧ | 1 | 10.0% |
⑨ | 1 | 10.0% |
② | 1 | 10.0% |
① | 1 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 26 | 42.6% |
. | 17 | 27.9% |
% | 16 | 26.2% |
/ | 2 | 3.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 64 | 61.0% |
] | 41 | 39.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 63 | 60.6% |
[ | 41 | 39.4% |
Math Symbol
Value | Count | Frequency (%) |
~ | 12 | 92.3% |
> | 1 | 7.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 622 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 22 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 9 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3127 | 71.8% |
Common | 1219 | 28.0% |
Latin | 9 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
다 | 441 | 14.1% |
그 | 221 | 7.1% |
렇 | 221 | 7.1% |
매 | 200 | 6.4% |
우 | 200 | 6.4% |
아 | 155 | 5.0% |
니 | 154 | 4.9% |
지 | 67 | 2.1% |
않 | 52 | 1.7% |
요 | 49 | 1.6% |
Other values (179) | 1367 | 43.7% |
Common
Value | Count | Frequency (%) |
(space) | 622 | 51.0% |
0 | 64 | 5.3% |
) | 64 | 5.3% |
( | 63 | 5.2% |
[ | 41 | 3.4% |
] | 41 | 3.4% |
1 | 40 | 3.3% |
5 | 36 | 3.0% |
, | 26 | 2.1% |
2 | 25 | 2.1% |
Other values (23) | 197 | 16.2% |
Latin
Value | Count | Frequency (%) |
N | 9 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3127 | 71.8% |
ASCII | 1218 | 28.0% |
Enclosed Alphanum | 10 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 622 | 51.1% |
0 | 64 | 5.3% |
) | 64 | 5.3% |
( | 63 | 5.2% |
[ | 41 | 3.4% |
] | 41 | 3.4% |
1 | 40 | 3.3% |
5 | 36 | 3.0% |
, | 26 | 2.1% |
2 | 25 | 2.1% |
Other values (14) | 196 | 16.1% |
Hangul
Value | Count | Frequency (%) |
다 | 441 | 14.1% |
그 | 221 | 7.1% |
렇 | 221 | 7.1% |
매 | 200 | 6.4% |
우 | 200 | 6.4% |
아 | 155 | 5.0% |
니 | 154 | 4.9% |
지 | 67 | 2.1% |
않 | 52 | 1.7% |
요 | 49 | 1.6% |
Other values (179) | 1367 | 43.7% |
Enclosed Alphanum
Value | Count | Frequency (%) |
③ | 1 | 10.0% |
④ | 1 | 10.0% |
⑤ | 1 | 10.0% |
⑩ | 1 | 10.0% |
⑥ | 1 | 10.0% |
⑦ | 1 | 10.0% |
⑧ | 1 | 10.0% |
⑨ | 1 | 10.0% |
② | 1 | 10.0% |
① | 1 | 10.0% |
여부 (yes/no flag)
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 814.0 B |
False | 641 |
---|---|
True | 41 |
Value | Count | Frequency (%) |
False | 641 | 94.0% |
True | 41 | 6.0% |
Correlations
Variable | 설문번호 | 예시번호 | 질문번호 | 여부 |
---|---|---|---|---|
설문번호 | 1.000 | 0.440 | 0.578 | 0.335 |
예시번호 | 0.440 | 1.000 | 0.271 | 0.324 |
질문번호 | 0.578 | 0.271 | 1.000 | 0.220 |
여부 | 0.335 | 0.324 | 0.220 | 1.000 |
Variable | 설문번호 | 예시번호 | 질문번호 | 여부 |
---|---|---|---|---|
설문번호 | 1.000 | 0.216 | -0.427 | 0.250 |
예시번호 | 0.216 | 1.000 | -0.156 | 0.322 |
질문번호 | -0.427 | -0.156 | 1.000 | 0.168 |
여부 | 0.250 | 0.322 | 0.168 | 1.000 |
First rows
Index | 설문번호 | 예시번호 | 질문번호 | 답변 | 여부 |
---|---|---|---|---|---|
0 | 7 | 2 | 4 | 간호직 | N |
1 | 7 | 2 | 3 | 전문대졸 | N |
2 | 6 | 1 | 2 | 주관식 | Y |
3 | 7 | 2 | 5 | 7급 | N |
4 | 7 | 2 | 13 | 그렇다 | N |
5 | 7 | 3 | 10 | 아니다 | N |
6 | 7 | 2 | 9 | 그렇다 | N |
7 | 7 | 4 | 12 | 매우 아니다 | N |
8 | 7 | 3 | 14 | 아니다 | N |
9 | 7 | 2 | 15 | 그렇다 | N |
Last rows
Index | 설문번호 | 예시번호 | 질문번호 | 답변 | 여부 |
---|---|---|---|---|---|
672 | 12 | 4 | 4 | 7 | N |
673 | 12 | 5 | 4 | 6 (보통이다) | N |
674 | 12 | 8 | 4 | 3 | N |
675 | 12 | 1 | 5 | 10 (매우 그렇다) | N |
676 | 12 | 7 | 5 | 4 (그렇지 않은 편이다) | N |
677 | 12 | 9 | 6 | 2 (전혀 그렇지 않다) | N |
678 | 13 | 1 | 1 | 10(매우그렇다) | N |
679 | 12 | 2 | 6 | 9 | N |
680 | 12 | 3 | 6 | 8 (그렇다) | N |
681 | 13 | 1 | 2 | 10(매우 그렇다) | N |