Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 591 |
Missing cells | 146 |
Missing cells (%) | 2.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 43.4 KiB |
Average record size in memory | 75.2 B |
Variable types
Numeric | 3 |
---|---|
Text | 1 |
Categorical | 3 |
DateTime | 2 |
Dataset
Description | 중국 위생허가 신청기업 현황(해외인증)은 사업년도, 사업자번호, 주관기업명, 지원분야, 인증명칭, 신청일, 협약체결일, 사업종료일로 구성 |
---|---|
Author | 중소벤처기업부 |
URL | https://www.data.go.kr/data/15024850/fileData.do |
인증명칭 has constant value "" | Constant |
번호 is highly overall correlated with 사업년도 and 1 other fields | High correlation |
사업년도 is highly overall correlated with 번호 and 1 other fields | High correlation |
협약체결일 is highly overall correlated with 번호 and 1 other fields | High correlation |
지원분야 is highly imbalanced (67.0%) | Imbalance |
사업종료일 has 146 (24.7%) missing values | Missing |
번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 09:50:50.281891 |
---|---|
Analysis finished | 2023-12-12 09:50:52.463838 |
Duration | 2.18 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 591 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 296 |
Minimum | 1 |
---|---|
Maximum | 591 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 30.5 |
Q1 | 148.5 |
median | 296 |
Q3 | 443.5 |
95-th percentile | 561.5 |
Maximum | 591 |
Range | 590 |
Interquartile range (IQR) | 295 |
Descriptive statistics
Standard deviation | 170.75128 |
---|---|
Coefficient of variation (CV) | 0.57686244 |
Kurtosis | -1.2 |
Mean | 296 |
Median Absolute Deviation (MAD) | 148 |
Skewness | 0 |
Sum | 174936 |
Variance | 29156 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
390 | 1 | 0.2% |
392 | 1 | 0.2% |
393 | 1 | 0.2% |
394 | 1 | 0.2% |
395 | 1 | 0.2% |
396 | 1 | 0.2% |
397 | 1 | 0.2% |
398 | 1 | 0.2% |
399 | 1 | 0.2% |
Other values (581) | 581 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
591 | 1 | |
590 | 1 | |
589 | 1 | |
588 | 1 | |
587 | 1 | |
586 | 1 | |
585 | 1 | |
584 | 1 | |
583 | 1 | |
582 | 1 |
사업년도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.2589 |
Minimum | 2015 |
---|---|
Maximum | 2023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.3 KiB |
Quantile statistics
Minimum | 2015 |
---|---|
5-th percentile | 2015 |
Q1 | 2016 |
median | 2017 |
Q3 | 2021 |
95-th percentile | 2022 |
Maximum | 2023 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.4665185 |
---|---|
Coefficient of variation (CV) | 0.0012221021 |
Kurtosis | -1.2720215 |
Mean | 2018.2589 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.40540736 |
Sum | 1192791 |
Variance | 6.0837133 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2016 | 169 | |
2017 | 82 | |
2022 | 69 | |
2020 | 64 | 10.8% |
2021 | 61 | 10.3% |
2015 | 48 | 8.1% |
2018 | 48 | 8.1% |
2019 | 30 | 5.1% |
2023 | 20 | 3.4% |
Value | Count | Frequency (%) |
2015 | 48 | 8.1% |
2016 | 169 | |
2017 | 82 | |
2018 | 48 | 8.1% |
2019 | 30 | 5.1% |
2020 | 64 | 10.8% |
2021 | 61 | 10.3% |
2022 | 69 | |
2023 | 20 | 3.4% |
Value | Count | Frequency (%) |
2023 | 20 | 3.4% |
2022 | 69 | |
2021 | 61 | 10.3% |
2020 | 64 | 10.8% |
2019 | 30 | 5.1% |
2018 | 48 | 8.1% |
2017 | 82 | |
2016 | 169 | |
2015 | 48 | 8.1% |
사업자번호
Real number (ℝ)
Distinct | 510 |
---|---|
Distinct (%) | 86.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.3305734 × 109 |
Minimum | 1.0181145 × 109 |
---|---|
Maximum | 8.9204016 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.3 KiB |
Quantile statistics
Minimum | 1.0181145 × 109 |
---|---|
5-th percentile | 1.0883764 × 109 |
Q1 | 1.3181817 × 109 |
median | 2.2481447 × 109 |
Q3 | 5.079478 × 109 |
95-th percentile | 7.8086504 × 109 |
Maximum | 8.9204016 × 109 |
Range | 7.9022872 × 109 |
Interquartile range (IQR) | 3.7612964 × 109 |
Descriptive statistics
Standard deviation | 2.2542503 × 109 |
---|---|
Coefficient of variation (CV) | 0.67683549 |
Kurtosis | -0.51950465 |
Mean | 3.3305734 × 109 |
Median Absolute Deviation (MAD) | 9.8953852 × 108 |
Skewness | 0.8752498 |
Sum | 1.9683689 × 1012 |
Variance | 5.0816444 × 1018 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1208804273 | 4 | 0.7% |
1358149079 | 3 | 0.5% |
5158133943 | 3 | 0.5% |
5038604232 | 3 | 0.5% |
2308108577 | 3 | 0.5% |
3058601397 | 3 | 0.5% |
1398120389 | 3 | 0.5% |
1308146868 | 3 | 0.5% |
1208199219 | 3 | 0.5% |
1058815091 | 2 | 0.3% |
Other values (500) | 561 |
Value | Count | Frequency (%) |
1018114470 | 1 | |
1018157824 | 1 | |
1018651316 | 1 | |
1052432915 | 2 | |
1058690704 | 1 | |
1058716360 | 2 | |
1058723084 | 1 | |
1058724900 | 1 | |
1058764902 | 1 | |
1058807096 | 1 |
Value | Count | Frequency (%) |
8920401634 | 1 | |
8808600680 | 1 | |
8790700384 | 1 | |
8788100872 | 1 | |
8758100271 | 2 | |
8728701915 | 1 | |
8698700472 | 1 | |
8648800019 | 1 | |
8638800170 | 1 | |
8538700953 | 1 |
주관기업명
Text
Distinct | 529 |
---|---|
Distinct (%) | 89.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
Value | Count | Frequency (%) |
주식회사 | 115 | 15.9% |
주 | 8 | 1.1% |
코스메틱 | 4 | 0.6% |
주)스킨러버스코스메틱 | 4 | 0.6% |
유씨엘(주 | 3 | 0.4% |
제닉 | 3 | 0.4% |
주)오스테오시스 | 3 | 0.4% |
에이팜 | 3 | 0.4% |
주)하스 | 3 | 0.4% |
주)웰코스 | 3 | 0.4% |
Other values (523) | 576 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 489 | 10.3% |
) | 348 | 7.4% |
( | 345 | 7.3% |
스 | 233 | 4.9% |
이 | 178 | 3.8% |
사 | 148 | 3.1% |
140 | 3.0% | |
회 | 140 | 3.0% |
식 | 139 | 2.9% |
코 | 124 | 2.6% |
Other values (355) | 2442 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3801 | |
Close Punctuation | 348 | 7.4% |
Open Punctuation | 346 | 7.3% |
Space Separator | 140 | 3.0% |
Uppercase Letter | 57 | 1.2% |
Lowercase Letter | 14 | 0.3% |
Decimal Number | 11 | 0.2% |
Other Punctuation | 7 | 0.1% |
Other Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 489 | 12.9% |
스 | 233 | 6.1% |
이 | 178 | 4.7% |
사 | 148 | 3.9% |
회 | 140 | 3.7% |
식 | 139 | 3.7% |
코 | 124 | 3.3% |
에 | 85 | 2.2% |
메 | 82 | 2.2% |
오 | 79 | 2.1% |
Other values (312) | 2104 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 7 | |
E | 6 | |
A | 5 | 8.8% |
C | 5 | 8.8% |
O | 4 | 7.0% |
L | 4 | 7.0% |
T | 3 | 5.3% |
R | 3 | 5.3% |
I | 3 | 5.3% |
P | 3 | 5.3% |
Other values (9) | 14 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 2 | |
i | 2 | |
n | 2 | |
t | 2 | |
d | 1 | |
o | 1 | |
c | 1 | |
a | 1 | |
r | 1 | |
b | 1 |
Decimal Number
Value | Count | Frequency (%) |
0 | 3 | |
3 | 3 | |
9 | 2 | |
6 | 1 | 9.1% |
1 | 1 | 9.1% |
4 | 1 | 9.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 4 | |
& | 2 | |
, | 1 | 14.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 345 | |
( | 1 | 0.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 348 |
Space Separator
Value | Count | Frequency (%) |
140 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3803 | |
Common | 852 | 18.0% |
Latin | 71 | 1.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 489 | 12.9% |
스 | 233 | 6.1% |
이 | 178 | 4.7% |
사 | 148 | 3.9% |
회 | 140 | 3.7% |
식 | 139 | 3.7% |
코 | 124 | 3.3% |
에 | 85 | 2.2% |
메 | 82 | 2.2% |
오 | 79 | 2.1% |
Other values (313) | 2106 |
Latin
Value | Count | Frequency (%) |
S | 7 | 9.9% |
E | 6 | 8.5% |
A | 5 | 7.0% |
C | 5 | 7.0% |
O | 4 | 5.6% |
L | 4 | 5.6% |
T | 3 | 4.2% |
R | 3 | 4.2% |
I | 3 | 4.2% |
P | 3 | 4.2% |
Other values (19) | 28 |
Common
Value | Count | Frequency (%) |
) | 348 | |
( | 345 | |
140 | ||
. | 4 | 0.5% |
0 | 3 | 0.4% |
3 | 3 | 0.4% |
9 | 2 | 0.2% |
& | 2 | 0.2% |
( | 1 | 0.1% |
6 | 1 | 0.1% |
Other values (3) | 3 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3801 | |
ASCII | 922 | 19.5% |
None | 3 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 489 | 12.9% |
스 | 233 | 6.1% |
이 | 178 | 4.7% |
사 | 148 | 3.9% |
회 | 140 | 3.7% |
식 | 139 | 3.7% |
코 | 124 | 3.3% |
에 | 85 | 2.2% |
메 | 82 | 2.2% |
오 | 79 | 2.1% |
Other values (312) | 2104 |
ASCII
Value | Count | Frequency (%) |
) | 348 | |
( | 345 | |
140 | ||
S | 7 | 0.8% |
E | 6 | 0.7% |
A | 5 | 0.5% |
C | 5 | 0.5% |
. | 4 | 0.4% |
O | 4 | 0.4% |
L | 4 | 0.4% |
Other values (31) | 54 | 5.9% |
None
Value | Count | Frequency (%) |
㈜ | 2 | |
( | 1 |
지원분야
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
화장품 | |
---|---|
의료기기 | 38 |
의료기기 및 용품 | 32 |
공산품 | 1 |
Length
Max length | 9 |
---|---|
Median length | 3 |
Mean length | 3.3891709 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 의료기기 및 용품 |
---|---|
2nd row | 의료기기 및 용품 |
3rd row | 의료기기 및 용품 |
4th row | 의료기기 및 용품 |
5th row | 의료기기 및 용품 |
Common Values
Value | Count | Frequency (%) |
화장품 | 520 | |
의료기기 | 38 | 6.4% |
의료기기 및 용품 | 32 | 5.4% |
공산품 | 1 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
화장품 | 520 | |
의료기기 | 70 | 10.7% |
및 | 32 | 4.9% |
용품 | 32 | 4.9% |
공산품 | 1 | 0.2% |
인증명칭
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
NMPA(구.CDFA) |
---|
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NMPA(구.CDFA) |
---|---|
2nd row | NMPA(구.CDFA) |
3rd row | NMPA(구.CDFA) |
4th row | NMPA(구.CDFA) |
5th row | NMPA(구.CDFA) |
Common Values
Value | Count | Frequency (%) |
NMPA(구.CDFA) | 591 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
nmpa(구.cdfa | 591 |
신청일
Date
Distinct | 214 |
---|---|
Distinct (%) | 36.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
Minimum | 2015-03-12 00:00:00 |
---|---|
Maximum | 2023-05-31 00:00:00 |
협약체결일
Categorical
HIGH CORRELATION
 
Distinct | 50 |
---|---|
Distinct (%) | 8.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
2016-11-14 | |
---|---|
2016-06-20 | |
2015-06-30 | |
2017-11-20 | |
2020-04-23 | 29 |
Other values (45) |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 14 ? |
---|---|
Unique (%) | 2.4% |
Sample
1st row | 2015-10-23 |
---|---|
2nd row | 2015-06-30 |
3rd row | 2015-06-30 |
4th row | 2015-06-30 |
5th row | 2015-06-30 |
Common Values
Value | Count | Frequency (%) |
2016-11-14 | 68 | 11.5% |
2016-06-20 | 57 | 9.6% |
2015-06-30 | 47 | 8.0% |
2017-11-20 | 42 | 7.1% |
2020-04-23 | 29 | 4.9% |
2022-04-01 | 28 | 4.7% |
2022-10-31 | 21 | 3.6% |
2016-12-20 | 21 | 3.6% |
2021-04-28 | 21 | 3.6% |
2021-07-23 | 21 | 3.6% |
Other values (40) | 236 |
Length
Value | Count | Frequency (%) |
2016-11-14 | 68 | 11.5% |
2016-06-20 | 57 | 9.6% |
2015-06-30 | 47 | 8.0% |
2017-11-20 | 42 | 7.1% |
2020-04-23 | 29 | 4.9% |
2022-04-01 | 28 | 4.7% |
2022-10-31 | 21 | 3.6% |
2016-12-20 | 21 | 3.6% |
2021-04-28 | 21 | 3.6% |
2021-07-23 | 21 | 3.6% |
Other values (40) | 236 |
사업종료일
Date
MISSING
 
Distinct | 202 |
---|---|
Distinct (%) | 45.4% |
Missing | 146 |
Missing (%) | 24.7% |
Memory size | 4.7 KiB |
Minimum | 2016-06-02 00:00:00 |
---|---|
Maximum | 2023-08-24 00:00:00 |
번호 | 사업년도 | 사업자번호 | 지원분야 | 협약체결일 | |
---|---|---|---|---|---|
번호 | 1.000 | 0.906 | 0.226 | 0.467 | 0.995 |
사업년도 | 0.906 | 1.000 | 0.152 | 0.491 | 1.000 |
사업자번호 | 0.226 | 0.152 | 1.000 | 0.000 | 0.321 |
지원분야 | 0.467 | 0.491 | 0.000 | 1.000 | 0.682 |
협약체결일 | 0.995 | 1.000 | 0.321 | 0.682 | 1.000 |
지원분야 | 협약체결일 | |
---|---|---|
지원분야 | 1.000 | 0.396 |
협약체결일 | 0.396 | 1.000 |
번호 | 사업년도 | 사업자번호 | 지원분야 | 협약체결일 | |
---|---|---|---|---|---|
번호 | 1.000 | 0.984 | 0.201 | 0.305 | 0.856 |
사업년도 | 0.984 | 1.000 | 0.200 | 0.243 | 0.964 |
사업자번호 | 0.201 | 0.200 | 1.000 | 0.000 | 0.104 |
지원분야 | 0.305 | 0.243 | 0.000 | 1.000 | 0.396 |
협약체결일 | 0.856 | 0.964 | 0.104 | 0.396 | 1.000 |
번호 | 사업년도 | 사업자번호 | 주관기업명 | 지원분야 | 인증명칭 | 신청일 | 협약체결일 | 사업종료일 | |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2015 | 2158618799 | (주)워랜텍 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-08-24 | 2015-10-23 | 2016-12-14 |
1 | 2 | 2015 | 5048137935 | (주)덴토스 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-04-03 | 2015-06-30 | 2017-12-29 |
2 | 3 | 2015 | 1298636481 | (주)월드바이오텍 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-04-02 | 2015-06-30 | 2017-09-13 |
3 | 4 | 2015 | 1238175370 | (주)휴비딕 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-04-01 | 2015-06-30 | 2017-11-06 |
4 | 5 | 2015 | 1208199219 | (주)오스테오시스 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-03-31 | 2015-06-30 | 2017-08-30 |
5 | 6 | 2015 | 1238601005 | (주)참메드 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-03-30 | 2015-06-30 | 2017-06-29 |
6 | 7 | 2015 | 3178126590 | (주)더아이엔지메디칼 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-03-23 | 2015-06-30 | 2016-06-02 |
7 | 8 | 2015 | 6038158163 | 주식회사 네오실 | 의료기기 및 용품 | NMPA(구.CDFA) | 2015-03-17 | 2015-06-30 | 2017-09-14 |
8 | 9 | 2015 | 6218195152 | (주)포셀 | 화장품 | NMPA(구.CDFA) | 2015-04-03 | 2015-06-30 | 2018-01-19 |
9 | 10 | 2015 | 2118622189 | 클리오 | 화장품 | NMPA(구.CDFA) | 2015-04-03 | 2015-06-30 | 2017-12-29 |
번호 | 사업년도 | 사업자번호 | 주관기업명 | 지원분야 | 인증명칭 | 신청일 | 협약체결일 | 사업종료일 | |
---|---|---|---|---|---|---|---|---|---|
581 | 582 | 2023 | 6988601552 | 티핏클래스(주) | 화장품 | NMPA(구.CDFA) | 20230329 | 2023-06-02 | <NA> |
582 | 583 | 2023 | 8788100872 | (주)기베스트 | 화장품 | NMPA(구.CDFA) | 20230530 | 2023-07-21 | <NA> |
583 | 584 | 2023 | 8538700953 | (주)스타스테크 | 화장품 | NMPA(구.CDFA) | 20230531 | 2023-07-21 | <NA> |
584 | 585 | 2023 | 1438120262 | (주)아침해의료기 | 의료기기 | NMPA(구.CDFA) | 20230531 | 2023-07-21 | <NA> |
585 | 586 | 2023 | 2048612193 | (주)와이제이비앤 | 화장품 | NMPA(구.CDFA) | 20230530 | 2023-07-21 | <NA> |
586 | 587 | 2023 | 1298636481 | (주)월드바이오텍 | 의료기기 | NMPA(구.CDFA) | 20230531 | 2023-07-21 | <NA> |
587 | 588 | 2023 | 3328100885 | (주)쥬네뷰 | 화장품 | NMPA(구.CDFA) | 20230519 | 2023-07-21 | <NA> |
588 | 589 | 2023 | 7118700829 | (주)코스모어플러스 | 화장품 | NMPA(구.CDFA) | 20230530 | 2023-07-21 | <NA> |
589 | 590 | 2023 | 1228701163 | 오션스바이오(주) | 의료기기 | NMPA(구.CDFA) | 20230530 | 2023-07-21 | <NA> |
590 | 591 | 2023 | 8920401634 | 주아빛 | 화장품 | NMPA(구.CDFA) | 20230531 | 2023-07-21 | <NA> |