Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 883 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 21.7 KiB |
Average record size in memory | 25.1 B |
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Dataset
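Description | Attachment information for posts published on the official website of the Korean Agency for Technology and Standards (국가기술표준원), under the Ministry of Trade, Industry and Energy (산업통상자원부). Provides the post number, file name, and original file name. |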
---|---|
URL | https://www.data.go.kr/data/15040687/fileData.do |
파일명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 18:27:35.710491 |
---|---|
Analysis finished | 2023-12-12 18:27:36.305141 |
Duration | 0.59 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
게시글번호 (post number)
Real number (ℝ)
Distinct | 593 |
---|---|
Distinct (%) | 67.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23624.561 |
Minimum | 19142 |
---|---|
Maximum | 23941 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.9 KiB |
Quantile statistics
Minimum | 19142 |
---|---|
5-th percentile | 23372.1 |
Q1 | 23483.5 |
median | 23643 |
Q3 | 23787.5 |
95-th percentile | 23911 |
Maximum | 23941 |
Range | 4799 |
Interquartile range (IQR) | 304 |
Descriptive statistics
Standard deviation | 297.5825 |
---|---|
Coefficient of variation (CV) | 0.012596319 |
Kurtosis | 123.7114 |
Mean | 23624.561 |
Median Absolute Deviation (MAD) | 152 |
Skewness | -8.8385162 |
Sum | 20860487 |
Variance | 88555.342 |
Monotonicity | Not monotonic |
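Several of the figures above are derived from a handful of others. As a minimal sketch, the following Python snippet recomputes the range, IQR, coefficient of variation, variance, and sum from values copied out of this report. The variable names are illustrative and are not part of ydata-profiling.

```python
# Values copied from the 게시글번호 tables in this report.
n = 883
mean = 23624.561
std = 297.5825
minimum, maximum = 19142, 23941
q1, q3 = 23483.5, 23787.5

value_range = maximum - minimum   # reported Range: 4799
iqr = q3 - q1                     # reported IQR: 304
cv = std / mean                   # reported CV: 0.012596319
variance = std ** 2               # reported Variance: 88555.342
total = mean * n                  # reported Sum: 20860487

print(value_range, iqr)
print(round(cv, 9), round(variance, 3), round(total))
```

Small differences in the last digits are expected, because the report displays a rounded mean and standard deviation.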
Value | Count | Frequency (%) |
23378 | 5 | 0.6% |
23784 | 5 | 0.6% |
23911 | 5 | 0.6% |
23798 | 5 | 0.6% |
23703 | 5 | 0.6% |
23663 | 5 | 0.6% |
23376 | 5 | 0.6% |
23377 | 4 | 0.5% |
23421 | 4 | 0.5% |
23445 | 4 | 0.5% |
Other values (583) | 836 | 94.7% |
Value | Count | Frequency (%) |
19142 | 2 | 0.2% |
21269 | 2 | 0.2% |
23340 | 1 | 0.1% |
23341 | 2 | 0.2% |
23342 | 1 | 0.1% |
23343 | 1 | 0.1% |
23344 | 1 | 0.1% |
23345 | 2 | 0.2% |
23346 | 1 | 0.1% |
23347 | 4 | 0.5% |
Value | Count | Frequency (%) |
23941 | 1 | 0.1% |
23940 | 1 | 0.1% |
23939 | 4 | 0.5% |
23938 | 1 | 0.1% |
23937 | 2 | 0.2% |
23936 | 1 | 0.1% |
23935 | 1 | 0.1% |
23934 | 1 | 0.1% |
23933 | 1 | 0.1% |
23932 | 1 | 0.1% |
파일명 (file name)
Text
UNIQUE
Distinct | 883 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.0 KiB |
Length
Max length | 29 |
---|---|
Median length | 28 |
Mean length | 28.02718 |
Min length | 27 |
Characters and Unicode
Total characters | 24748 |
---|---|
Distinct characters | 37 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 883 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 23343_202210050806404390.pdf |
---|---|
2nd row | 23365_202210171516584330.pdf |
3rd row | 23397_202210310711363180.jpg |
4th row | 23397_202210310711363430.jpg |
5th row | 23397_202210310711364551.jpg |
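The sample values suggest a fixed naming layout: the post number, an underscore, a run of digits, and the file extension. The sketch below splits a stored name on that layout. It assumes the first 14 digits after the underscore are an upload timestamp (YYYYMMDDhhmmss) followed by extra digits; the source does not document the scheme, so that reading, along with the helper name `parse_stored_name`, is hypothetical.

```python
import re
from datetime import datetime

# Assumed layout: <post number>_<YYYYMMDDhhmmss><extra digits>.<extension>
STORED_NAME = re.compile(r"^(\d+)_(\d{14})(\d*)\.(\w+)$")

def parse_stored_name(name):
    """Split a stored file name into its apparent parts."""
    match = STORED_NAME.match(name)
    if match is None:
        raise ValueError(f"unexpected file name: {name!r}")
    post_no, stamp, suffix, ext = match.groups()
    return {
        "post_number": int(post_no),
        # Interpreting these digits as a timestamp is an assumption.
        "uploaded_at": datetime.strptime(stamp, "%Y%m%d%H%M%S"),
        "suffix": suffix,
        "extension": ext.lower(),
    }

info = parse_stored_name("23397_202210310711363180.jpg")
print(info["post_number"], info["uploaded_at"], info["extension"])
```

If the assumption holds, the leading number in each 파일명 value should equal the 게시글번호 value in the same row, which makes a simple consistency check on the dataset.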
Value | Count | Frequency (%) |
23343_202210050806404390.pdf | 1 | 0.1% |
23458_202212011649448050.pdf | 1 | 0.1% |
23435_202211171751278570.pdf | 1 | 0.1% |
23468_202212061606027290.hwp | 1 | 0.1% |
23468_202212061606028261.pdf | 1 | 0.1% |
23468_202212061606029252.pdf | 1 | 0.1% |
23471_202212091738002430.pdf | 1 | 0.1% |
23471_202212091738003331.zip | 1 | 0.1% |
23474_202212121722181770.hwp | 1 | 0.1% |
23451_202211241718159630.pdf | 1 | 0.1% |
Other values (873) | 873 | 98.9% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 4428 | 17.9% |
0 | 3670 | 14.8% |
3 | 2840 | 11.5% |
1 | 2782 | 11.2% |
4 | 1302 | 5.3% |
5 | 1223 | 4.9% |
7 | 1193 | 4.8% |
6 | 984 | 4.0% |
8 | 971 | 3.9% |
9 | 917 | 3.7% |
Other values (27) | 4438 | 17.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20310 | 82.1% |
Lowercase Letter | 2571 | 10.4% |
Connector Punctuation | 883 | 3.6% |
Other Punctuation | 883 | 3.6% |
Uppercase Letter | 101 | 0.4% |
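The category labels in this table are Unicode general categories. As a rough sketch of how such counts can be produced, the snippet below tallies categories with Python's standard `unicodedata` module over two file names taken from this report. The helper `category_counts` and the name mapping are illustrative; the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in 파일명.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

sample = ["23343_202210050806404390.pdf", "23412_202211031748080300.JPG"]
counts = category_counts(sample)
print(counts.most_common())
```

Each file name contributes exactly one underscore and one period, which is why Connector Punctuation and Other Punctuation both total 883 in the table above.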
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
p | 841 | 32.7% |
h | 474 | 18.4% |
w | 473 | 18.4% |
d | 265 | 10.3% |
f | 263 | 10.2% |
z | 75 | 2.9% |
i | 74 | 2.9% |
g | 31 | 1.2% |
j | 31 | 1.2% |
x | 27 | 1.1% |
Other values (6) | 17 | 0.7% |
Decimal Number
Value | Count | Frequency (%) |
2 | 4428 | 21.8% |
0 | 3670 | 18.1% |
3 | 2840 | 14.0% |
1 | 2782 | 13.7% |
4 | 1302 | 6.4% |
5 | 1223 | 6.0% |
7 | 1193 | 5.9% |
6 | 984 | 4.8% |
8 | 971 | 4.8% |
9 | 917 | 4.5% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 31 | 30.7% |
G | 30 | 29.7% |
N | 25 | 24.8% |
J | 5 | 5.0% |
X | 4 | 4.0% |
L | 2 | 2.0% |
S | 2 | 2.0% |
D | 1 | 1.0% |
F | 1 | 1.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 883 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 883 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 22076 | 89.2% |
Latin | 2672 | 10.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
p | 841 | 31.5% |
h | 474 | 17.7% |
w | 473 | 17.7% |
d | 265 | 9.9% |
f | 263 | 9.8% |
z | 75 | 2.8% |
i | 74 | 2.8% |
g | 31 | 1.2% |
P | 31 | 1.2% |
j | 31 | 1.2% |
Other values (15) | 114 | 4.3% |
Common
Value | Count | Frequency (%) |
2 | 4428 | 20.1% |
0 | 3670 | 16.6% |
3 | 2840 | 12.9% |
1 | 2782 | 12.6% |
4 | 1302 | 5.9% |
5 | 1223 | 5.5% |
7 | 1193 | 5.4% |
6 | 984 | 4.5% |
8 | 971 | 4.4% |
9 | 917 | 4.2% |
Other values (2) | 1766 | 8.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24748 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 4428 | 17.9% |
0 | 3670 | 14.8% |
3 | 2840 | 11.5% |
1 | 2782 | 11.2% |
4 | 1302 | 5.3% |
5 | 1223 | 4.9% |
7 | 1193 | 4.8% |
6 | 984 | 4.0% |
8 | 971 | 3.9% |
9 | 917 | 3.7% |
Other values (27) | 4438 | 17.9% |
오리지널파일명 (original file name)
Text
Distinct | 814 |
---|---|
Distinct (%) | 92.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.0 KiB |
Length
Max length | 93 |
---|---|
Median length | 66 |
Mean length | 40.475651 |
Min length | 6 |
Characters and Unicode
Total characters | 35740 |
---|---|
Distinct characters | 538 |
Distinct categories | 15 |
Distinct scripts | 3 |
Distinct blocks | 7 |
Unique
Unique | 764 |
---|---|
Unique (%) | 86.5% |
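Distinct (814) and Unique (764) differ for this column. The sketch below assumes the usual reading of the two fields: "distinct" counts the different values present, while "unique" counts only the values that occur exactly once. The list of names is hypothetical and is not taken from the dataset.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical file names, for illustration only.
names = ["a.pdf", "b.hwp", "a.pdf", "c.zip", "b.hwp", "d.jpg"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)
```

Under that reading, 814 - 764 = 50 original file names appear more than once, and together they account for the 883 - 764 = 119 rows that do not carry a one-off name.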
Sample
1st row | 상임전문위원 서류전형 합격자 및 면접시험 공고.pdf |
---|---|
2nd row | 상임전문위원 채용시험 최종합격자 공고.pdf |
3rd row | 16656528475150.jpg |
4th row | 16656528475811.jpg |
5th row | 16656528478745.jpg |
Value | Count | Frequency (%) |
공고 | 157 | 3.6% |
국가기술표준원 | 135 | 3.1% |
kolas | 85 | 1.9% |
및 | 84 | 1.9% |
2023년 | 60 | 1.4% |
공고.hwp | 47 | 1.1% |
공인시험기관 | 46 | 1.0% |
인정공고(국가기술표준원 | 39 | 0.9% |
안전기준 | 38 | 0.9% |
2023년도 | 33 | 0.7% |
Other values (1999) | 3693 | 83.6% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 3536 | 9.9% |
2 | 1880 | 5.3% |
0 | 1453 | 4.1% |
. | 1160 | 3.2% |
) | 878 | 2.5% |
( | 877 | 2.5% |
p | 849 | 2.4% |
기 | 807 | 2.3% |
1 | 782 | 2.2% |
3 | 762 | 2.1% |
Other values (528) | 22756 | 63.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 17718 | 49.6% |
Decimal Number | 6074 | 17.0% |
Space Separator | 3536 | 9.9% |
Lowercase Letter | 2761 | 7.7% |
Other Punctuation | 1466 | 4.1% |
Uppercase Letter | 1456 | 4.1% |
Close Punctuation | 915 | 2.6% |
Open Punctuation | 914 | 2.6% |
Connector Punctuation | 472 | 1.3% |
Dash Punctuation | 401 | 1.1% |
Other values (5) | 27 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 807 | 4.6% |
공 | 711 | 4.0% |
제 | 693 | 3.9% |
고 | 651 | 3.7% |
준 | 572 | 3.2% |
표 | 541 | 3.1% |
인 | 519 | 2.9% |
정 | 483 | 2.7% |
국 | 469 | 2.6% |
술 | 448 | 2.5% |
Other values (441) | 11824 | 66.7% |
Uppercase Letter
Value | Count | Frequency (%) |
K | 204 | 14.0% |
A | 168 | 11.5% |
S | 164 | 11.3% |
L | 149 | 10.2% |
O | 147 | 10.1% |
P | 96 | 6.6% |
N | 80 | 5.5% |
G | 77 | 5.3% |
R | 66 | 4.5% |
T | 63 | 4.3% |
Other values (14) | 242 | 16.6% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 849 | 30.7% |
h | 479 | 17.3% |
w | 473 | 17.1% |
d | 269 | 9.7% |
f | 265 | 9.6% |
i | 95 | 3.4% |
z | 75 | 2.7% |
g | 39 | 1.4% |
j | 31 | 1.1% |
x | 27 | 1.0% |
Other values (13) | 159 | 5.8% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1880 | 31.0% |
0 | 1453 | 23.9% |
1 | 782 | 12.9% |
3 | 762 | 12.5% |
5 | 245 | 4.0% |
6 | 241 | 4.0% |
4 | 203 | 3.3% |
7 | 196 | 3.2% |
8 | 169 | 2.8% |
9 | 143 | 2.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1160 | 79.1% |
, | 253 | 17.3% |
· | 25 | 1.7% |
' | 10 | 0.7% |
! | 9 | 0.6% |
? | 3 | 0.2% |
; | 2 | 0.1% |
& | 2 | 0.1% |
※ | 1 | 0.1% |
% | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
+ | 3 | 37.5% |
~ | 2 | 25.0% |
= | 2 | 25.0% |
∼ | 1 | 12.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 878 | 96.0% |
] | 25 | 2.7% |
」 | 12 | 1.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 877 | 96.0% |
[ | 25 | 2.7% |
「 | 12 | 1.3% |
Final Punctuation
Value | Count | Frequency (%) |
” | 4 | 66.7% |
’ | 2 | 33.3% |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 4 | 66.7% |
‘ | 2 | 33.3% |
Other Symbol
Value | Count | Frequency (%) |
★ | 3 | 60.0% |
㈜ | 2 | 40.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3536 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 472 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 401 | 100.0% |
Modifier Symbol
Value | Count | Frequency (%) |
` | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 17720 | 49.6% |
Common | 13803 | 38.6% |
Latin | 4217 | 11.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 807 | 4.6% |
공 | 711 | 4.0% |
제 | 693 | 3.9% |
고 | 651 | 3.7% |
준 | 572 | 3.2% |
표 | 541 | 3.1% |
인 | 519 | 2.9% |
정 | 483 | 2.7% |
국 | 469 | 2.6% |
술 | 448 | 2.5% |
Other values (442) | 11826 | 66.7% |
Latin
Value | Count | Frequency (%) |
p | 849 | 20.1% |
h | 479 | 11.4% |
w | 473 | 11.2% |
d | 269 | 6.4% |
f | 265 | 6.3% |
K | 204 | 4.8% |
A | 168 | 4.0% |
S | 164 | 3.9% |
L | 149 | 3.5% |
O | 147 | 3.5% |
Other values (37) | 1050 | 24.9% |
Common
Value | Count | Frequency (%) |
(space) | 3536 | 25.6% |
2 | 1880 | 13.6% |
0 | 1453 | 10.5% |
. | 1160 | 8.4% |
) | 878 | 6.4% |
( | 877 | 6.4% |
1 | 782 | 5.7% |
3 | 762 | 5.5% |
_ | 472 | 3.4% |
- | 401 | 2.9% |
Other values (29) | 1602 | 11.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17954 | 50.2% |
Hangul | 17717 | 49.6% |
None | 51 | 0.1% |
Punctuation | 13 | < 0.1% |
Misc Symbols | 3 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 3536 | 19.7% |
2 | 1880 | 10.5% |
0 | 1453 | 8.1% |
. | 1160 | 6.5% |
) | 878 | 4.9% |
( | 877 | 4.9% |
p | 849 | 4.7% |
1 | 782 | 4.4% |
3 | 762 | 4.2% |
h | 479 | 2.7% |
Other values (66) | 5298 | 29.5% |
Hangul
Value | Count | Frequency (%) |
기 | 807 | 4.6% |
공 | 711 | 4.0% |
제 | 693 | 3.9% |
고 | 651 | 3.7% |
준 | 572 | 3.2% |
표 | 541 | 3.1% |
인 | 519 | 2.9% |
정 | 483 | 2.7% |
국 | 469 | 2.6% |
술 | 448 | 2.5% |
Other values (440) | 11823 | 66.7% |
None
Value | Count | Frequency (%) |
· | 25 | 49.0% |
」 | 12 | 23.5% |
「 | 12 | 23.5% |
㈜ | 2 | 3.9% |
Punctuation
Value | Count | Frequency (%) |
” | 4 | 30.8% |
“ | 4 | 30.8% |
’ | 2 | 15.4% |
‘ | 2 | 15.4% |
※ | 1 | 7.7% |
Misc Symbols
Value | Count | Frequency (%) |
★ | 3 | 100.0% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 | 100.0% |
Math Operators
Value | Count | Frequency (%) |
∼ | 1 | 100.0% |
Index | 게시글번호 | 파일명 | 오리지널파일명 |
---|---|---|---|
0 | 23343 | 23343_202210050806404390.pdf | 상임전문위원 서류전형 합격자 및 면접시험 공고.pdf |
1 | 23365 | 23365_202210171516584330.pdf | 상임전문위원 채용시험 최종합격자 공고.pdf |
2 | 23397 | 23397_202210310711363180.jpg | 16656528475150.jpg |
3 | 23397 | 23397_202210310711363430.jpg | 16656528475811.jpg |
4 | 23397 | 23397_202210310711364551.jpg | 16656528478745.jpg |
5 | 23398 | 23398_202210310715302150.jpg | 16665945972121.jpg |
6 | 23398 | 23398_202210310715303190.jpg | 16665945964850.jpg |
7 | 23398 | 23398_202210310715304091.jpg | 16665945973763.jpg |
8 | 23398 | 23398_202210310715305192.jpg | 16665945975325.jpg |
9 | 23399 | 23399_202210310720220070.jpg | 16667683966195.jpg |
Index | 게시글번호 | 파일명 | 오리지널파일명 |
---|---|---|---|
873 | 23756 | 23756_202305170936246711.hwp | 0515(16석간)기계융합산업표준과, 한, 친환경 선박분야 ISO 국제표준 주도.hwp |
874 | 23761 | 23761_202305220942129690.hwp | 0519(22조간)기계융합산업표준과, 우리 자율주행 기술, 국제표준으로 세계시장 진출.hwp |
875 | 23416 | 23416_202211081627187940.hwp | 1103(04석간) 메타버스 서비스표준화 포럼.hwp |
876 | 23434 | 23434_202211171410528180.hwp | 1117(18조간)바이오화학서비스표준과, 건물에너지 효율 위한 건물일체형 태양광(BIPV) KS 개정.hwp |
877 | 23585 | 23585_202302122044413410.hwp | 0210(11조간)바이오화학서비스표준과, 우리나라 나노센서 성능평가 기술, 국제표준으로 제정.hwp |
878 | 23609 | 23609_202302231245037280.hwp | 0223(조간)바이오화학서비스표준과, 우리나라 주도로 넷제로 에너지 국제표준 최초 개발_제출_최종.hwp |
879 | 23687 | 23687_202304070924536500.hwp | 0406(7조간)바이오화학서비스표준과, 한국인 고령인구(70세_84세) 20년 전보다 키 크고 날씬해져.hwp |
880 | 23822 | 23822_202306291517506800.hwp | 0628(29조간)바이오화학서비스표준과, 이차전지 양극재 분석 표준화로 K배터리 글로벌 경쟁력 강화.hwp |
881 | 23934 | 23934_202308240916358411.hwp | 0823(24조간)바이오화학서비스표준과, 백신산업 분류체계 표준화로 글로벌 백신강국 토대 마련.hwp |
882 | 23412 | 23412_202211031748080300.JPG | test.JPG |