Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 4749 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 111.4 KiB |
Average record size in memory | 24.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 1 |
Dataset
Description | 국립암센터에서 19년도 9월까지 국립암센터홈페이지를 통해 개방하는 공지코드 |
---|---|
Author | 국립암센터 |
URL | https://www.data.go.kr/data/15049634/fileData.do |
NTC_IDX has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 05:39:44.135020 |
---|---|
Analysis finished | 2023-12-12 05:39:44.435951 |
Duration | 0.3 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NTC_IDX
Text
UNIQUE
 
Distinct | 4749 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 37.2 KiB |
Value | Count | Frequency (%) |
358 | 1 | < 0.1% |
8,611 | 1 | < 0.1% |
8,607 | 1 | < 0.1% |
8,609 | 1 | < 0.1% |
8,596 | 1 | < 0.1% |
8,597 | 1 | < 0.1% |
8,595 | 1 | < 0.1% |
8,591 | 1 | < 0.1% |
8,569 | 1 | < 0.1% |
8,559 | 1 | < 0.1% |
Other values (4739) | 4739 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 4343 | |
, | 4202 | |
3 | 2053 | |
2 | 1994 | |
4 | 1992 | |
5 | 1886 | |
6 | 1886 | |
0 | 1676 | 6.8% |
9 | 1631 | 6.6% |
7 | 1611 | 6.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20597 | |
Other Punctuation | 4202 | 16.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 4343 | |
3 | 2053 | |
2 | 1994 | |
4 | 1992 | |
5 | 1886 | |
6 | 1886 | |
0 | 1676 | 8.1% |
9 | 1631 | 7.9% |
7 | 1611 | 7.8% |
8 | 1525 | 7.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 4202 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24799 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 4343 | |
, | 4202 | |
3 | 2053 | |
2 | 1994 | |
4 | 1992 | |
5 | 1886 | |
6 | 1886 | |
0 | 1676 | 6.8% |
9 | 1631 | 6.6% |
7 | 1611 | 6.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24799 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 4343 | |
, | 4202 | |
3 | 2053 | |
2 | 1994 | |
4 | 1992 | |
5 | 1886 | |
6 | 1886 | |
0 | 1676 | 6.8% |
9 | 1631 | 6.6% |
7 | 1611 | 6.5% |
NTC_ID
Text
Distinct | 4739 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 37.2 KiB |
Value | Count | Frequency (%) |
1,834 | 3 | 0.1% |
1,693 | 2 | < 0.1% |
1,698 | 2 | < 0.1% |
3,028 | 2 | < 0.1% |
1,696 | 2 | < 0.1% |
1,106 | 2 | < 0.1% |
1,879 | 2 | < 0.1% |
1,013 | 2 | < 0.1% |
4,862 | 2 | < 0.1% |
2,521 | 1 | < 0.1% |
Other values (4729) | 4729 |
Most occurring characters
Value | Count | Frequency (%) |
, | 3944 | |
4 | 2400 | |
1 | 2376 | |
3 | 2370 | |
2 | 2359 | |
5 | 1543 | 7.0% |
9 | 1442 | 6.5% |
8 | 1412 | 6.4% |
0 | 1412 | 6.4% |
7 | 1400 | 6.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 18101 | |
Other Punctuation | 3944 | 17.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
4 | 2400 | |
1 | 2376 | |
3 | 2370 | |
2 | 2359 | |
5 | 1543 | |
9 | 1442 | |
8 | 1412 | |
0 | 1412 | |
7 | 1400 | |
6 | 1387 |
Other Punctuation
Value | Count | Frequency (%) |
, | 3944 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 22045 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
, | 3944 | |
4 | 2400 | |
1 | 2376 | |
3 | 2370 | |
2 | 2359 | |
5 | 1543 | 7.0% |
9 | 1442 | 6.5% |
8 | 1412 | 6.4% |
0 | 1412 | 6.4% |
7 | 1400 | 6.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 22045 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 3944 | |
4 | 2400 | |
1 | 2376 | |
3 | 2370 | |
2 | 2359 | |
5 | 1543 | 7.0% |
9 | 1442 | 6.5% |
8 | 1412 | 6.4% |
0 | 1412 | 6.4% |
7 | 1400 | 6.4% |
NTC_CODE
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 37.2 KiB |
005_002 | |
---|---|
005_001 | |
005_005 | |
005_006 | |
005_004 | 152 |
Other values (20) |
Length
Max length | 9 |
---|---|
Median length | 7 |
Mean length | 7.0593809 |
Min length | 7 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 005_002 |
---|---|
2nd row | 005_002 |
3rd row | 005_002 |
4th row | 005_002 |
5th row | 005_002 |
Common Values
Value | Count | Frequency (%) |
005_002 | 2006 | |
005_001 | 1120 | |
005_005 | 359 | 7.6% |
005_006 | 261 | 5.5% |
005_004 | 152 | 3.2% |
005_001_1 | 139 | 2.9% |
005_003 | 117 | 2.5% |
005_010 | 85 | 1.8% |
005_102 | 83 | 1.7% |
005_027 | 80 | 1.7% |
Other values (15) | 347 | 7.3% |
Length
Value | Count | Frequency (%) |
005_002 | 2006 | |
005_001 | 1122 | |
005_005 | 359 | 7.6% |
005_006 | 261 | 5.5% |
005_004 | 152 | 3.2% |
005_001_1 | 139 | 2.9% |
005_003 | 117 | 2.5% |
005_010 | 85 | 1.8% |
005_102 | 83 | 1.7% |
005_027 | 80 | 1.7% |
Other values (14) | 345 | 7.3% |
NTC_IDX | NTC_ID | NTC_CODE | |
---|---|---|---|
0 | 358 | 449 | 005_002 |
1 | 359 | 450 | 005_002 |
2 | 360 | 452 | 005_002 |
3 | 361 | 451 | 005_002 |
4 | 362 | 454 | 005_002 |
5 | 363 | 455 | 005_003 |
6 | 364 | 456 | 005_002 |
7 | 365 | 457 | 005_002 |
8 | 366 | 460 | 005_002 |
9 | 367 | 459 | 005_002 |
NTC_IDX | NTC_ID | NTC_CODE | |
---|---|---|---|
4739 | 16,320 | 5,058 | 005_002 |
4740 | 16,322 | 5,059 | 005_002 |
4741 | 16,357 | 5,063 | 005_002 |
4742 | 16,463 | 5,081 | 005_002 |
4743 | 16,420 | 5,077 | 005_002 |
4744 | 16,439 | 5,080 | 005_002 |
4745 | 16,578 | 5,124 | 005_002 |
4746 | 16,464 | 5,084 | 005_002 |
4747 | 16,584 | 5,128 | 005_002 |
4748 | 16,462 | 5,083 | 005_002 |