Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 100 |
Missing cells | 100 |
Missing cells (%) | 25.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.4 KiB |
Average record size in memory | 34.3 B |
Variable types
Text | 3 |
---|---|
Unsupported | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국과학기술원 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=8dece210-33ae-11eb-8f72-932712f5aa3c |
Reproduction
Analysis started | 2023-12-10 10:16:08.880020 |
---|---|
Analysis finished | 2023-12-10 10:16:09.919332 |
Duration | 1.04 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
장소 아이디
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 24 |
---|---|
Median length | 24 |
Mean length | 24 |
Min length | 24 |
Characters and Unicode
Total characters | 2400 |
---|---|
Distinct characters | 16 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 5f28cf7ea42fcd6d828e0137 |
---|---|
2nd row | 5f28d167a42fcd6d828e013b |
3rd row | 5f28db25ee39d47796f08a01 |
4th row | 5f28dc6eee39d47796f08a05 |
5th row | 5f28f15b4de0bb7ebabe6db8 |
Value | Count | Frequency (%) |
5f28cf7ea42fcd6d828e0137 | 1 | 1.0% |
5f929cf7e687616c801a19be | 1 | 1.0% |
5f92a0e1d3522b13c574bdd7 | 1 | 1.0% |
5f92a0e1d3522b13c574bdd3 | 1 | 1.0% |
5f92a0e0d3522b13c574bdcf | 1 | 1.0% |
5f92a0e0d3522b13c574bdcb | 1 | 1.0% |
5f92a0dfd3522b13c574bd75 | 1 | 1.0% |
5f92a0dbd3522b13c574bd71 | 1 | 1.0% |
5f92a0d8d3522b13c574bd6d | 1 | 1.0% |
5f92a0d2d3522b13c574bd3d | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 224 | 9.3% |
7 | 206 | 8.6% |
f | 191 | 8.0% |
d | 173 | 7.2% |
1 | 170 | 7.1% |
e | 168 | 7.0% |
2 | 155 | 6.5% |
8 | 144 | 6.0% |
9 | 138 | 5.8% |
6 | 134 | 5.6% |
Other values (6) | 697 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1508 | |
Lowercase Letter | 892 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 224 | |
7 | 206 | |
1 | 170 | |
2 | 155 | |
8 | 144 | |
9 | 138 | |
6 | 134 | |
3 | 119 | |
4 | 115 | |
0 | 103 |
Lowercase Letter
Value | Count | Frequency (%) |
f | 191 | |
d | 173 | |
e | 168 | |
b | 133 | |
c | 126 | |
a | 101 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1508 | |
Latin | 892 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 224 | |
7 | 206 | |
1 | 170 | |
2 | 155 | |
8 | 144 | |
9 | 138 | |
6 | 134 | |
3 | 119 | |
4 | 115 | |
0 | 103 |
Latin
Value | Count | Frequency (%) |
f | 191 | |
d | 173 | |
e | 168 | |
b | 133 | |
c | 126 | |
a | 101 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 224 | 9.3% |
7 | 206 | 8.6% |
f | 191 | 8.0% |
d | 173 | 7.2% |
1 | 170 | 7.1% |
e | 168 | 7.0% |
2 | 155 | 6.5% |
8 | 144 | 6.0% |
9 | 138 | 5.8% |
6 | 134 | 5.6% |
Other values (6) | 697 |
장소 타입
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
장소 이름
Text
Distinct | 80 |
---|---|
Distinct (%) | 80.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 6.76 |
Min length | 2 |
Characters and Unicode
Total characters | 676 |
---|---|
Distinct characters | 47 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 73 ? |
---|---|
Unique (%) | 73.0% |
Sample
1st row | 100동 100호 |
---|---|
2nd row | 7208동 1204호 |
3rd row | 205동 604호 |
4th row | 316동 2304호 |
5th row | 105동 2101호 |
Value | Count | Frequency (%) |
301 | 8 | 5.8% |
123 | 5 | 3.6% |
107동 | 5 | 3.6% |
111 | 4 | 2.9% |
102동 | 4 | 2.9% |
1111 | 3 | 2.2% |
304 | 3 | 2.2% |
604호 | 2 | 1.4% |
1층 | 2 | 1.4% |
106동 | 2 | 1.4% |
Other values (93) | 100 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 167 | |
0 | 136 | |
2 | 60 | 8.9% |
3 | 49 | 7.2% |
38 | 5.6% | |
호 | 36 | 5.3% |
동 | 35 | 5.2% |
4 | 34 | 5.0% |
& | 22 | 3.3% |
5 | 18 | 2.7% |
Other values (37) | 81 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 500 | |
Other Letter | 106 | 15.7% |
Space Separator | 38 | 5.6% |
Other Punctuation | 22 | 3.3% |
Dash Punctuation | 4 | 0.6% |
Lowercase Letter | 3 | 0.4% |
Uppercase Letter | 2 | 0.3% |
Connector Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 36 | |
동 | 35 | |
층 | 3 | 2.8% |
이 | 3 | 2.8% |
어 | 3 | 2.8% |
집 | 2 | 1.9% |
린 | 2 | 1.9% |
학 | 2 | 1.9% |
님 | 1 | 0.9% |
방 | 1 | 0.9% |
Other values (18) | 18 |
Decimal Number
Value | Count | Frequency (%) |
1 | 167 | |
0 | 136 | |
2 | 60 | 12.0% |
3 | 49 | 9.8% |
4 | 34 | 6.8% |
5 | 18 | 3.6% |
7 | 17 | 3.4% |
8 | 9 | 1.8% |
6 | 7 | 1.4% |
9 | 3 | 0.6% |
Lowercase Letter
Value | Count | Frequency (%) |
f | 1 | |
d | 1 | |
a | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1 | |
A | 1 |
Space Separator
Value | Count | Frequency (%) |
38 |
Other Punctuation
Value | Count | Frequency (%) |
& | 22 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 565 | |
Hangul | 106 | 15.7% |
Latin | 5 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 36 | |
동 | 35 | |
층 | 3 | 2.8% |
이 | 3 | 2.8% |
어 | 3 | 2.8% |
집 | 2 | 1.9% |
린 | 2 | 1.9% |
학 | 2 | 1.9% |
님 | 1 | 0.9% |
방 | 1 | 0.9% |
Other values (18) | 18 |
Common
Value | Count | Frequency (%) |
1 | 167 | |
0 | 136 | |
2 | 60 | 10.6% |
3 | 49 | 8.7% |
38 | 6.7% | |
4 | 34 | 6.0% |
& | 22 | 3.9% |
5 | 18 | 3.2% |
7 | 17 | 3.0% |
8 | 9 | 1.6% |
Other values (4) | 15 | 2.7% |
Latin
Value | Count | Frequency (%) |
B | 1 | |
f | 1 | |
d | 1 | |
a | 1 | |
A | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 570 | |
Hangul | 101 | 14.9% |
Compat Jamo | 5 | 0.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 167 | |
0 | 136 | |
2 | 60 | 10.5% |
3 | 49 | 8.6% |
38 | 6.7% | |
4 | 34 | 6.0% |
& | 22 | 3.9% |
5 | 18 | 3.2% |
7 | 17 | 3.0% |
8 | 9 | 1.6% |
Other values (9) | 20 | 3.5% |
Hangul
Value | Count | Frequency (%) |
호 | 36 | |
동 | 35 | |
층 | 3 | 3.0% |
이 | 3 | 3.0% |
어 | 3 | 3.0% |
집 | 2 | 2.0% |
린 | 2 | 2.0% |
학 | 2 | 2.0% |
님 | 1 | 1.0% |
방 | 1 | 1.0% |
Other values (13) | 13 | 12.9% |
Compat Jamo
Value | Count | Frequency (%) |
ㅂ | 1 | |
ㄹ | 1 | |
ㅇ | 1 | |
ㄴ | 1 | |
ㅁ | 1 |
건물 아이디
Text
Distinct | 51 |
---|---|
Distinct (%) | 51.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 24 |
---|---|
Median length | 24 |
Mean length | 24 |
Min length | 24 |
Characters and Unicode
Total characters | 2400 |
---|---|
Distinct characters | 16 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 38 ? |
---|---|
Unique (%) | 38.0% |
Sample
1st row | 5f06a67b153b58ec1eb3ea5b |
---|---|
2nd row | 5f06a1b2153b58ec1eb3ea29 |
3rd row | 5f06a67b153b58ec1eb3ea5b |
4th row | 5f06a67b153b58ec1eb3ea5b |
5th row | 5f06a658153b58ec1eb3ea59 |
Value | Count | Frequency (%) |
5f77ed1f39518d739336219b | 22 | |
5f92a0d2d3522b13c574bd3c | 8 | 8.0% |
5f2b9e80fcc931639f20d589 | 6 | 6.0% |
5f067dd8153b58ec1eb3ea12 | 4 | 4.0% |
5f06a4e9153b58ec1eb3ea4d | 3 | 3.0% |
5f8d465df61d2e318b09fcf5 | 3 | 3.0% |
5f4eeeaf706250145618bb53 | 3 | 3.0% |
5f06a67b153b58ec1eb3ea5b | 3 | 3.0% |
5f06a658153b58ec1eb3ea59 | 2 | 2.0% |
5f06a478153b58ec1eb3ea47 | 2 | 2.0% |
Other values (41) | 44 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 282 | |
3 | 236 | 9.8% |
1 | 205 | 8.5% |
f | 183 | 7.6% |
e | 174 | 7.2% |
b | 162 | 6.8% |
9 | 157 | 6.5% |
8 | 136 | 5.7% |
7 | 134 | 5.6% |
d | 133 | 5.5% |
Other values (6) | 598 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1547 | |
Lowercase Letter | 853 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 282 | |
3 | 236 | |
1 | 205 | |
9 | 157 | |
8 | 136 | |
7 | 134 | |
2 | 122 | |
6 | 113 | |
0 | 93 | 6.0% |
4 | 69 | 4.5% |
Lowercase Letter
Value | Count | Frequency (%) |
f | 183 | |
e | 174 | |
b | 162 | |
d | 133 | |
a | 103 | |
c | 98 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1547 | |
Latin | 853 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 282 | |
3 | 236 | |
1 | 205 | |
9 | 157 | |
8 | 136 | |
7 | 134 | |
2 | 122 | |
6 | 113 | |
0 | 93 | 6.0% |
4 | 69 | 4.5% |
Latin
Value | Count | Frequency (%) |
f | 183 | |
e | 174 | |
b | 162 | |
d | 133 | |
a | 103 | |
c | 98 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 282 | |
3 | 236 | 9.8% |
1 | 205 | 8.5% |
f | 183 | 7.6% |
e | 174 | 7.2% |
b | 162 | 6.8% |
9 | 157 | 6.5% |
8 | 136 | 5.7% |
7 | 134 | 5.6% |
d | 133 | 5.5% |
Other values (6) | 598 |
장소 아이디 | 장소 이름 | 건물 아이디 | |
---|---|---|---|
장소 아이디 | 1.000 | 1.000 | 1.000 |
장소 이름 | 1.000 | 1.000 | 0.987 |
건물 아이디 | 1.000 | 0.987 | 1.000 |
장소 아이디 | 장소 타입 | 장소 이름 | 건물 아이디 | |
---|---|---|---|---|
0 | 5f28cf7ea42fcd6d828e0137 | <NA> | 100동 100호 | 5f06a67b153b58ec1eb3ea5b |
1 | 5f28d167a42fcd6d828e013b | <NA> | 7208동 1204호 | 5f06a1b2153b58ec1eb3ea29 |
2 | 5f28db25ee39d47796f08a01 | <NA> | 205동 604호 | 5f06a67b153b58ec1eb3ea5b |
3 | 5f28dc6eee39d47796f08a05 | <NA> | 316동 2304호 | 5f06a67b153b58ec1eb3ea5b |
4 | 5f28f15b4de0bb7ebabe6db8 | <NA> | 105동 2101호 | 5f06a658153b58ec1eb3ea59 |
5 | 5f28f1bb4de0bb7ebabe6dbc | <NA> | 107동 503호 | 5f06a658153b58ec1eb3ea59 |
6 | 5f28f21a4de0bb7ebabe6dc0 | <NA> | A동 402호 | 5f0844b80e917c7f792a2df4 |
7 | 5f28f3324de0bb7ebabe6dc6 | <NA> | 2413동 401호 | 5f06a5b2153b58ec1eb3ea53 |
8 | 5f28f3804de0bb7ebabe6dca | <NA> | 102동 301호 | 5f06a5d8153b58ec1eb3ea55 |
9 | 5f28f3f44de0bb7ebabe6dcd | <NA> | 102동 1004호 | 5f06a627153b58ec1eb3ea57 |
장소 아이디 | 장소 타입 | 장소 이름 | 건물 아이디 | |
---|---|---|---|---|
90 | 5f962e189dc73747d6e71c91 | <NA> | 102&1301 | 5f77ed1f39518d739336219b |
91 | 5f962e189dc73747d6e71c94 | <NA> | 102&301 | 5f77ed1f39518d739336219b |
92 | 5f962e189dc73747d6e71c97 | <NA> | 102&1002 | 5f77ed1f39518d739336219b |
93 | 5f962e189dc73747d6e71c9a | <NA> | 101&702 | 5f77ed1f39518d739336219b |
94 | 5f962e189dc73747d6e71c9d | <NA> | 101&1205 | 5f77ed1f39518d739336219b |
95 | 5f962e189dc73747d6e71ca0 | <NA> | 103&704 | 5f77ed1f39518d739336219b |
96 | 5f962e189dc73747d6e71ca3 | <NA> | 101&1204 | 5f77ed1f39518d739336219b |
97 | 5f962e189dc73747d6e71ca6 | <NA> | 101&701 | 5f77ed1f39518d739336219b |
98 | 5f962e189dc73747d6e71cb0 | <NA> | 102&401 | 5f77ed1f39518d739336219b |
99 | 5f962e189dc73747d6e71cb3 | <NA> | 103&805 | 5f77ed1f39518d739336219b |