Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 606 |
Missing cells | 759 |
Missing cells (%) | 10.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 59.9 KiB |
Average record size in memory | 101.2 B |
Variable types
Numeric | 4 |
---|---|
Text | 5 |
Categorical | 3 |
Dataset
Description | 연번,책방 이름,구 코드,구 이름,주소,전화번호,홈페이지 url,책방 구분,책방 구분명,위도,경도,SNS url |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-21062/S/1/datasetView.do |
책방 구분 is highly overall correlated with 연번 and 1 other fields | High correlation |
책방 구분명 is highly overall correlated with 연번 and 1 other fields | High correlation |
연번 is highly overall correlated with 책방 구분 and 1 other fields | High correlation |
구 코드 is highly overall correlated with 위도 and 1 other fields | High correlation |
위도 is highly overall correlated with 구 코드 and 1 other fields | High correlation |
경도 is highly overall correlated with 구 이름 | High correlation |
구 이름 is highly overall correlated with 구 코드 and 2 other fields | High correlation |
전화번호 has 50 (8.3%) missing values | Missing |
홈페이지 url has 368 (60.7%) missing values | Missing |
SNS url has 341 (56.3%) missing values | Missing |
연번 has unique values | Unique |
Reproduction
Analysis started | 2024-05-11 07:07:14.173667 |
---|---|
Analysis finished | 2024-05-11 07:07:16.699701 |
Duration | 2.53 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 606 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2670.9967 |
Minimum | 2283 |
---|---|
Maximum | 3402 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 2283 |
---|---|
5-th percentile | 2317.25 |
Q1 | 2448.25 |
median | 2622.5 |
Q3 | 2793.75 |
95-th percentile | 3276 |
Maximum | 3402 |
Range | 1119 |
Interquartile range (IQR) | 345.5 |
Descriptive statistics
Standard deviation | 281.47196 |
---|---|
Coefficient of variation (CV) | 0.10538087 |
Kurtosis | -0.04619084 |
Mean | 2670.9967 |
Median Absolute Deviation (MAD) | 173 |
Skewness | 0.83396452 |
Sum | 1618624 |
Variance | 79226.466 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2283 | 1 | 0.2% |
2578 | 1 | 0.2% |
2580 | 1 | 0.2% |
2581 | 1 | 0.2% |
2582 | 1 | 0.2% |
2586 | 1 | 0.2% |
2587 | 1 | 0.2% |
3347 | 1 | 0.2% |
2588 | 1 | 0.2% |
2589 | 1 | 0.2% |
Other values (596) | 596 |
Value | Count | Frequency (%) |
2283 | 1 | |
2284 | 1 | |
2285 | 1 | |
2286 | 1 | |
2287 | 1 | |
2288 | 1 | |
2289 | 1 | |
2290 | 1 | |
2293 | 1 | |
2294 | 1 |
Value | Count | Frequency (%) |
3402 | 1 | |
3401 | 1 | |
3384 | 1 | |
3383 | 1 | |
3382 | 1 | |
3381 | 1 | |
3361 | 1 | |
3350 | 1 | |
3349 | 1 | |
3348 | 1 |
책방 이름
Text
Distinct | 591 |
---|---|
Distinct (%) | 97.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Value | Count | Frequency (%) |
알라딘 | 17 | 2.3% |
책방 | 11 | 1.5% |
서점 | 8 | 1.1% |
바로드림센터 | 6 | 0.8% |
더북스 | 4 | 0.5% |
동아서점 | 3 | 0.4% |
서재 | 3 | 0.4% |
노원문고 | 3 | 0.4% |
그림책방 | 3 | 0.4% |
문화서점 | 3 | 0.4% |
Other values (662) | 679 |
Most occurring characters
Value | Count | Frequency (%) |
서 | 230 | 6.4% |
점 | 210 | 5.9% |
문 | 140 | 3.9% |
고 | 140 | 3.9% |
137 | 3.8% | |
책 | 118 | 3.3% |
스 | 82 | 2.3% |
) | 79 | 2.2% |
( | 79 | 2.2% |
방 | 77 | 2.2% |
Other values (441) | 2287 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3192 | |
Space Separator | 137 | 3.8% |
Close Punctuation | 79 | 2.2% |
Open Punctuation | 79 | 2.2% |
Lowercase Letter | 34 | 0.9% |
Uppercase Letter | 34 | 0.9% |
Decimal Number | 22 | 0.6% |
Other Punctuation | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 230 | 7.2% |
점 | 210 | 6.6% |
문 | 140 | 4.4% |
고 | 140 | 4.4% |
책 | 118 | 3.7% |
스 | 82 | 2.6% |
방 | 77 | 2.4% |
북 | 62 | 1.9% |
이 | 44 | 1.4% |
적 | 35 | 1.1% |
Other values (401) | 2054 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 4 | |
S | 4 | |
C | 4 | |
T | 4 | |
I | 3 | |
N | 2 | 5.9% |
B | 2 | 5.9% |
A | 2 | 5.9% |
Y | 2 | 5.9% |
E | 2 | 5.9% |
Other values (4) | 5 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 9 | |
a | 4 | |
e | 3 | 8.8% |
t | 3 | 8.8% |
f | 3 | 8.8% |
n | 3 | 8.8% |
l | 2 | 5.9% |
k | 2 | 5.9% |
w | 1 | 2.9% |
b | 1 | 2.9% |
Other values (3) | 3 | 8.8% |
Decimal Number
Value | Count | Frequency (%) |
4 | 5 | |
1 | 5 | |
2 | 4 | |
7 | 3 | |
9 | 2 | 9.1% |
8 | 1 | 4.5% |
5 | 1 | 4.5% |
3 | 1 | 4.5% |
Space Separator
Value | Count | Frequency (%) |
137 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Other Punctuation
Value | Count | Frequency (%) |
& | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3192 | |
Common | 319 | 8.9% |
Latin | 68 | 1.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 230 | 7.2% |
점 | 210 | 6.6% |
문 | 140 | 4.4% |
고 | 140 | 4.4% |
책 | 118 | 3.7% |
스 | 82 | 2.6% |
방 | 77 | 2.4% |
북 | 62 | 1.9% |
이 | 44 | 1.4% |
적 | 35 | 1.1% |
Other values (401) | 2054 |
Latin
Value | Count | Frequency (%) |
o | 9 | 13.2% |
a | 4 | 5.9% |
P | 4 | 5.9% |
S | 4 | 5.9% |
C | 4 | 5.9% |
T | 4 | 5.9% |
I | 3 | 4.4% |
e | 3 | 4.4% |
t | 3 | 4.4% |
f | 3 | 4.4% |
Other values (17) | 27 |
Common
Value | Count | Frequency (%) |
137 | ||
) | 79 | |
( | 79 | |
4 | 5 | 1.6% |
1 | 5 | 1.6% |
2 | 4 | 1.3% |
7 | 3 | 0.9% |
9 | 2 | 0.6% |
8 | 1 | 0.3% |
5 | 1 | 0.3% |
Other values (3) | 3 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3192 | |
ASCII | 387 | 10.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
서 | 230 | 7.2% |
점 | 210 | 6.6% |
문 | 140 | 4.4% |
고 | 140 | 4.4% |
책 | 118 | 3.7% |
스 | 82 | 2.6% |
방 | 77 | 2.4% |
북 | 62 | 1.9% |
이 | 44 | 1.4% |
적 | 35 | 1.1% |
Other values (401) | 2054 |
ASCII
Value | Count | Frequency (%) |
137 | ||
) | 79 | |
( | 79 | |
o | 9 | 2.3% |
4 | 5 | 1.3% |
1 | 5 | 1.3% |
a | 4 | 1.0% |
P | 4 | 1.0% |
S | 4 | 1.0% |
2 | 4 | 1.0% |
Other values (30) | 57 |
구 코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 4.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 311.50495 |
Minimum | 300 |
---|---|
Maximum | 324 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 300 |
---|---|
5-th percentile | 300 |
Q1 | 304 |
median | 313 |
Q3 | 318 |
95-th percentile | 323 |
Maximum | 324 |
Range | 24 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 7.5459458 |
---|---|
Coefficient of variation (CV) | 0.02422416 |
Kurtosis | -1.187591 |
Mean | 311.50495 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.09692884 |
Sum | 188772 |
Variance | 56.941298 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
313 | 73 | 12.0% |
300 | 66 | 10.9% |
301 | 35 | 5.8% |
320 | 33 | 5.4% |
322 | 31 | 5.1% |
315 | 30 | 5.0% |
312 | 24 | 4.0% |
310 | 24 | 4.0% |
307 | 23 | 3.8% |
323 | 23 | 3.8% |
Other values (15) | 244 |
Value | Count | Frequency (%) |
300 | 66 | |
301 | 35 | |
302 | 21 | 3.5% |
303 | 17 | 2.8% |
304 | 15 | 2.5% |
305 | 8 | 1.3% |
306 | 13 | 2.1% |
307 | 23 | 3.8% |
308 | 11 | 1.8% |
309 | 17 | 2.8% |
Value | Count | Frequency (%) |
324 | 16 | |
323 | 23 | |
322 | 31 | |
321 | 19 | |
320 | 33 | |
319 | 22 | |
318 | 21 | |
317 | 10 | 1.7% |
316 | 12 | 2.0% |
315 | 30 |
구 이름
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 4.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
마포구 | |
---|---|
종로구 | |
중구 | 35 |
관악구 | 33 |
강남구 | 31 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.029703 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 마포구 |
---|---|
2nd row | 강남구 |
3rd row | 강남구 |
4th row | 용산구 |
5th row | 동작구 |
Common Values
Value | Count | Frequency (%) |
마포구 | 73 | 12.0% |
종로구 | 66 | 10.9% |
중구 | 35 | 5.8% |
관악구 | 33 | 5.4% |
강남구 | 31 | 5.1% |
강서구 | 30 | 5.0% |
서대문구 | 24 | 4.0% |
노원구 | 24 | 4.0% |
성북구 | 23 | 3.8% |
송파구 | 23 | 3.8% |
Other values (15) | 244 |
Length
Value | Count | Frequency (%) |
마포구 | 73 | 12.0% |
종로구 | 66 | 10.9% |
중구 | 35 | 5.8% |
관악구 | 33 | 5.4% |
강남구 | 31 | 5.1% |
강서구 | 30 | 5.0% |
서대문구 | 24 | 4.0% |
노원구 | 24 | 4.0% |
성북구 | 23 | 3.8% |
송파구 | 23 | 3.8% |
Other values (15) | 244 |
주소
Text
Distinct | 600 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Length
Max length | 43 |
---|---|
Median length | 32 |
Mean length | 16.379538 |
Min length | 6 |
Characters and Unicode
Total characters | 9926 |
---|---|
Distinct characters | 346 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 594 ? |
---|---|
Unique (%) | 98.0% |
Sample
1st row | 마포구 동교로 194 혜원빌딩 |
---|---|
2nd row | 강남구 남부순환로359길 31 |
3rd row | 강남구 남부순환로 2806 군인공제회관 지하1층 |
4th row | 용산구 녹사평대로 208 |
5th row | 동작구 만양로1길 1 1층 |
Value | Count | Frequency (%) |
1층 | 75 | 3.2% |
지하1층 | 69 | 2.9% |
종로구 | 62 | 2.6% |
지하 | 60 | 2.5% |
2층 | 60 | 2.5% |
마포구 | 57 | 2.4% |
중구 | 34 | 1.4% |
관악구 | 30 | 1.3% |
강남구 | 28 | 1.2% |
강서구 | 27 | 1.1% |
Other values (1070) | 1868 |
Most occurring characters
Value | Count | Frequency (%) |
1783 | 18.0% | |
로 | 649 | 6.5% |
1 | 641 | 6.5% |
구 | 575 | 5.8% |
2 | 376 | 3.8% |
길 | 287 | 2.9% |
3 | 275 | 2.8% |
층 | 249 | 2.5% |
4 | 223 | 2.2% |
5 | 195 | 2.0% |
Other values (336) | 4673 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 5496 | |
Decimal Number | 2468 | |
Space Separator | 1783 | 18.0% |
Dash Punctuation | 99 | 1.0% |
Uppercase Letter | 33 | 0.3% |
Other Punctuation | 22 | 0.2% |
Lowercase Letter | 15 | 0.2% |
Open Punctuation | 4 | < 0.1% |
Close Punctuation | 4 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 649 | 11.8% |
구 | 575 | 10.5% |
길 | 287 | 5.2% |
층 | 249 | 4.5% |
하 | 150 | 2.7% |
지 | 145 | 2.6% |
동 | 126 | 2.3% |
대 | 106 | 1.9% |
강 | 103 | 1.9% |
서 | 102 | 1.9% |
Other values (300) | 3004 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 8 | |
A | 6 | |
C | 4 | |
Y | 2 | 6.1% |
F | 2 | 6.1% |
S | 2 | 6.1% |
M | 2 | 6.1% |
N | 2 | 6.1% |
L | 1 | 3.0% |
I | 1 | 3.0% |
Other values (3) | 3 | 9.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 641 | |
2 | 376 | |
3 | 275 | |
4 | 223 | 9.0% |
5 | 195 | 7.9% |
6 | 174 | 7.1% |
0 | 171 | 6.9% |
7 | 157 | 6.4% |
9 | 129 | 5.2% |
8 | 127 | 5.1% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 3 | |
b | 3 | |
l | 3 | |
m | 2 | |
c | 2 | |
a | 1 | 6.7% |
n | 1 | 6.7% |
Space Separator
Value | Count | Frequency (%) |
1783 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 99 |
Other Punctuation
Value | Count | Frequency (%) |
, | 22 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 5496 | |
Common | 4382 | |
Latin | 48 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 649 | 11.8% |
구 | 575 | 10.5% |
길 | 287 | 5.2% |
층 | 249 | 4.5% |
하 | 150 | 2.7% |
지 | 145 | 2.6% |
동 | 126 | 2.3% |
대 | 106 | 1.9% |
강 | 103 | 1.9% |
서 | 102 | 1.9% |
Other values (300) | 3004 |
Latin
Value | Count | Frequency (%) |
B | 8 | |
A | 6 | |
C | 4 | 8.3% |
e | 3 | 6.2% |
b | 3 | 6.2% |
l | 3 | 6.2% |
Y | 2 | 4.2% |
m | 2 | 4.2% |
F | 2 | 4.2% |
c | 2 | 4.2% |
Other values (10) | 13 |
Common
Value | Count | Frequency (%) |
1783 | ||
1 | 641 | 14.6% |
2 | 376 | 8.6% |
3 | 275 | 6.3% |
4 | 223 | 5.1% |
5 | 195 | 4.5% |
6 | 174 | 4.0% |
0 | 171 | 3.9% |
7 | 157 | 3.6% |
9 | 129 | 2.9% |
Other values (6) | 258 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 5496 | |
ASCII | 4430 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1783 | ||
1 | 641 | 14.5% |
2 | 376 | 8.5% |
3 | 275 | 6.2% |
4 | 223 | 5.0% |
5 | 195 | 4.4% |
6 | 174 | 3.9% |
0 | 171 | 3.9% |
7 | 157 | 3.5% |
9 | 129 | 2.9% |
Other values (26) | 306 | 6.9% |
Hangul
Value | Count | Frequency (%) |
로 | 649 | 11.8% |
구 | 575 | 10.5% |
길 | 287 | 5.2% |
층 | 249 | 4.5% |
하 | 150 | 2.7% |
지 | 145 | 2.6% |
동 | 126 | 2.3% |
대 | 106 | 1.9% |
강 | 103 | 1.9% |
서 | 102 | 1.9% |
Other values (300) | 3004 |
전화번호
Text
MISSING
 
Distinct | 521 |
---|---|
Distinct (%) | 93.7% |
Missing | 50 |
Missing (%) | 8.3% |
Memory size | 4.9 KiB |
Length
Max length | 15 |
---|---|
Median length | 14 |
Mean length | 11.861511 |
Min length | 1 |
Characters and Unicode
Total characters | 6595 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 516 ? |
---|---|
Unique (%) | 92.8% |
Sample
1st row | 02-325-1984 |
---|---|
2nd row | 02-3463-1880 |
3rd row | 02-2190-2178 |
4th row | 02-793-8249 |
5th row | 070-4177-0021 |
Value | Count | Frequency (%) |
1544-2514 | 18 | 3.3% |
1544-1900 | 15 | 2.7% |
070-4070-0204 | 2 | 0.4% |
0507-1305-5475 | 2 | 0.4% |
02-944-2651 | 1 | 0.2% |
02-374-8917 | 1 | 0.2% |
0507-1413-4144 | 1 | 0.2% |
02-3411-3215 | 1 | 0.2% |
02-529-5949 | 1 | 0.2% |
0507-1414-6922 | 1 | 0.2% |
Other values (510) | 510 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1072 | |
- | 1071 | |
2 | 894 | |
1 | 540 | |
4 | 525 | |
7 | 516 | |
5 | 496 | |
3 | 453 | |
8 | 342 | 5.2% |
9 | 329 | 5.0% |
Other values (2) | 357 | 5.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5491 | |
Dash Punctuation | 1071 | 16.2% |
Space Separator | 33 | 0.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 1072 | |
2 | 894 | |
1 | 540 | |
4 | 525 | |
7 | 516 | |
5 | 496 | |
3 | 453 | |
8 | 342 | 6.2% |
9 | 329 | 6.0% |
6 | 324 | 5.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1071 |
Space Separator
Value | Count | Frequency (%) |
33 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 6595 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 1072 | |
- | 1071 | |
2 | 894 | |
1 | 540 | |
4 | 525 | |
7 | 516 | |
5 | 496 | |
3 | 453 | |
8 | 342 | 5.2% |
9 | 329 | 5.0% |
Other values (2) | 357 | 5.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 6595 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1072 | |
- | 1071 | |
2 | 894 | |
1 | 540 | |
4 | 525 | |
7 | 516 | |
5 | 496 | |
3 | 453 | |
8 | 342 | 5.2% |
9 | 329 | 5.0% |
Other values (2) | 357 | 5.4% |
홈페이지 url
Text
MISSING
 
Distinct | 231 |
---|---|
Distinct (%) | 97.1% |
Missing | 368 |
Missing (%) | 60.7% |
Memory size | 4.9 KiB |
Length
Max length | 99 |
---|---|
Median length | 46 |
Mean length | 31.554622 |
Min length | 13 |
Characters and Unicode
Total characters | 7510 |
---|---|
Distinct characters | 67 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 226 ? |
---|---|
Unique (%) | 95.0% |
Sample
1st row | https://prntseoul.com |
---|---|
2nd row | http://77page.com |
3rd row | https://blog.naver.com/dnfladydgk |
4th row | https://blog.naver.com/experiencelibrary |
5th row | http://www.bookgore.com/ |
Value | Count | Frequency (%) |
http://www.storagebookandfilm.com | 3 | 1.3% |
http://www.nowonbook.com | 3 | 1.3% |
https://www.the-ref.kr | 2 | 0.8% |
https://smartstore.naver.com/kenektidxbookstore | 2 | 0.8% |
https://www.bookslibro.com | 2 | 0.8% |
http://blog.naver.com/now_afterbooks | 1 | 0.4% |
http://www.eraebook.co.kr | 1 | 0.4% |
https://www.frederic.co.kr | 1 | 0.4% |
http://hululuc.com | 1 | 0.4% |
http://www.2sangbook.com | 1 | 0.4% |
Other values (222) | 222 |
Most occurring characters
Value | Count | Frequency (%) |
o | 810 | 10.8% |
t | 660 | 8.8% |
/ | 637 | 8.5% |
. | 477 | 6.4% |
s | 412 | 5.5% |
r | 350 | 4.7% |
e | 346 | 4.6% |
h | 343 | 4.6% |
a | 336 | 4.5% |
p | 300 | 4.0% |
Other values (57) | 2839 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5925 | |
Other Punctuation | 1370 | 18.2% |
Decimal Number | 107 | 1.4% |
Uppercase Letter | 61 | 0.8% |
Connector Punctuation | 26 | 0.3% |
Dash Punctuation | 15 | 0.2% |
Space Separator | 4 | 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 810 | |
t | 660 | 11.1% |
s | 412 | 7.0% |
r | 350 | 5.9% |
e | 346 | 5.8% |
h | 343 | 5.8% |
a | 336 | 5.7% |
p | 300 | 5.1% |
m | 288 | 4.9% |
c | 283 | 4.8% |
Other values (16) | 1797 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 8 | |
A | 7 | |
B | 6 | 9.8% |
E | 6 | 9.8% |
R | 4 | 6.6% |
D | 4 | 6.6% |
X | 3 | 4.9% |
J | 3 | 4.9% |
N | 3 | 4.9% |
W | 2 | 3.3% |
Other values (11) | 15 |
Decimal Number
Value | Count | Frequency (%) |
1 | 19 | |
2 | 14 | |
0 | 12 | |
9 | 12 | |
4 | 11 | |
5 | 10 | |
8 | 8 | |
3 | 8 | |
7 | 7 | 6.5% |
6 | 6 | 5.6% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 637 | |
. | 477 | |
: | 237 | 17.3% |
% | 15 | 1.1% |
@ | 2 | 0.1% |
? | 2 | 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 26 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 15 |
Space Separator
Value | Count | Frequency (%) |
4 |
Math Symbol
Value | Count | Frequency (%) |
= | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 5986 | |
Common | 1524 | 20.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 810 | |
t | 660 | 11.0% |
s | 412 | 6.9% |
r | 350 | 5.8% |
e | 346 | 5.8% |
h | 343 | 5.7% |
a | 336 | 5.6% |
p | 300 | 5.0% |
m | 288 | 4.8% |
c | 283 | 4.7% |
Other values (37) | 1858 |
Common
Value | Count | Frequency (%) |
/ | 637 | |
. | 477 | |
: | 237 | 15.6% |
_ | 26 | 1.7% |
1 | 19 | 1.2% |
% | 15 | 1.0% |
- | 15 | 1.0% |
2 | 14 | 0.9% |
0 | 12 | 0.8% |
9 | 12 | 0.8% |
Other values (10) | 60 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7510 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 810 | 10.8% |
t | 660 | 8.8% |
/ | 637 | 8.5% |
. | 477 | 6.4% |
s | 412 | 5.5% |
r | 350 | 4.7% |
e | 346 | 4.6% |
h | 343 | 4.6% |
a | 336 | 4.5% |
p | 300 | 4.0% |
Other values (57) | 2839 |
책방 구분
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
2 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 522 | |
1 | 84 | 13.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 522 | |
1 | 84 | 13.9% |
책방 구분명
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
새책방 | |
---|---|
헌책방 |
Length
Max length | 9 |
---|---|
Median length | 9 |
Mean length | 9 |
Min length | 9 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 새책방 |
---|---|
2nd row | 새책방 |
3rd row | 새책방 |
4th row | 헌책방 |
5th row | 새책방 |
Common Values
Value | Count | Frequency (%) |
새책방 | 522 | |
헌책방 | 84 | 13.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
새책방 | 522 | |
헌책방 | 84 | 13.9% |
위도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 576 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.552184 |
Minimum | 37.44935 |
---|---|
Maximum | 37.684345 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 37.44935 |
---|---|
5-th percentile | 37.477872 |
Q1 | 37.516755 |
median | 37.554805 |
Q3 | 37.578931 |
95-th percentile | 37.645832 |
Maximum | 37.684345 |
Range | 0.2349946 |
Interquartile range (IQR) | 0.062176238 |
Descriptive statistics
Standard deviation | 0.046686331 |
---|---|
Coefficient of variation (CV) | 0.0012432388 |
Kurtosis | -0.14158927 |
Mean | 37.552184 |
Median Absolute Deviation (MAD) | 0.028219123 |
Skewness | 0.22392207 |
Sum | 22756.624 |
Variance | 0.0021796135 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.5694258617472 | 15 | 2.5% |
37.5706568981549 | 7 | 1.2% |
37.5679798569884 | 2 | 0.3% |
37.573776066701 | 2 | 0.3% |
37.5395404996998 | 2 | 0.3% |
37.5409328113161 | 2 | 0.3% |
37.5364764239853 | 2 | 0.3% |
37.5693683779231 | 2 | 0.3% |
37.5717572313412 | 2 | 0.3% |
37.5855682577741 | 2 | 0.3% |
Other values (566) | 568 |
Value | Count | Frequency (%) |
37.4493499425115 | 1 | |
37.4505063344668 | 1 | |
37.4506617287092 | 1 | |
37.455143719434 | 1 | |
37.4566169360635 | 1 | |
37.4595006193821 | 1 | |
37.4609911113644 | 1 | |
37.4645381767935 | 1 | |
37.4686620625239 | 1 | |
37.4689381362592 | 1 |
Value | Count | Frequency (%) |
37.6843445449287 | 1 | |
37.6834512512412 | 1 | |
37.6781841932825 | 1 | |
37.6674429516351 | 1 | |
37.666911617005 | 1 | |
37.6645718853517 | 1 | |
37.6621909006666 | 1 | |
37.6587118738827 | 1 | |
37.6579748616037 | 1 | |
37.6576975981725 | 1 |
경도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 576 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.97949 |
Minimum | 126.80296 |
---|---|
Maximum | 127.17115 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 126.80296 |
---|---|
5-th percentile | 126.8546 |
Q1 | 126.92027 |
median | 126.98038 |
Q3 | 127.03337 |
95-th percentile | 127.10459 |
Maximum | 127.17115 |
Range | 0.36818479 |
Interquartile range (IQR) | 0.11309982 |
Descriptive statistics
Standard deviation | 0.075694348 |
---|---|
Coefficient of variation (CV) | 0.00059611475 |
Kurtosis | -0.56756748 |
Mean | 126.97949 |
Median Absolute Deviation (MAD) | 0.057865394 |
Skewness | 0.11385238 |
Sum | 76949.571 |
Variance | 0.0057296343 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.00804756339 | 15 | 2.5% |
127.006479473351 | 7 | 1.2% |
126.995365114169 | 2 | 0.3% |
127.004180855855 | 2 | 0.3% |
126.949490374862 | 2 | 0.3% |
126.993591556374 | 2 | 0.3% |
126.867389257907 | 2 | 0.3% |
127.004721112823 | 2 | 0.3% |
127.018765856837 | 2 | 0.3% |
127.000601419034 | 2 | 0.3% |
Other values (566) | 568 |
Value | Count | Frequency (%) |
126.80296035927 | 1 | |
126.81297893299 | 1 | |
126.813650452864 | 1 | |
126.822484282875 | 1 | |
126.823249100217 | 1 | |
126.831580783627 | 1 | |
126.831692301904 | 1 | |
126.832970268909 | 1 | |
126.833972867298 | 1 | |
126.834185904309 | 1 |
Value | Count | Frequency (%) |
127.17114514801 | 1 | |
127.155472908009 | 1 | |
127.15511458665 | 1 | |
127.155000732652 | 1 | |
127.153830303625 | 1 | |
127.153137021369 | 1 | |
127.147108511955 | 1 | |
127.146146799638 | 1 | |
127.145304402527 | 1 | |
127.144013803921 | 1 |
SNS url
Text
MISSING
 
Distinct | 263 |
---|---|
Distinct (%) | 99.2% |
Missing | 341 |
Missing (%) | 56.3% |
Memory size | 4.9 KiB |
Length
Max length | 68 |
---|---|
Median length | 47 |
Mean length | 39.060377 |
Min length | 14 |
Characters and Unicode
Total characters | 10351 |
---|---|
Distinct characters | 53 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 261 ? |
---|---|
Unique (%) | 98.5% |
Sample
1st row | https://www.instagram.com/1984store |
---|---|
2nd row | https://www.instagram.com/itaewon_foreign_bookstore |
3rd row | https://www.instagram.com/prntseoul |
4th row | https://www.instagram.com/gaga77page |
5th row | https://www.instagram.com/gamseongingan |
Value | Count | Frequency (%) |
https://www.instagram.com/the_reference_seoul | 2 | 0.8% |
https://www.instagram.com/graphic.fan | 2 | 0.8% |
https://www.instagram.com/kenektidxbookstore | 2 | 0.8% |
https://www.instagram.com/pumpkin_vege_book | 1 | 0.4% |
https://www.instagram.com/the_present_world | 1 | 0.4% |
https://www.instagram.com/kenektid_flagship | 1 | 0.4% |
https://www.instagram.com/ongodangbook | 1 | 0.4% |
https://www.instagram.com/bulon0802 | 1 | 0.4% |
https://www.instagram.com/allornothing_deardark | 1 | 0.4% |
http://www.instagram.com/worldmag.co.kr | 1 | 0.4% |
Other values (253) | 253 |
Most occurring characters
Value | Count | Frequency (%) |
t | 906 | 8.8% |
o | 858 | 8.3% |
/ | 843 | 8.1% |
w | 816 | 7.9% |
s | 756 | 7.3% |
a | 740 | 7.1% |
m | 600 | 5.8% |
. | 573 | 5.5% |
n | 454 | 4.4% |
i | 413 | 4.0% |
Other values (43) | 3392 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 8439 | |
Other Punctuation | 1681 | 16.2% |
Connector Punctuation | 129 | 1.2% |
Decimal Number | 83 | 0.8% |
Uppercase Letter | 11 | 0.1% |
Space Separator | 5 | < 0.1% |
Dash Punctuation | 2 | < 0.1% |
Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 906 | |
o | 858 | |
w | 816 | 9.7% |
s | 756 | 9.0% |
a | 740 | 8.8% |
m | 600 | 7.1% |
n | 454 | 5.4% |
i | 413 | 4.9% |
r | 402 | 4.8% |
h | 374 | 4.4% |
Other values (16) | 2120 |
Decimal Number
Value | Count | Frequency (%) |
2 | 16 | |
1 | 14 | |
0 | 13 | |
4 | 9 | |
3 | 8 | |
7 | 8 | |
8 | 5 | 6.0% |
5 | 4 | 4.8% |
9 | 4 | 4.8% |
6 | 2 | 2.4% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 2 | |
E | 2 | |
U | 1 | |
C | 1 | |
H | 1 | |
A | 1 | |
J | 1 | |
B | 1 | |
D | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 843 | |
. | 573 | |
: | 264 | 15.7% |
? | 1 | 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 129 |
Space Separator
Value | Count | Frequency (%) |
5 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 8450 | |
Common | 1901 | 18.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 906 | |
o | 858 | |
w | 816 | 9.7% |
s | 756 | 8.9% |
a | 740 | 8.8% |
m | 600 | 7.1% |
n | 454 | 5.4% |
i | 413 | 4.9% |
r | 402 | 4.8% |
h | 374 | 4.4% |
Other values (25) | 2131 |
Common
Value | Count | Frequency (%) |
/ | 843 | |
. | 573 | |
: | 264 | 13.9% |
_ | 129 | 6.8% |
2 | 16 | 0.8% |
1 | 14 | 0.7% |
0 | 13 | 0.7% |
4 | 9 | 0.5% |
3 | 8 | 0.4% |
7 | 8 | 0.4% |
Other values (8) | 24 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10351 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
t | 906 | 8.8% |
o | 858 | 8.3% |
/ | 843 | 8.1% |
w | 816 | 7.9% |
s | 756 | 7.3% |
a | 740 | 7.1% |
m | 600 | 5.8% |
. | 573 | 5.5% |
n | 454 | 4.4% |
i | 413 | 4.0% |
Other values (43) | 3392 |
연번 | 구 코드 | 구 이름 | 책방 구분 | 책방 구분명 | 위도 | 경도 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.136 | 0.255 | 0.966 | 0.966 | 0.130 | 0.085 |
구 코드 | 0.136 | 1.000 | 1.000 | 0.218 | 0.218 | 0.896 | 0.901 |
구 이름 | 0.255 | 1.000 | 1.000 | 0.301 | 0.301 | 0.931 | 0.939 |
책방 구분 | 0.966 | 0.218 | 0.301 | 1.000 | 1.000 | 0.194 | 0.222 |
책방 구분명 | 0.966 | 0.218 | 0.301 | 1.000 | 1.000 | 0.194 | 0.222 |
위도 | 0.130 | 0.896 | 0.931 | 0.194 | 0.194 | 1.000 | 0.659 |
경도 | 0.085 | 0.901 | 0.939 | 0.222 | 0.222 | 0.659 | 1.000 |
책방 구분 | 책방 구분명 | 구 이름 | |
---|---|---|---|
책방 구분 | 1.000 | 0.993 | 0.255 |
책방 구분명 | 0.993 | 1.000 | 0.255 |
구 이름 | 0.255 | 0.255 | 1.000 |
연번 | 구 코드 | 위도 | 경도 | 구 이름 | 책방 구분 | 책방 구분명 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | -0.071 | 0.030 | 0.057 | 0.090 | 0.836 | 0.836 |
구 코드 | -0.071 | 1.000 | -0.710 | -0.125 | 0.987 | 0.092 | 0.092 |
위도 | 0.030 | -0.710 | 1.000 | 0.205 | 0.664 | 0.147 | 0.147 |
경도 | 0.057 | -0.125 | 0.205 | 1.000 | 0.685 | 0.169 | 0.169 |
구 이름 | 0.090 | 0.987 | 0.664 | 0.685 | 1.000 | 0.255 | 0.255 |
책방 구분 | 0.836 | 0.092 | 0.147 | 0.169 | 0.255 | 1.000 | 0.993 |
책방 구분명 | 0.836 | 0.092 | 0.147 | 0.169 | 0.255 | 0.993 | 1.000 |
연번 | 책방 이름 | 구 코드 | 구 이름 | 주소 | 전화번호 | 홈페이지 url | 책방 구분 | 책방 구분명 | 위도 | 경도 | SNS url | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2283 | 1984 | 313 | 마포구 | 마포구 동교로 194 혜원빌딩 | 02-325-1984 | <NA> | 2 | 새책방 | 37.557385 | 126.922886 | https://www.instagram.com/1984store |
1 | 2284 | 21세기문고 | 322 | 강남구 | 강남구 남부순환로359길 31 | 02-3463-1880 | <NA> | 2 | 새책방 | 37.486833 | 127.035565 | <NA> |
2 | 2285 | C&S서점 | 322 | 강남구 | 강남구 남부순환로 2806 군인공제회관 지하1층 | 02-2190-2178 | <NA> | 2 | 새책방 | 37.489109 | 127.052914 | <NA> |
3 | 2829 | Itaewon books | 302 | 용산구 | 용산구 녹사평대로 208 | 02-793-8249 | <NA> | 1 | 헌책방 | 37.536246 | 126.987227 | https://www.instagram.com/itaewon_foreign_bookstore |
4 | 3121 | PRNT | 319 | 동작구 | 동작구 만양로1길 1 1층 | 070-4177-0021 | https://prntseoul.com | 2 | 새책방 | 37.505733 | 126.946737 | https://www.instagram.com/prntseoul |
5 | 2287 | SK문고 | 308 | 강북구 | 강북구 솔샘로 215 2층 | 02-945-1959 | <NA> | 2 | 새책방 | 37.620201 | 127.016609 | <NA> |
6 | 2754 | YES24 중고매장(목동점) | 314 | 양천구 | 양천구 오목로 325 지하 1층 | 1566-4295 | <NA> | 1 | 헌책방 | 37.525107 | 126.873627 | <NA> |
7 | 2753 | YES24(강서NC점) | 315 | 강서구 | 강서구 강서로56길 17 NC백화점 강서점 8, 9층 | <NA> | <NA> | 2 | 새책방 | 37.559934 | 126.840499 | <NA> |
8 | 2288 | 가가77페이지 | 313 | 마포구 | 마포구 망원로 74-1 지하1층 | <NA> | http://77page.com | 2 | 새책방 | 37.557261 | 126.905179 | https://www.instagram.com/gaga77page |
9 | 2289 | 가람프라자 | 317 | 금천구 | 금천구 시흥대로41길 95 | 02-891-7474 | <NA> | 2 | 새책방 | 37.450662 | 126.900035 | <NA> |
연번 | 책방 이름 | 구 코드 | 구 이름 | 주소 | 전화번호 | 홈페이지 url | 책방 구분 | 책방 구분명 | 위도 | 경도 | SNS url | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
596 | 2745 | 홍익문고 | 312 | 서대문구 | 서대문구 연세로 2 | 02-392-2020 | http://cafe.naver.com/hongikbook | 2 | 새책방 | 37.555785 | 126.937044 | https://www.instagram.com/hongikmungo |
597 | 2746 | 홍제문고 | 312 | 서대문구 | 서대문구 통일로39가길 30 지하1층 | 02-3217-5552 | https://blog.naver.com/js3217555 | 2 | 새책방 | 37.588969 | 126.944292 | <NA> |
598 | 2747 | 화랑문고 | 310 | 노원구 | 노원구 공릉로 34길 62 | 02-973-8580 | <NA> | 2 | 새책방 | 37.623366 | 127.082121 | <NA> |
599 | 2748 | 환일서점 | 313 | 마포구 | 마포구 환일길 48 | 02-313-3156 | <NA> | 2 | 새책방 | 37.554091 | 126.960892 | <NA> |
600 | 2833 | 황룡서점 | 322 | 강남구 | 강남구 일원로3길 56 | 02-2226-9414 | <NA> | 2 | 새책방 | 37.492497 | 127.084466 | <NA> |
601 | 2749 | 황룡서점 | 323 | 송파구 | 송파구 마천로 271 | 02-400-4501 | <NA> | 2 | 새책방 | 37.497805 | 127.147109 | <NA> |
602 | 3350 | 회전문서재 | 320 | 관악구 | 조원로 2길 55 1층 4호 노란색 벽 | 0507-1455-7025 | https://booking.naver.com/afterwork-library | 2 | 새책방 | 37.48128 | 126.90138 | <NA> |
603 | 2834 | 흙서점 | 320 | 관악구 | 관악구 남부순환로 1916 | 02-884-8454 | <NA> | 1 | 헌책방 | 37.477359 | 126.962068 | <NA> |
604 | 2750 | 흥인서점 | 315 | 강서구 | 강서구 화곡로24길 40 | 02-2696-2320 | <NA> | 2 | 새책방 | 37.539183 | 126.838881 | <NA> |
605 | 2751 | 희망문고 | 307 | 성북구 | 성북구 성북로2길 27 삼선교 맥도날드 뒷문 | 02-744-9534 | <NA> | 2 | 새책방 | 37.589564 | 127.007181 | <NA> |