Overview

Dataset statistics

Number of variables7
Number of observations49
Missing cells2
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory60.7 B

Variable types

Categorical2
Text1
Numeric2
DateTime2

Dataset

Description남동구 도서관홈페이지 자료현황에 대한 데이터로 기관명, 내용, 인덱스, 수정날짜, 등록날짜, 제목 항목을 제공합니다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15117170&srcSe=7661IVAWM27C61E190

Alerts

인덱스 is highly overall correlated with 제목High correlation
제목 is highly overall correlated with 인덱스High correlation
수정날짜 has 2 (4.1%) missing valuesMissing
인덱스 has unique valuesUnique

Reproduction

Analysis started2024-01-28 05:04:13.731435
Analysis finished2024-01-28 05:04:14.456589
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Categorical

Distinct6
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
남동논현도서관
12 
만수2동 어린이도서관
10 
소래도서관
서창도서관
간석3동 어린이도서관

Length

Max length11
Median length10
Mean length7.7959184
Min length5

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row소래도서관
2nd row서창도서관
3rd row간석3동 어린이도서관
4th row만수2동 어린이도서관
5th row소래도서관

Common Values

ValueCountFrequency (%)
남동논현도서관 12
24.5%
만수2동 어린이도서관 10
20.4%
소래도서관 9
18.4%
서창도서관 9
18.4%
간석3동 어린이도서관 8
16.3%
간석4동 작은도서관 1
 
2.0%

Length

2024-01-28T14:04:14.518873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T14:04:14.642714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이도서관 18
26.5%
남동논현도서관 12
17.6%
만수2동 10
14.7%
소래도서관 9
13.2%
서창도서관 9
13.2%
간석3동 8
11.8%
간석4동 1
 
1.5%
작은도서관 1
 
1.5%

내용
Text

Distinct35
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-28T14:04:14.786958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length35
Mean length32.795918
Min length30

Characters and Unicode

Total characters1607
Distinct characters37
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)53.1%

Sample

1st row소래도서관(기준 : 2019-04-01 현재) 자료현황
2nd row서창도서관(기준 : 2019-04-01 현재) 자료현황
3rd row간석3동 어린이도서관(기준 : 2019-04-01 현재) 자료현황
4th row만수2동 어린이도서관(기준 : 2019-04-01 현재) 자료현황
5th row소래도서관(기준 : 2019-12-31 현재) 자료현황
ValueCountFrequency (%)
49
18.6%
현재 49
18.6%
자료현황 49
18.6%
어린이도서관(기준 18
 
6.8%
2021-04-30 13
 
4.9%
남동논현도서관(기준 12
 
4.5%
만수2동 10
 
3.8%
소래도서관(기준 9
 
3.4%
서창도서관(기준 9
 
3.4%
간석3동 8
 
3.0%
Other values (9) 38
14.4%
2024-01-28T14:04:15.053201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
215
 
13.4%
2 136
 
8.5%
0 123
 
7.7%
110
 
6.8%
- 98
 
6.1%
3 60
 
3.7%
58
 
3.6%
1 53
 
3.3%
49
 
3.0%
49
 
3.0%
Other values (27) 656
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 736
45.8%
Decimal Number 411
25.6%
Space Separator 215
 
13.4%
Dash Punctuation 98
 
6.1%
Other Punctuation 49
 
3.0%
Close Punctuation 49
 
3.0%
Open Punctuation 49
 
3.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
110
14.9%
58
 
7.9%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
Other values (15) 176
23.9%
Decimal Number
ValueCountFrequency (%)
2 136
33.1%
0 123
29.9%
3 60
14.6%
1 53
 
12.9%
4 18
 
4.4%
9 14
 
3.4%
8 7
 
1.7%
Space Separator
ValueCountFrequency (%)
215
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%
Other Punctuation
ValueCountFrequency (%)
: 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 871
54.2%
Hangul 736
45.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
110
14.9%
58
 
7.9%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
Other values (15) 176
23.9%
Common
ValueCountFrequency (%)
215
24.7%
2 136
15.6%
0 123
14.1%
- 98
11.3%
3 60
 
6.9%
1 53
 
6.1%
: 49
 
5.6%
) 49
 
5.6%
( 49
 
5.6%
4 18
 
2.1%
Other values (2) 21
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 871
54.2%
Hangul 736
45.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
215
24.7%
2 136
15.6%
0 123
14.1%
- 98
11.3%
3 60
 
6.9%
1 53
 
6.1%
: 49
 
5.6%
) 49
 
5.6%
( 49
 
5.6%
4 18
 
2.1%
Other values (2) 21
 
2.4%
Hangul
ValueCountFrequency (%)
110
14.9%
58
 
7.9%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
49
 
6.7%
Other values (15) 176
23.9%

조회수
Real number (ℝ)

Distinct17
Distinct (%)34.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.6122449
Minimum2
Maximum81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2024-01-28T14:04:15.162262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q14
median8
Q310
95-th percentile23
Maximum81
Range79
Interquartile range (IQR)6

Descriptive statistics

Standard deviation11.858219
Coefficient of variation (CV)1.2336576
Kurtosis28.258802
Mean9.6122449
Median Absolute Deviation (MAD)3
Skewness4.9057829
Sum471
Variance140.61735
MonotonicityNot monotonic
2024-01-28T14:04:15.263812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
4 11
22.4%
8 7
14.3%
2 4
 
8.2%
7 4
 
8.2%
9 4
 
8.2%
11 3
 
6.1%
10 3
 
6.1%
6 3
 
6.1%
12 2
 
4.1%
32 1
 
2.0%
Other values (7) 7
14.3%
ValueCountFrequency (%)
2 4
 
8.2%
3 1
 
2.0%
4 11
22.4%
5 1
 
2.0%
6 3
 
6.1%
7 4
 
8.2%
8 7
14.3%
9 4
 
8.2%
10 3
 
6.1%
11 3
 
6.1%
ValueCountFrequency (%)
81 1
 
2.0%
32 1
 
2.0%
27 1
 
2.0%
17 1
 
2.0%
15 1
 
2.0%
14 1
 
2.0%
12 2
4.1%
11 3
6.1%
10 3
6.1%
9 4
8.2%

인덱스
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.367347
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2024-01-28T14:04:15.372406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.4
Q123
median36
Q348
95-th percentile57.6
Maximum60
Range59
Interquartile range (IQR)25

Descriptive statistics

Standard deviation15.224287
Coefficient of variation (CV)0.43046166
Kurtosis-0.94605173
Mean35.367347
Median Absolute Deviation (MAD)13
Skewness-0.16987928
Sum1733
Variance231.77891
MonotonicityNot monotonic
2024-01-28T14:04:15.500394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
11 1
 
2.0%
49 1
 
2.0%
39 1
 
2.0%
40 1
 
2.0%
41 1
 
2.0%
42 1
 
2.0%
43 1
 
2.0%
44 1
 
2.0%
45 1
 
2.0%
46 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
11 1
2.0%
12 1
2.0%
13 1
2.0%
14 1
2.0%
16 1
2.0%
17 1
2.0%
18 1
2.0%
19 1
2.0%
20 1
2.0%
ValueCountFrequency (%)
60 1
2.0%
59 1
2.0%
58 1
2.0%
57 1
2.0%
56 1
2.0%
55 1
2.0%
54 1
2.0%
53 1
2.0%
52 1
2.0%
51 1
2.0%

수정날짜
Date

MISSING 

Distinct42
Distinct (%)89.4%
Missing2
Missing (%)4.1%
Memory size524.0 B
Minimum2020-01-18 14:08:00
Maximum2023-04-19 16:31:00
2024-01-28T14:04:15.628245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:04:15.741520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
Distinct45
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2019-08-21 15:01:00
Maximum2023-04-13 17:42:00
2024-01-28T14:04:16.115213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:04:16.222488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)

제목
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)34.7%
Missing0
Missing (%)0.0%
Memory size524.0 B
자료수정일자_20221025
자료수정일자_20210430
자료수정일자_20220327
자료수정일자_20220307
자료수정일자_20230413
Other values (12)
23 

Length

Max length16
Median length15
Mean length15.020408
Min length15

Unique

Unique6 ?
Unique (%)12.2%

Sample

1st row자료현황수정일_20190821
2nd row자료수정일자_20190821
3rd row자료수정일자_20190821
4th row자료수정일자_20190821
5th row자료수정일자_20200118

Common Values

ValueCountFrequency (%)
자료수정일자_20221025 6
12.2%
자료수정일자_20210430 5
10.2%
자료수정일자_20220327 5
10.2%
자료수정일자_20220307 5
10.2%
자료수정일자_20230413 5
10.2%
자료수정일자_20210130 4
8.2%
자료수정일자_20200118 3
 
6.1%
자료수정일자_20230213 3
 
6.1%
자료수정일자_20190821 3
 
6.1%
자료수정일자_20220521 2
 
4.1%
Other values (7) 8
16.3%

Length

2024-01-28T14:04:16.321571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
자료수정일자_20221025 6
12.2%
자료수정일자_20220327 5
10.2%
자료수정일자_20220307 5
10.2%
자료수정일자_20230413 5
10.2%
자료수정일자_20210430 5
10.2%
자료수정일자_20210130 4
8.2%
자료수정일자_20230213 3
 
6.1%
자료수정일자_20190821 3
 
6.1%
자료수정일자_20200118 3
 
6.1%
자료수정일자_20220521 2
 
4.1%
Other values (7) 8
16.3%

Interactions

2024-01-28T14:04:14.165940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:04:14.012863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:04:14.238613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:04:14.094280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T14:04:16.387707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명내용조회수인덱스수정날짜등록날짜제목
기관명1.0001.0000.2900.0000.6950.6340.000
내용1.0001.0000.0000.7870.9500.9260.000
조회수0.2900.0001.0000.0630.4850.9800.745
인덱스0.0000.7870.0631.0000.9890.8910.922
수정날짜0.6950.9500.4850.9891.0000.8430.986
등록날짜0.6340.9260.9800.8910.8431.0000.961
제목0.0000.0000.7450.9220.9860.9611.000
2024-01-28T14:04:16.471361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명제목
기관명1.0000.000
제목0.0001.000
2024-01-28T14:04:16.537157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조회수인덱스기관명제목
조회수1.000-0.4230.1800.435
인덱스-0.4231.0000.0000.629
기관명0.1800.0001.0000.000
제목0.4350.6290.0001.000

Missing values

2024-01-28T14:04:14.329712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T14:04:14.420335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명내용조회수인덱스수정날짜등록날짜제목
0소래도서관소래도서관(기준 : 2019-04-01 현재) 자료현황8112020-01-18 14:08:002019-08-21 15:01:00자료현황수정일_20190821
1서창도서관서창도서관(기준 : 2019-04-01 현재) 자료현황4122020-01-18 14:34:002019-08-21 15:01:00자료수정일자_20190821
2간석3동 어린이도서관간석3동 어린이도서관(기준 : 2019-04-01 현재) 자료현황4132020-01-18 14:43:002019-08-21 15:01:00자료수정일자_20190821
3만수2동 어린이도서관만수2동 어린이도서관(기준 : 2019-04-01 현재) 자료현황4142020-01-18 15:03:002019-08-21 15:02:00자료수정일자_20190821
4소래도서관소래도서관(기준 : 2019-12-31 현재) 자료현황27162020-09-27 14:32:002020-01-18 14:08:00자료수정일자_20200118
5서창도서관서창도서관(기준 : 2019-12-31 현재) 자료현황9172020-09-27 15:01:002020-01-18 14:34:00자료수정일자_20200118
6간석3동 어린이도서관간석3동 어린이도서관(기준 : 2019-12-31 현재) 자료현황9182021-01-30 20:27:002020-01-18 14:43:00자료수정일자_20200831
7만수2동 어린이도서관만수2동 어린이도서관(기준 : 2019-12-31 현재) 자료현황11192020-10-24 11:49:002020-01-18 15:03:00자료수정일자_20200118
8소래도서관소래도서관(기준 : 2021-04-30 현재) 자료현황32202022-03-07 11:15:002020-09-27 14:32:00자료수정일자_20201231
9서창도서관서창도서관(기준 : 2021-04-30 현재) 자료현황15212021-01-30 20:23:002020-09-27 15:00:00자료수정일자_20200927
기관명내용조회수인덱스수정날짜등록날짜제목
39남동논현도서관남동논현도서관(기준 : 2023-01-30 현재) 자료현황7512023-04-13 16:41:002023-02-10 11:33:00자료수정일자_20230210
40소래도서관소래도서관(기준 : 2023-01-30 현재) 자료현황8522023-04-13 16:49:002023-02-13 11:50:00자료수정일자_20230213
41서창도서관서창도서관(기준 : 2023-01-30 현재) 자료현황4532023-04-13 16:59:002023-02-13 11:57:00자료수정일자_20230213
42간석3동 어린이도서관간석3동 어린이도서관(기준 : 2023-01-30 현재) 자료현황7542023-04-13 17:37:002023-02-13 13:44:00자료수정일자_20230213
43만수2동 어린이도서관만수2동 어린이도서관(기준 : 2023-01-30 현재) 자료현황4552023-04-13 17:42:002023-02-13 13:48:00자료수정일자_20230210
44남동논현도서관남동논현도서관(기준 : 2023-03-31 현재) 자료현황4562023-04-19 16:29:002023-04-13 16:41:00자료수정일자_20230413
45소래도서관소래도서관(기준 : 2023-03-31 현재) 자료현황5572023-04-19 16:30:002023-04-13 16:49:00자료수정일자_20230413
46서창도서관서창도서관(기준 : 2023-03-31 현재) 자료현황2582023-04-19 16:30:002023-04-13 16:59:00자료수정일자_20230413
47간석3동 어린이도서관간석3동 어린이도서관(기준 : 2023-01-30 현재) 자료현황2592023-04-19 16:30:002023-04-13 17:37:00자료수정일자_20230413
48만수2동 어린이도서관만수2동 어린이도서관(기준 : 2023-03-31 현재) 자료현황3602023-04-19 16:31:002023-04-13 17:42:00자료수정일자_20230413