Amino acid dipepetide frequency for Pseudoplusia includens SNPV IE

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.914AlaAla: 2.914 ± 0.352
0.836AlaCys: 0.836 ± 0.127
2.675AlaAsp: 2.675 ± 0.241
2.675AlaGlu: 2.675 ± 0.281
2.102AlaPhe: 2.102 ± 0.198
1.744AlaGly: 1.744 ± 0.241
0.979AlaHis: 0.979 ± 0.126
3.702AlaIle: 3.702 ± 0.275
2.651AlaLys: 2.651 ± 0.256
4.204AlaLeu: 4.204 ± 0.322
1.553AlaMet: 1.553 ± 0.218
3.559AlaAsn: 3.559 ± 0.293
1.696AlaPro: 1.696 ± 0.206
1.362AlaGln: 1.362 ± 0.171
2.006AlaArg: 2.006 ± 0.212
2.914AlaSer: 2.914 ± 0.269
2.819AlaThr: 2.819 ± 0.292
3.607AlaVal: 3.607 ± 0.297
0.382AlaTrp: 0.382 ± 0.106
1.768AlaTyr: 1.768 ± 0.196
0.0AlaXaa: 0.0 ± 0.0
Cys
1.218CysAla: 1.218 ± 0.145
0.526CysCys: 0.526 ± 0.122
1.385CysAsp: 1.385 ± 0.179
0.932CysGlu: 0.932 ± 0.122
0.884CysPhe: 0.884 ± 0.15
0.955CysGly: 0.955 ± 0.175
0.334CysHis: 0.334 ± 0.089
1.457CysIle: 1.457 ± 0.128
1.433CysLys: 1.433 ± 0.226
1.815CysLeu: 1.815 ± 0.179
0.43CysMet: 0.43 ± 0.106
1.17CysAsn: 1.17 ± 0.176
0.955CysPro: 0.955 ± 0.209
0.836CysGln: 0.836 ± 0.145
1.409CysArg: 1.409 ± 0.201
1.242CysSer: 1.242 ± 0.18
0.908CysThr: 0.908 ± 0.149
1.553CysVal: 1.553 ± 0.176
0.263CysTrp: 0.263 ± 0.081
0.979CysTyr: 0.979 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
2.771AspAla: 2.771 ± 0.307
1.385AspCys: 1.385 ± 0.211
6.879AspAsp: 6.879 ± 0.554
4.156AspGlu: 4.156 ± 0.335
3.392AspPhe: 3.392 ± 0.297
2.556AspGly: 2.556 ± 0.251
1.147AspHis: 1.147 ± 0.142
4.897AspIle: 4.897 ± 0.314
3.989AspLys: 3.989 ± 0.284
5.685AspLeu: 5.685 ± 0.418
1.362AspMet: 1.362 ± 0.175
5.064AspAsn: 5.064 ± 0.336
1.792AspPro: 1.792 ± 0.228
1.672AspGln: 1.672 ± 0.173
3.344AspArg: 3.344 ± 0.279
5.016AspSer: 5.016 ± 0.318
2.962AspThr: 2.962 ± 0.249
3.726AspVal: 3.726 ± 0.295
0.645AspTrp: 0.645 ± 0.143
3.846AspTyr: 3.846 ± 0.342
0.0AspXaa: 0.0 ± 0.0
Glu
2.15GluAla: 2.15 ± 0.22
1.362GluCys: 1.362 ± 0.179
3.416GluAsp: 3.416 ± 0.297
3.894GluGlu: 3.894 ± 0.41
2.389GluPhe: 2.389 ± 0.222
1.481GluGly: 1.481 ± 0.207
1.457GluHis: 1.457 ± 0.19
4.801GluIle: 4.801 ± 0.37
4.013GluLys: 4.013 ± 0.31
4.968GluLeu: 4.968 ± 0.364
1.577GluMet: 1.577 ± 0.201
4.586GluAsn: 4.586 ± 0.354
2.102GluPro: 2.102 ± 0.399
1.672GluGln: 1.672 ± 0.227
2.723GluArg: 2.723 ± 0.256
4.347GluSer: 4.347 ± 0.305
3.105GluThr: 3.105 ± 0.264
2.15GluVal: 2.15 ± 0.249
0.406GluTrp: 0.406 ± 0.089
3.249GluTyr: 3.249 ± 0.26
0.024GluXaa: 0.024 ± 0.027
Phe
1.959PheAla: 1.959 ± 0.23
1.147PheCys: 1.147 ± 0.154
4.658PheAsp: 4.658 ± 0.281
2.89PheGlu: 2.89 ± 0.234
2.317PhePhe: 2.317 ± 0.324
1.863PheGly: 1.863 ± 0.192
1.051PheHis: 1.051 ± 0.15
4.18PheIle: 4.18 ± 0.33
3.702PheLys: 3.702 ± 0.299
3.846PheLeu: 3.846 ± 0.325
1.29PheMet: 1.29 ± 0.18
4.037PheAsn: 4.037 ± 0.342
1.051PhePro: 1.051 ± 0.173
1.266PheGln: 1.266 ± 0.181
2.03PheArg: 2.03 ± 0.21
3.177PheSer: 3.177 ± 0.313
2.03PheThr: 2.03 ± 0.231
3.511PheVal: 3.511 ± 0.274
0.311PheTrp: 0.311 ± 0.076
2.46PheTyr: 2.46 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
1.744GlyAla: 1.744 ± 0.199
0.597GlyCys: 0.597 ± 0.111
2.389GlyAsp: 2.389 ± 0.247
1.815GlyGlu: 1.815 ± 0.244
1.624GlyPhe: 1.624 ± 0.186
2.245GlyGly: 2.245 ± 0.271
0.74GlyHis: 0.74 ± 0.13
2.532GlyIle: 2.532 ± 0.198
2.078GlyLys: 2.078 ± 0.238
2.795GlyLeu: 2.795 ± 0.311
0.86GlyMet: 0.86 ± 0.136
2.46GlyAsn: 2.46 ± 0.26
0.645GlyPro: 0.645 ± 0.143
1.17GlyGln: 1.17 ± 0.158
1.696GlyArg: 1.696 ± 0.251
2.556GlySer: 2.556 ± 0.253
1.672GlyThr: 1.672 ± 0.214
2.269GlyVal: 2.269 ± 0.255
0.263GlyTrp: 0.263 ± 0.069
1.481GlyTyr: 1.481 ± 0.158
0.0GlyXaa: 0.0 ± 0.0
His
0.932HisAla: 0.932 ± 0.152
0.263HisCys: 0.263 ± 0.07
1.815HisAsp: 1.815 ± 0.201
1.099HisGlu: 1.099 ± 0.142
0.955HisPhe: 0.955 ± 0.155
0.645HisGly: 0.645 ± 0.141
1.003HisHis: 1.003 ± 0.199
1.529HisIle: 1.529 ± 0.213
1.218HisLys: 1.218 ± 0.177
2.484HisLeu: 2.484 ± 0.305
0.597HisMet: 0.597 ± 0.117
1.624HisAsn: 1.624 ± 0.202
0.717HisPro: 0.717 ± 0.146
1.147HisGln: 1.147 ± 0.235
0.86HisArg: 0.86 ± 0.136
1.123HisSer: 1.123 ± 0.17
0.74HisThr: 0.74 ± 0.13
1.648HisVal: 1.648 ± 0.219
0.167HisTrp: 0.167 ± 0.061
1.29HisTyr: 1.29 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
3.416IleAla: 3.416 ± 0.261
1.624IleCys: 1.624 ± 0.224
6.282IleAsp: 6.282 ± 0.389
5.781IleGlu: 5.781 ± 0.455
3.511IlePhe: 3.511 ± 0.278
2.15IleGly: 2.15 ± 0.221
1.577IleHis: 1.577 ± 0.192
6.449IleIle: 6.449 ± 0.437
6.354IleLys: 6.354 ± 0.429
6.497IleLeu: 6.497 ± 0.4
2.078IleMet: 2.078 ± 0.248
6.545IleAsn: 6.545 ± 0.359
2.484IlePro: 2.484 ± 0.237
2.699IleGln: 2.699 ± 0.262
3.583IleArg: 3.583 ± 0.342
4.491IleSer: 4.491 ± 0.359
3.631IleThr: 3.631 ± 0.281
5.303IleVal: 5.303 ± 0.313
0.454IleTrp: 0.454 ± 0.092
3.392IleTyr: 3.392 ± 0.305
0.048IleXaa: 0.048 ± 0.034
Lys
2.365LysAla: 2.365 ± 0.284
1.863LysCys: 1.863 ± 0.187
2.699LysAsp: 2.699 ± 0.292
3.01LysGlu: 3.01 ± 0.284
4.013LysPhe: 4.013 ± 0.354
2.03LysGly: 2.03 ± 0.205
1.815LysHis: 1.815 ± 0.214
6.903LysIle: 6.903 ± 0.446
4.801LysLys: 4.801 ± 0.447
6.402LysLeu: 6.402 ± 0.385
2.126LysMet: 2.126 ± 0.22
6.019LysAsn: 6.019 ± 0.403
2.198LysPro: 2.198 ± 0.224
2.628LysGln: 2.628 ± 0.329
5.016LysArg: 5.016 ± 0.401
4.204LysSer: 4.204 ± 0.299
4.3LysThr: 4.3 ± 0.331
2.556LysVal: 2.556 ± 0.256
0.43LysTrp: 0.43 ± 0.101
3.679LysTyr: 3.679 ± 0.308
0.024LysXaa: 0.024 ± 0.027
Leu
3.822LeuAla: 3.822 ± 0.285
1.815LeuCys: 1.815 ± 0.214
4.3LeuAsp: 4.3 ± 0.285
4.897LeuGlu: 4.897 ± 0.402
4.658LeuPhe: 4.658 ± 0.323
2.341LeuGly: 2.341 ± 0.247
2.078LeuHis: 2.078 ± 0.281
6.426LeuIle: 6.426 ± 0.369
7.023LeuLys: 7.023 ± 0.452
9.292LeuLeu: 9.292 ± 0.557
2.413LeuMet: 2.413 ± 0.204
7.644LeuAsn: 7.644 ± 0.407
3.464LeuPro: 3.464 ± 0.305
4.132LeuGln: 4.132 ± 0.324
4.18LeuArg: 4.18 ± 0.313
5.828LeuSer: 5.828 ± 0.412
4.849LeuThr: 4.849 ± 0.345
4.897LeuVal: 4.897 ± 0.355
0.74LeuTrp: 0.74 ± 0.133
4.801LeuTyr: 4.801 ± 0.33
0.096LeuXaa: 0.096 ± 0.055
Met
1.839MetAla: 1.839 ± 0.21
0.693MetCys: 0.693 ± 0.143
1.385MetAsp: 1.385 ± 0.203
1.457MetGlu: 1.457 ± 0.168
1.529MetPhe: 1.529 ± 0.193
0.645MetGly: 0.645 ± 0.134
0.43MetHis: 0.43 ± 0.096
1.983MetIle: 1.983 ± 0.2
1.792MetLys: 1.792 ± 0.184
2.198MetLeu: 2.198 ± 0.202
0.669MetMet: 0.669 ± 0.132
2.198MetAsn: 2.198 ± 0.189
1.194MetPro: 1.194 ± 0.178
1.147MetGln: 1.147 ± 0.166
0.932MetArg: 0.932 ± 0.114
2.58MetSer: 2.58 ± 0.212
1.242MetThr: 1.242 ± 0.163
1.194MetVal: 1.194 ± 0.168
0.311MetTrp: 0.311 ± 0.091
1.123MetTyr: 1.123 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.01AsnAla: 3.01 ± 0.236
1.481AsnCys: 1.481 ± 0.216
5.518AsnAsp: 5.518 ± 0.417
4.753AsnGlu: 4.753 ± 0.357
3.679AsnPhe: 3.679 ± 0.314
3.01AsnGly: 3.01 ± 0.276
1.266AsnHis: 1.266 ± 0.167
5.351AsnIle: 5.351 ± 0.386
5.709AsnLys: 5.709 ± 0.365
6.426AsnLeu: 6.426 ± 0.407
1.792AsnMet: 1.792 ± 0.177
7.668AsnAsn: 7.668 ± 0.54
2.198AsnPro: 2.198 ± 0.289
3.177AsnGln: 3.177 ± 0.489
4.467AsnArg: 4.467 ± 0.358
5.16AsnSer: 5.16 ± 0.361
4.634AsnThr: 4.634 ± 0.333
5.828AsnVal: 5.828 ± 0.364
0.239AsnTrp: 0.239 ± 0.063
4.132AsnTyr: 4.132 ± 0.291
0.0AsnXaa: 0.0 ± 0.0
Pro
1.815ProAla: 1.815 ± 0.263
0.549ProCys: 0.549 ± 0.153
2.126ProAsp: 2.126 ± 0.29
1.911ProGlu: 1.911 ± 0.256
1.768ProPhe: 1.768 ± 0.215
1.218ProGly: 1.218 ± 0.198
0.621ProHis: 0.621 ± 0.111
2.819ProIle: 2.819 ± 0.309
1.935ProLys: 1.935 ± 0.219
2.914ProLeu: 2.914 ± 0.217
0.836ProMet: 0.836 ± 0.144
2.174ProAsn: 2.174 ± 0.271
2.962ProPro: 2.962 ± 0.597
1.123ProGln: 1.123 ± 0.165
1.242ProArg: 1.242 ± 0.186
2.413ProSer: 2.413 ± 0.257
2.245ProThr: 2.245 ± 0.284
2.054ProVal: 2.054 ± 0.242
0.239ProTrp: 0.239 ± 0.081
1.505ProTyr: 1.505 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
1.314GlnAla: 1.314 ± 0.186
0.764GlnCys: 0.764 ± 0.131
1.648GlnAsp: 1.648 ± 0.19
1.529GlnGlu: 1.529 ± 0.2
2.269GlnPhe: 2.269 ± 0.246
0.645GlnGly: 0.645 ± 0.122
1.266GlnHis: 1.266 ± 0.245
2.58GlnIle: 2.58 ± 0.264
2.795GlnLys: 2.795 ± 0.315
4.132GlnLeu: 4.132 ± 0.368
1.218GlnMet: 1.218 ± 0.191
3.296GlnAsn: 3.296 ± 0.451
0.908GlnPro: 0.908 ± 0.15
2.269GlnGln: 2.269 ± 0.438
1.577GlnArg: 1.577 ± 0.224
2.986GlnSer: 2.986 ± 0.41
1.959GlnThr: 1.959 ± 0.198
1.266GlnVal: 1.266 ± 0.156
0.287GlnTrp: 0.287 ± 0.084
1.983GlnTyr: 1.983 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
2.15ArgAla: 2.15 ± 0.235
0.86ArgCys: 0.86 ± 0.149
2.46ArgAsp: 2.46 ± 0.217
2.532ArgGlu: 2.532 ± 0.255
2.269ArgPhe: 2.269 ± 0.241
1.577ArgGly: 1.577 ± 0.19
1.266ArgHis: 1.266 ± 0.193
3.965ArgIle: 3.965 ± 0.316
3.32ArgLys: 3.32 ± 0.248
5.327ArgLeu: 5.327 ± 0.341
1.218ArgMet: 1.218 ± 0.173
3.702ArgAsn: 3.702 ± 0.308
1.983ArgPro: 1.983 ± 0.224
2.58ArgGln: 2.58 ± 0.241
3.941ArgArg: 3.941 ± 0.685
3.392ArgSer: 3.392 ± 0.436
2.413ArgThr: 2.413 ± 0.218
3.034ArgVal: 3.034 ± 0.27
0.358ArgTrp: 0.358 ± 0.092
2.198ArgTyr: 2.198 ± 0.267
0.0ArgXaa: 0.0 ± 0.0
Ser
3.655SerAla: 3.655 ± 0.306
1.314SerCys: 1.314 ± 0.195
4.849SerAsp: 4.849 ± 0.335
3.344SerGlu: 3.344 ± 0.249
3.464SerPhe: 3.464 ± 0.279
2.938SerGly: 2.938 ± 0.354
1.433SerHis: 1.433 ± 0.193
5.709SerIle: 5.709 ± 0.366
4.347SerLys: 4.347 ± 0.32
6.736SerLeu: 6.736 ± 0.459
1.72SerMet: 1.72 ± 0.227
4.419SerAsn: 4.419 ± 0.354
2.484SerPro: 2.484 ± 0.221
2.413SerGln: 2.413 ± 0.218
3.201SerArg: 3.201 ± 0.278
7.023SerSer: 7.023 ± 0.576
4.849SerThr: 4.849 ± 0.34
4.204SerVal: 4.204 ± 0.262
0.621SerTrp: 0.621 ± 0.115
2.484SerTyr: 2.484 ± 0.231
0.024SerXaa: 0.024 ± 0.027
Thr
3.034ThrAla: 3.034 ± 0.26
1.003ThrCys: 1.003 ± 0.159
3.416ThrAsp: 3.416 ± 0.296
2.723ThrGlu: 2.723 ± 0.215
2.556ThrPhe: 2.556 ± 0.186
1.839ThrGly: 1.839 ± 0.202
0.836ThrHis: 0.836 ± 0.159
4.777ThrIle: 4.777 ± 0.339
3.177ThrLys: 3.177 ± 0.305
4.276ThrLeu: 4.276 ± 0.344
1.839ThrMet: 1.839 ± 0.225
3.702ThrAsn: 3.702 ± 0.314
1.887ThrPro: 1.887 ± 0.207
2.198ThrGln: 2.198 ± 0.252
2.747ThrArg: 2.747 ± 0.275
4.18ThrSer: 4.18 ± 0.28
4.61ThrThr: 4.61 ± 0.467
3.273ThrVal: 3.273 ± 0.224
0.334ThrTrp: 0.334 ± 0.093
2.102ThrTyr: 2.102 ± 0.23
0.024ThrXaa: 0.024 ± 0.022
Val
3.464ValAla: 3.464 ± 0.273
1.481ValCys: 1.481 ± 0.188
4.921ValAsp: 4.921 ± 0.397
3.487ValGlu: 3.487 ± 0.287
2.962ValPhe: 2.962 ± 0.246
1.792ValGly: 1.792 ± 0.209
1.242ValHis: 1.242 ± 0.153
4.324ValIle: 4.324 ± 0.251
4.228ValLys: 4.228 ± 0.302
4.658ValLeu: 4.658 ± 0.304
1.409ValMet: 1.409 ± 0.159
4.73ValAsn: 4.73 ± 0.347
2.413ValPro: 2.413 ± 0.33
1.696ValGln: 1.696 ± 0.195
2.747ValArg: 2.747 ± 0.242
4.515ValSer: 4.515 ± 0.339
2.628ValThr: 2.628 ± 0.222
4.013ValVal: 4.013 ± 0.311
0.454ValTrp: 0.454 ± 0.103
2.962ValTyr: 2.962 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.382TrpAla: 0.382 ± 0.107
0.167TrpCys: 0.167 ± 0.055
0.406TrpAsp: 0.406 ± 0.097
0.382TrpGlu: 0.382 ± 0.101
0.239TrpPhe: 0.239 ± 0.081
0.119TrpGly: 0.119 ± 0.05
0.263TrpHis: 0.263 ± 0.085
0.549TrpIle: 0.549 ± 0.106
0.478TrpLys: 0.478 ± 0.1
0.478TrpLeu: 0.478 ± 0.098
0.239TrpMet: 0.239 ± 0.078
0.621TrpAsn: 0.621 ± 0.127
0.239TrpPro: 0.239 ± 0.089
0.311TrpGln: 0.311 ± 0.095
0.406TrpArg: 0.406 ± 0.093
0.621TrpSer: 0.621 ± 0.145
0.573TrpThr: 0.573 ± 0.105
0.334TrpVal: 0.334 ± 0.084
0.143TrpTrp: 0.143 ± 0.058
0.358TrpTyr: 0.358 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.174TyrAla: 2.174 ± 0.224
0.955TyrCys: 0.955 ± 0.176
3.058TyrAsp: 3.058 ± 0.263
2.58TyrGlu: 2.58 ± 0.209
2.198TyrPhe: 2.198 ± 0.193
1.839TyrGly: 1.839 ± 0.231
1.027TyrHis: 1.027 ± 0.16
3.416TyrIle: 3.416 ± 0.284
3.941TyrLys: 3.941 ± 0.263
4.562TyrLeu: 4.562 ± 0.328
1.29TyrMet: 1.29 ± 0.176
4.228TyrAsn: 4.228 ± 0.297
1.123TyrPro: 1.123 ± 0.149
1.194TyrGln: 1.194 ± 0.152
2.436TyrArg: 2.436 ± 0.275
3.44TyrSer: 3.44 ± 0.258
2.365TyrThr: 2.365 ± 0.225
3.702TyrVal: 3.702 ± 0.284
0.263TyrTrp: 0.263 ± 0.077
2.651TyrTyr: 2.651 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.024XaaAla: 0.024 ± 0.027
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.024XaaGlu: 0.024 ± 0.028
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.024XaaHis: 0.024 ± 0.025
0.0XaaIle: 0.0 ± 0.0
0.096XaaLys: 0.096 ± 0.05
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.048XaaAsn: 0.048 ± 0.031
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.024XaaSer: 0.024 ± 0.027
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 141 proteins (41865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski