Amino acid dipeptide frequency for Common midwife toad virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
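As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be counted and compared with the uniform expectation of 0.25% (2.5‰). It is not the database's own code. The function names, the toy sequences, the choice to count overlapping pairs within each protein, and the choice to skip pairs containing non-standard residues (rather than mapping them to Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 400 standard dipeptides.

    Assumption: overlapping adjacent pairs are counted inside each
    protein (never across protein boundaries), and pairs containing a
    non-standard residue are ignored.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            if first in STANDARD and second in STANDARD:
                counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(STANDARD, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAAAK", "MKAA"]
freqs = dipeptide_frequencies(toy_proteins)
```

With values from the table, `classify(10.646)` (AlaAla) gives `"over"` and `classify(0.692)` (CysCys) gives `"under"`, which corresponds to the red and blue marking described above.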

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
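The unit conversion can be sketched as follows. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1


ala_ala_permille = 10.646  # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = permille_to_percent(ala_ala_permille)
```

Here `ala_ala_fraction` is about 0.0106 and `ala_ala_percent` is about 1.06, i.e. roughly one in every 94 dipeptides is AlaAla.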

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.646 ± 0.836
AlaCys: 1.78 ± 0.231
AlaAsp: 4.318 ± 0.39
AlaGlu: 5.669 ± 0.457
AlaPhe: 2.901 ± 0.339
AlaGly: 7.054 ± 0.594
AlaHis: 1.879 ± 0.27
AlaIle: 2.406 ± 0.272
AlaLys: 4.483 ± 0.463
AlaLeu: 7.12 ± 0.557
AlaMet: 2.966 ± 0.299
AlaAsn: 1.648 ± 0.274
AlaPro: 5.406 ± 1.089
AlaGln: 3.098 ± 0.652
AlaArg: 4.977 ± 0.521
AlaSer: 6.658 ± 0.667
AlaThr: 4.417 ± 0.396
AlaVal: 9.13 ± 0.623
AlaTrp: 1.253 ± 0.226
AlaTyr: 2.604 ± 0.255
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.912 ± 0.316
CysCys: 0.692 ± 0.162
CysAsp: 1.253 ± 0.193
CysGlu: 1.022 ± 0.183
CysPhe: 0.494 ± 0.136
CysGly: 1.549 ± 0.24
CysHis: 0.527 ± 0.138
CysIle: 0.659 ± 0.149
CysLys: 1.318 ± 0.215
CysLeu: 1.549 ± 0.259
CysMet: 0.692 ± 0.184
CysAsn: 0.593 ± 0.162
CysPro: 1.648 ± 0.351
CysGln: 0.494 ± 0.122
CysArg: 1.516 ± 0.245
CysSer: 1.549 ± 0.222
CysThr: 0.725 ± 0.155
CysVal: 1.615 ± 0.22
CysTrp: 0.461 ± 0.126
CysTyr: 0.626 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.274 ± 0.438
AspCys: 1.22 ± 0.179
AspAsp: 3.329 ± 0.372
AspGlu: 2.934 ± 0.346
AspPhe: 1.747 ± 0.264
AspGly: 4.779 ± 0.508
AspHis: 1.055 ± 0.14
AspIle: 2.373 ± 0.311
AspLys: 2.604 ± 0.285
AspLeu: 5.109 ± 0.483
AspMet: 1.978 ± 0.28
AspAsn: 1.879 ± 0.346
AspPro: 4.944 ± 0.503
AspGln: 1.417 ± 0.261
AspArg: 3.988 ± 0.378
AspSer: 4.713 ± 0.477
AspThr: 2.406 ± 0.256
AspVal: 4.878 ± 0.432
AspTrp: 0.956 ± 0.193
AspTyr: 2.34 ± 0.296
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.328 ± 0.61
GluCys: 1.351 ± 0.295
GluAsp: 3.593 ± 0.4
GluGlu: 4.021 ± 0.723
GluPhe: 1.879 ± 0.222
GluGly: 4.054 ± 0.373
GluHis: 0.824 ± 0.175
GluIle: 1.813 ± 0.249
GluLys: 2.901 ± 0.313
GluLeu: 3.362 ± 0.342
GluMet: 2.077 ± 0.287
GluAsn: 1.121 ± 0.195
GluPro: 2.868 ± 0.362
GluGln: 1.78 ± 0.41
GluArg: 3.955 ± 0.343
GluSer: 3.56 ± 0.441
GluThr: 3.725 ± 0.397
GluVal: 3.626 ± 0.329
GluTrp: 1.187 ± 0.199
GluTyr: 1.978 ± 0.245
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.065 ± 0.368
PheCys: 0.626 ± 0.15
PheAsp: 1.417 ± 0.213
PheGlu: 1.846 ± 0.234
PhePhe: 1.253 ± 0.205
PheGly: 2.571 ± 0.342
PheHis: 0.626 ± 0.153
PheIle: 1.022 ± 0.204
PheLys: 1.417 ± 0.205
PheLeu: 2.835 ± 0.274
PheMet: 0.923 ± 0.183
PheAsn: 1.154 ± 0.18
PhePro: 1.978 ± 0.282
PheGln: 0.659 ± 0.136
PheArg: 2.439 ± 0.368
PheSer: 2.736 ± 0.275
PheThr: 2.044 ± 0.33
PheVal: 2.835 ± 0.311
PheTrp: 0.297 ± 0.106
PheTyr: 1.055 ± 0.193
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.032 ± 0.487
GlyCys: 1.714 ± 0.283
GlyAsp: 4.647 ± 0.484
GlyGlu: 3.197 ± 0.309
GlyPhe: 2.67 ± 0.347
GlyGly: 5.57 ± 0.484
GlyHis: 1.747 ± 0.314
GlyIle: 2.241 ± 0.312
GlyLys: 4.153 ± 0.402
GlyLeu: 5.801 ± 0.491
GlyMet: 2.011 ± 0.266
GlyAsn: 1.285 ± 0.259
GlyPro: 4.68 ± 0.69
GlyGln: 2.044 ± 0.283
GlyArg: 5.966 ± 0.614
GlySer: 5.735 ± 0.398
GlyThr: 4.417 ± 0.414
GlyVal: 5.603 ± 0.486
GlyTrp: 1.384 ± 0.258
GlyTyr: 2.538 ± 0.294
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.747 ± 0.196
HisCys: 0.363 ± 0.098
HisAsp: 1.154 ± 0.208
HisGlu: 0.626 ± 0.147
HisPhe: 0.527 ± 0.114
HisGly: 1.714 ± 0.268
HisHis: 0.56 ± 0.239
HisIle: 0.758 ± 0.147
HisLys: 0.857 ± 0.161
HisLeu: 1.978 ± 0.258
HisMet: 0.593 ± 0.153
HisAsn: 0.626 ± 0.205
HisPro: 1.615 ± 0.282
HisGln: 0.692 ± 0.188
HisArg: 1.318 ± 0.189
HisSer: 1.285 ± 0.244
HisThr: 1.285 ± 0.241
HisVal: 2.241 ± 0.36
HisTrp: 0.231 ± 0.086
HisTyr: 0.791 ± 0.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.307 ± 0.276
IleCys: 0.593 ± 0.171
IleAsp: 1.846 ± 0.267
IleGlu: 1.516 ± 0.227
IlePhe: 1.154 ± 0.175
IleGly: 1.681 ± 0.232
IleHis: 0.89 ± 0.182
IleIle: 0.956 ± 0.161
IleLys: 2.208 ± 0.204
IleLeu: 3.362 ± 0.301
IleMet: 1.154 ± 0.168
IleAsn: 0.857 ± 0.189
IlePro: 2.109 ± 0.263
IleGln: 0.956 ± 0.23
IleArg: 2.769 ± 0.28
IleSer: 2.208 ± 0.252
IleThr: 1.45 ± 0.295
IleVal: 2.67 ± 0.362
IleTrp: 0.198 ± 0.085
IleTyr: 1.022 ± 0.207
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.45 ± 0.497
LysCys: 0.89 ± 0.184
LysAsp: 2.802 ± 0.312
LysGlu: 2.999 ± 0.368
LysPhe: 1.318 ± 0.182
LysGly: 4.186 ± 0.389
LysHis: 0.626 ± 0.133
LysIle: 2.34 ± 0.269
LysLys: 3.988 ± 0.73
LysLeu: 3.955 ± 0.422
LysMet: 1.78 ± 0.25
LysAsn: 1.615 ± 0.187
LysPro: 3.692 ± 0.615
LysGln: 1.351 ± 0.265
LysArg: 5.208 ± 0.838
LysSer: 3.56 ± 0.783
LysThr: 3.823 ± 0.384
LysVal: 3.329 ± 0.28
LysTrp: 0.593 ± 0.163
LysTyr: 1.879 ± 0.252
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.394 ± 0.51
LeuCys: 1.978 ± 0.298
LeuAsp: 5.439 ± 0.386
LeuGlu: 5.076 ± 0.461
LeuPhe: 2.769 ± 0.297
LeuGly: 5.406 ± 0.558
LeuHis: 1.648 ± 0.299
LeuIle: 2.406 ± 0.307
LeuLys: 4.812 ± 0.412
LeuLeu: 6.328 ± 0.614
LeuMet: 2.274 ± 0.289
LeuAsn: 2.472 ± 0.273
LeuPro: 4.483 ± 0.478
LeuGln: 1.351 ± 0.227
LeuArg: 6.493 ± 0.537
LeuSer: 6.23 ± 0.476
LeuThr: 5.208 ± 0.462
LeuVal: 5.801 ± 0.46
LeuTrp: 1.088 ± 0.211
LeuTyr: 2.142 ± 0.214
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.131 ± 0.392
MetCys: 0.89 ± 0.222
MetAsp: 2.109 ± 0.241
MetGlu: 1.912 ± 0.237
MetPhe: 1.22 ± 0.189
MetGly: 2.637 ± 0.293
MetHis: 0.89 ± 0.183
MetIle: 0.692 ± 0.147
MetLys: 0.758 ± 0.16
MetLeu: 2.011 ± 0.245
MetMet: 0.824 ± 0.164
MetAsn: 0.428 ± 0.137
MetPro: 1.384 ± 0.205
MetGln: 0.725 ± 0.142
MetArg: 2.011 ± 0.235
MetSer: 2.999 ± 0.365
MetThr: 2.274 ± 0.242
MetVal: 2.34 ± 0.303
MetTrp: 0.461 ± 0.15
MetTyr: 0.626 ± 0.188
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.241 ± 0.313
AsnCys: 0.527 ± 0.124
AsnAsp: 0.824 ± 0.174
AsnGlu: 0.956 ± 0.152
AsnPhe: 0.725 ± 0.146
AsnGly: 1.615 ± 0.209
AsnHis: 0.428 ± 0.104
AsnIle: 1.253 ± 0.274
AsnLys: 1.022 ± 0.153
AsnLeu: 2.637 ± 0.343
AsnMet: 0.89 ± 0.192
AsnAsn: 0.659 ± 0.187
AsnPro: 2.208 ± 0.405
AsnGln: 0.692 ± 0.129
AsnArg: 1.681 ± 0.24
AsnSer: 1.549 ± 0.222
AsnThr: 1.253 ± 0.256
AsnVal: 2.571 ± 0.278
AsnTrp: 0.428 ± 0.114
AsnTyr: 0.824 ± 0.186
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.416 ± 1.388
ProCys: 1.121 ± 0.231
ProAsp: 3.791 ± 0.401
ProGlu: 4.746 ± 0.573
ProPhe: 2.109 ± 0.301
ProGly: 4.516 ± 0.474
ProHis: 1.813 ± 0.246
ProIle: 1.813 ± 0.24
ProLys: 3.593 ± 0.643
ProLeu: 4.351 ± 0.611
ProMet: 1.253 ± 0.2
ProAsn: 1.318 ± 0.253
ProPro: 4.186 ± 0.584
ProGln: 1.879 ± 0.28
ProArg: 3.955 ± 0.674
ProSer: 5.01 ± 0.463
ProThr: 3.395 ± 0.609
ProVal: 6.658 ± 0.838
ProTrp: 1.121 ± 0.251
ProTyr: 1.516 ± 0.247
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.34 ± 0.24
GlnCys: 0.692 ± 0.15
GlnAsp: 1.714 ± 0.297
GlnGlu: 2.044 ± 0.435
GlnPhe: 0.692 ± 0.142
GlnGly: 1.846 ± 0.292
GlnHis: 0.626 ± 0.196
GlnIle: 1.055 ± 0.183
GlnLys: 1.187 ± 0.173
GlnLeu: 1.846 ± 0.256
GlnMet: 0.725 ± 0.161
GlnAsn: 0.659 ± 0.128
GlnPro: 1.681 ± 0.376
GlnGln: 3.032 ± 1.784
GlnArg: 2.208 ± 0.687
GlnSer: 2.175 ± 0.45
GlnThr: 1.846 ± 0.309
GlnVal: 2.077 ± 0.283
GlnTrp: 0.33 ± 0.107
GlnTyr: 0.659 ± 0.115
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.406 ± 0.439
ArgCys: 1.088 ± 0.18
ArgAsp: 4.68 ± 0.465
ArgGlu: 4.549 ± 0.357
ArgPhe: 2.011 ± 0.222
ArgGly: 5.57 ± 0.51
ArgHis: 1.78 ± 0.331
ArgIle: 2.044 ± 0.276
ArgLys: 4.219 ± 0.85
ArgLeu: 6.032 ± 0.528
ArgMet: 2.307 ± 0.319
ArgAsn: 2.077 ± 0.347
ArgPro: 4.944 ± 0.626
ArgGln: 2.274 ± 0.311
ArgArg: 6.361 ± 0.573
ArgSer: 4.549 ± 0.735
ArgThr: 3.856 ± 0.513
ArgVal: 5.603 ± 0.421
ArgTrp: 0.923 ± 0.19
ArgTyr: 2.109 ± 0.264
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.427 ± 0.635
SerCys: 1.549 ± 0.223
SerAsp: 5.274 ± 0.396
SerGlu: 3.889 ± 0.402
SerPhe: 2.999 ± 0.279
SerGly: 5.34 ± 0.368
SerHis: 1.582 ± 0.255
SerIle: 1.978 ± 0.237
SerLys: 3.494 ± 0.429
SerLeu: 6.394 ± 0.594
SerMet: 2.077 ± 0.289
SerAsn: 1.648 ± 0.258
SerPro: 5.966 ± 1.483
SerGln: 1.912 ± 0.312
SerArg: 4.417 ± 0.487
SerSer: 5.603 ± 0.585
SerThr: 3.131 ± 0.292
SerVal: 6.263 ± 0.565
SerTrp: 1.22 ± 0.21
SerTyr: 1.747 ± 0.229
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.933 ± 0.568
ThrCys: 1.055 ± 0.185
ThrAsp: 3.494 ± 0.354
ThrGlu: 2.472 ± 0.286
ThrPhe: 2.208 ± 0.251
ThrGly: 5.01 ± 0.456
ThrHis: 0.692 ± 0.117
ThrIle: 1.945 ± 0.221
ThrLys: 2.637 ± 0.3
ThrLeu: 4.713 ± 0.477
ThrMet: 1.879 ± 0.221
ThrAsn: 1.154 ± 0.221
ThrPro: 4.087 ± 0.6
ThrGln: 1.714 ± 0.35
ThrArg: 3.494 ± 0.334
ThrSer: 2.999 ± 0.361
ThrThr: 2.571 ± 0.869
ThrVal: 6.296 ± 0.511
ThrTrp: 0.396 ± 0.165
ThrTyr: 1.318 ± 0.277
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.098 ± 0.495
ValCys: 1.846 ± 0.283
ValAsp: 4.845 ± 0.401
ValGlu: 4.252 ± 0.467
ValPhe: 2.835 ± 0.305
ValGly: 5.01 ± 0.596
ValHis: 2.109 ± 0.292
ValIle: 2.373 ± 0.286
ValLys: 6.296 ± 0.705
ValLeu: 6.889 ± 0.597
ValMet: 2.538 ± 0.336
ValAsn: 2.538 ± 0.268
ValPro: 5.076 ± 0.548
ValGln: 2.175 ± 0.262
ValArg: 7.021 ± 0.696
ValSer: 6.625 ± 0.509
ValThr: 5.043 ± 0.591
ValVal: 6.988 ± 0.675
ValTrp: 1.187 ± 0.196
ValTyr: 2.439 ± 0.312
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.923 ± 0.238
TrpCys: 0.297 ± 0.086
TrpAsp: 1.154 ± 0.182
TrpGlu: 0.758 ± 0.176
TrpPhe: 0.527 ± 0.141
TrpGly: 0.956 ± 0.19
TrpHis: 0.297 ± 0.115
TrpIle: 0.461 ± 0.147
TrpLys: 0.89 ± 0.164
TrpLeu: 1.351 ± 0.191
TrpMet: 0.396 ± 0.111
TrpAsn: 0.527 ± 0.148
TrpPro: 0.692 ± 0.137
TrpGln: 0.264 ± 0.091
TrpArg: 0.923 ± 0.181
TrpSer: 0.725 ± 0.149
TrpThr: 1.483 ± 0.26
TrpVal: 0.857 ± 0.217
TrpTrp: 0.231 ± 0.085
TrpTyr: 0.461 ± 0.131
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.34 ± 0.318
TyrCys: 0.626 ± 0.153
TyrAsp: 2.241 ± 0.258
TyrGlu: 1.516 ± 0.256
TyrPhe: 0.758 ± 0.153
TyrGly: 2.373 ± 0.255
TyrHis: 0.363 ± 0.101
TyrIle: 1.318 ± 0.186
TyrLys: 1.549 ± 0.215
TyrLeu: 2.175 ± 0.239
TyrMet: 0.956 ± 0.217
TyrAsn: 0.725 ± 0.156
TyrPro: 1.912 ± 0.251
TyrGln: 0.923 ± 0.164
TyrArg: 1.714 ± 0.267
TyrSer: 2.439 ± 0.254
TyrThr: 1.714 ± 0.225
TyrVal: 2.802 ± 0.325
TyrTrp: 0.231 ± 0.087
TyrTyr: 0.791 ± 0.184
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 104 proteins (30340 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
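For readers unfamiliar with the procedure, a protein-level bootstrap can be sketched roughly as below. This is only an illustration under stated assumptions, not the database's implementation: the function names are hypothetical, the error is assumed to be the standard deviation across replicates (the page does not say which statistic is reported), and dipeptides are assumed to be counted as overlapping pairs within each protein.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. 'AA')."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the reported error is the sample standard deviation of
    the per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Each replicate draws as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the database.
toy_proteins = ["MAAAK", "MKAA", "MGLV", "MAAGG"]
error_aa = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins. A large error relative to the value, as for GlnGln (3.032 ± 1.784), would be consistent with a dipeptide concentrated in a few proteins.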

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski