Amino acid dipepetide frequency for Tai Forest reovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.987AlaAla: 4.987 ± 1.41
0.943AlaCys: 0.943 ± 0.265
3.369AlaAsp: 3.369 ± 0.582
4.178AlaGlu: 4.178 ± 0.936
2.695AlaPhe: 2.695 ± 0.493
4.987AlaGly: 4.987 ± 0.898
1.482AlaHis: 1.482 ± 0.313
3.369AlaIle: 3.369 ± 0.596
3.774AlaLys: 3.774 ± 0.922
9.03AlaLeu: 9.03 ± 0.432
2.426AlaMet: 2.426 ± 0.549
2.426AlaAsn: 2.426 ± 0.514
2.83AlaPro: 2.83 ± 0.797
2.426AlaGln: 2.426 ± 0.507
6.334AlaArg: 6.334 ± 0.759
3.235AlaSer: 3.235 ± 0.617
2.022AlaThr: 2.022 ± 0.55
5.526AlaVal: 5.526 ± 0.594
1.078AlaTrp: 1.078 ± 0.342
2.156AlaTyr: 2.156 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.809CysAla: 0.809 ± 0.323
0.135CysCys: 0.135 ± 0.151
1.213CysAsp: 1.213 ± 0.728
0.809CysGlu: 0.809 ± 0.271
1.213CysPhe: 1.213 ± 0.278
1.482CysGly: 1.482 ± 0.55
0.404CysHis: 0.404 ± 0.162
0.404CysIle: 0.404 ± 0.281
0.674CysLys: 0.674 ± 0.34
0.539CysLeu: 0.539 ± 0.297
0.135CysMet: 0.135 ± 0.167
0.674CysAsn: 0.674 ± 0.268
0.674CysPro: 0.674 ± 0.196
0.27CysGln: 0.27 ± 0.15
0.943CysArg: 0.943 ± 0.308
1.348CysSer: 1.348 ± 0.243
0.539CysThr: 0.539 ± 0.244
1.617CysVal: 1.617 ± 0.562
0.135CysTrp: 0.135 ± 0.112
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.391AspAla: 5.391 ± 1.162
0.943AspCys: 0.943 ± 0.307
5.256AspAsp: 5.256 ± 0.945
5.256AspGlu: 5.256 ± 1.166
2.291AspPhe: 2.291 ± 0.526
4.447AspGly: 4.447 ± 0.562
1.078AspHis: 1.078 ± 0.388
2.965AspIle: 2.965 ± 0.652
2.022AspLys: 2.022 ± 0.559
5.93AspLeu: 5.93 ± 1.038
2.022AspMet: 2.022 ± 0.343
1.887AspAsn: 1.887 ± 0.707
3.908AspPro: 3.908 ± 0.702
2.022AspGln: 2.022 ± 0.437
4.313AspArg: 4.313 ± 0.78
2.695AspSer: 2.695 ± 0.768
0.674AspThr: 0.674 ± 0.311
6.469AspVal: 6.469 ± 1.309
1.482AspTrp: 1.482 ± 0.448
1.752AspTyr: 1.752 ± 0.381
0.0AspXaa: 0.0 ± 0.0
Glu
5.526GluAla: 5.526 ± 0.747
1.078GluCys: 1.078 ± 0.422
3.774GluAsp: 3.774 ± 0.537
2.561GluGlu: 2.561 ± 0.553
3.235GluPhe: 3.235 ± 1.06
5.93GluGly: 5.93 ± 0.902
0.943GluHis: 0.943 ± 0.403
3.369GluIle: 3.369 ± 0.4
4.852GluLys: 4.852 ± 1.17
5.256GluLeu: 5.256 ± 0.79
2.695GluMet: 2.695 ± 0.47
3.504GluAsn: 3.504 ± 0.763
2.291GluPro: 2.291 ± 0.303
1.482GluGln: 1.482 ± 0.307
6.604GluArg: 6.604 ± 0.952
3.369GluSer: 3.369 ± 0.359
3.235GluThr: 3.235 ± 0.672
5.93GluVal: 5.93 ± 0.83
1.348GluTrp: 1.348 ± 0.319
2.291GluTyr: 2.291 ± 0.593
0.0GluXaa: 0.0 ± 0.0
Phe
2.022PheAla: 2.022 ± 0.524
1.078PheCys: 1.078 ± 0.277
3.235PheAsp: 3.235 ± 0.436
3.235PheGlu: 3.235 ± 0.733
1.887PhePhe: 1.887 ± 0.343
3.639PheGly: 3.639 ± 0.83
1.482PheHis: 1.482 ± 0.29
2.561PheIle: 2.561 ± 0.79
1.752PheLys: 1.752 ± 0.37
4.043PheLeu: 4.043 ± 0.556
1.348PheMet: 1.348 ± 0.268
1.887PheAsn: 1.887 ± 0.346
2.022PhePro: 2.022 ± 0.458
1.752PheGln: 1.752 ± 0.283
2.561PheArg: 2.561 ± 0.336
3.639PheSer: 3.639 ± 0.795
0.943PheThr: 0.943 ± 0.401
2.965PheVal: 2.965 ± 0.633
0.135PheTrp: 0.135 ± 0.131
0.809PheTyr: 0.809 ± 0.343
0.0PheXaa: 0.0 ± 0.0
Gly
6.199GlyAla: 6.199 ± 0.564
1.617GlyCys: 1.617 ± 0.529
4.987GlyAsp: 4.987 ± 0.721
4.447GlyGlu: 4.447 ± 1.11
2.83GlyPhe: 2.83 ± 0.384
5.391GlyGly: 5.391 ± 0.541
0.539GlyHis: 0.539 ± 0.252
3.369GlyIle: 3.369 ± 0.598
3.369GlyLys: 3.369 ± 0.642
7.143GlyLeu: 7.143 ± 0.659
3.639GlyMet: 3.639 ± 0.83
1.482GlyAsn: 1.482 ± 0.473
2.561GlyPro: 2.561 ± 0.722
2.022GlyGln: 2.022 ± 0.385
6.739GlyArg: 6.739 ± 0.487
3.504GlySer: 3.504 ± 0.734
4.043GlyThr: 4.043 ± 0.683
6.739GlyVal: 6.739 ± 0.797
2.561GlyTrp: 2.561 ± 0.423
2.156GlyTyr: 2.156 ± 0.64
0.0GlyXaa: 0.0 ± 0.0
His
1.617HisAla: 1.617 ± 0.299
0.135HisCys: 0.135 ± 0.131
1.213HisAsp: 1.213 ± 0.573
1.752HisGlu: 1.752 ± 0.577
0.809HisPhe: 0.809 ± 0.29
2.022HisGly: 2.022 ± 0.692
0.404HisHis: 0.404 ± 0.223
1.078HisIle: 1.078 ± 0.461
0.27HisLys: 0.27 ± 0.267
1.482HisLeu: 1.482 ± 0.334
0.674HisMet: 0.674 ± 0.246
0.674HisAsn: 0.674 ± 0.268
1.213HisPro: 1.213 ± 0.42
0.943HisGln: 0.943 ± 0.237
1.887HisArg: 1.887 ± 0.418
0.943HisSer: 0.943 ± 0.421
1.213HisThr: 1.213 ± 0.256
2.022HisVal: 2.022 ± 0.423
0.539HisTrp: 0.539 ± 0.188
1.213HisTyr: 1.213 ± 0.474
0.0HisXaa: 0.0 ± 0.0
Ile
3.639IleAla: 3.639 ± 0.921
1.078IleCys: 1.078 ± 0.394
3.504IleAsp: 3.504 ± 0.654
2.83IleGlu: 2.83 ± 0.459
2.426IlePhe: 2.426 ± 0.46
3.774IleGly: 3.774 ± 0.664
1.617IleHis: 1.617 ± 0.482
2.695IleIle: 2.695 ± 0.506
2.291IleLys: 2.291 ± 0.495
4.717IleLeu: 4.717 ± 0.399
1.617IleMet: 1.617 ± 0.362
1.482IleAsn: 1.482 ± 0.548
4.717IlePro: 4.717 ± 1.124
2.426IleGln: 2.426 ± 0.635
3.504IleArg: 3.504 ± 0.749
2.965IleSer: 2.965 ± 0.619
2.695IleThr: 2.695 ± 0.402
3.369IleVal: 3.369 ± 0.938
0.27IleTrp: 0.27 ± 0.15
2.022IleTyr: 2.022 ± 0.554
0.0IleXaa: 0.0 ± 0.0
Lys
2.426LysAla: 2.426 ± 0.813
0.404LysCys: 0.404 ± 0.152
2.561LysAsp: 2.561 ± 0.588
4.447LysGlu: 4.447 ± 0.677
2.561LysPhe: 2.561 ± 0.57
2.561LysGly: 2.561 ± 0.785
1.348LysHis: 1.348 ± 0.353
3.235LysIle: 3.235 ± 0.628
2.291LysLys: 2.291 ± 0.459
4.447LysLeu: 4.447 ± 0.882
1.617LysMet: 1.617 ± 0.594
1.348LysAsn: 1.348 ± 0.4
1.617LysPro: 1.617 ± 0.252
1.752LysGln: 1.752 ± 0.453
2.965LysArg: 2.965 ± 0.674
2.561LysSer: 2.561 ± 0.611
2.291LysThr: 2.291 ± 0.638
3.908LysVal: 3.908 ± 0.472
0.404LysTrp: 0.404 ± 0.291
1.348LysTyr: 1.348 ± 0.584
0.0LysXaa: 0.0 ± 0.0
Leu
5.391LeuAla: 5.391 ± 1.143
1.213LeuCys: 1.213 ± 0.239
4.852LeuAsp: 4.852 ± 0.576
7.278LeuGlu: 7.278 ± 0.971
3.908LeuPhe: 3.908 ± 0.763
7.008LeuGly: 7.008 ± 1.192
2.426LeuHis: 2.426 ± 0.747
4.313LeuIle: 4.313 ± 0.728
4.447LeuLys: 4.447 ± 0.592
7.547LeuLeu: 7.547 ± 0.982
3.639LeuMet: 3.639 ± 0.638
3.369LeuAsn: 3.369 ± 0.522
5.66LeuPro: 5.66 ± 0.724
3.235LeuGln: 3.235 ± 0.34
8.491LeuArg: 8.491 ± 1.042
7.143LeuSer: 7.143 ± 0.663
3.235LeuThr: 3.235 ± 0.527
4.178LeuVal: 4.178 ± 0.35
0.943LeuTrp: 0.943 ± 0.31
2.291LeuTyr: 2.291 ± 0.498
0.135LeuXaa: 0.135 ± 0.106
Met
2.83MetAla: 2.83 ± 0.608
0.539MetCys: 0.539 ± 0.283
2.561MetAsp: 2.561 ± 0.579
2.156MetGlu: 2.156 ± 0.406
1.617MetPhe: 1.617 ± 0.271
1.887MetGly: 1.887 ± 0.414
0.809MetHis: 0.809 ± 0.211
2.156MetIle: 2.156 ± 0.713
1.482MetLys: 1.482 ± 0.577
3.504MetLeu: 3.504 ± 0.478
2.156MetMet: 2.156 ± 0.318
1.752MetAsn: 1.752 ± 0.49
1.617MetPro: 1.617 ± 0.457
0.539MetGln: 0.539 ± 0.255
2.426MetArg: 2.426 ± 0.479
1.617MetSer: 1.617 ± 0.355
1.887MetThr: 1.887 ± 0.358
2.965MetVal: 2.965 ± 0.638
0.539MetTrp: 0.539 ± 0.141
1.078MetTyr: 1.078 ± 0.283
0.0MetXaa: 0.0 ± 0.0
Asn
1.887AsnAla: 1.887 ± 0.611
0.539AsnCys: 0.539 ± 0.278
2.426AsnAsp: 2.426 ± 0.96
2.426AsnGlu: 2.426 ± 0.303
1.348AsnPhe: 1.348 ± 0.278
2.83AsnGly: 2.83 ± 0.643
1.213AsnHis: 1.213 ± 0.472
2.022AsnIle: 2.022 ± 0.371
0.404AsnLys: 0.404 ± 0.152
2.695AsnLeu: 2.695 ± 0.778
1.617AsnMet: 1.617 ± 0.496
0.809AsnAsn: 0.809 ± 0.314
1.617AsnPro: 1.617 ± 0.403
1.617AsnGln: 1.617 ± 0.434
2.561AsnArg: 2.561 ± 0.431
1.887AsnSer: 1.887 ± 0.493
1.482AsnThr: 1.482 ± 0.329
3.908AsnVal: 3.908 ± 0.644
0.943AsnTrp: 0.943 ± 0.282
1.617AsnTyr: 1.617 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
3.639ProAla: 3.639 ± 0.586
0.135ProCys: 0.135 ± 0.133
2.83ProAsp: 2.83 ± 0.62
4.043ProGlu: 4.043 ± 0.689
2.022ProPhe: 2.022 ± 0.565
2.695ProGly: 2.695 ± 0.436
1.078ProHis: 1.078 ± 0.428
2.022ProIle: 2.022 ± 0.625
2.156ProLys: 2.156 ± 0.415
5.121ProLeu: 5.121 ± 0.768
1.213ProMet: 1.213 ± 0.336
1.752ProAsn: 1.752 ± 0.474
2.561ProPro: 2.561 ± 0.656
1.213ProGln: 1.213 ± 0.387
2.156ProArg: 2.156 ± 0.47
3.774ProSer: 3.774 ± 0.611
2.561ProThr: 2.561 ± 0.417
3.908ProVal: 3.908 ± 0.603
0.404ProTrp: 0.404 ± 0.194
2.426ProTyr: 2.426 ± 0.459
0.0ProXaa: 0.0 ± 0.0
Gln
3.369GlnAla: 3.369 ± 0.639
0.0GlnCys: 0.0 ± 0.0
1.213GlnAsp: 1.213 ± 0.413
2.426GlnGlu: 2.426 ± 0.517
1.078GlnPhe: 1.078 ± 0.33
1.752GlnGly: 1.752 ± 0.545
1.078GlnHis: 1.078 ± 0.317
2.022GlnIle: 2.022 ± 0.607
2.426GlnLys: 2.426 ± 0.425
2.426GlnLeu: 2.426 ± 0.525
1.348GlnMet: 1.348 ± 0.232
1.617GlnAsn: 1.617 ± 0.537
0.674GlnPro: 0.674 ± 0.351
1.078GlnGln: 1.078 ± 0.278
1.482GlnArg: 1.482 ± 0.276
0.943GlnSer: 0.943 ± 0.261
1.078GlnThr: 1.078 ± 0.323
3.1GlnVal: 3.1 ± 0.701
0.674GlnTrp: 0.674 ± 0.314
0.809GlnTyr: 0.809 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
4.582ArgAla: 4.582 ± 0.685
0.674ArgCys: 0.674 ± 0.25
4.043ArgAsp: 4.043 ± 0.903
6.334ArgGlu: 6.334 ± 0.421
2.156ArgPhe: 2.156 ± 0.402
6.604ArgGly: 6.604 ± 0.855
1.887ArgHis: 1.887 ± 0.312
4.447ArgIle: 4.447 ± 0.751
3.639ArgLys: 3.639 ± 0.579
7.278ArgLeu: 7.278 ± 0.983
2.965ArgMet: 2.965 ± 0.397
3.1ArgAsn: 3.1 ± 0.577
3.369ArgPro: 3.369 ± 0.806
1.482ArgGln: 1.482 ± 0.378
6.334ArgArg: 6.334 ± 1.073
2.695ArgSer: 2.695 ± 0.562
3.369ArgThr: 3.369 ± 0.736
4.582ArgVal: 4.582 ± 0.665
2.291ArgTrp: 2.291 ± 0.324
3.1ArgTyr: 3.1 ± 0.58
0.0ArgXaa: 0.0 ± 0.0
Ser
4.313SerAla: 4.313 ± 0.94
0.943SerCys: 0.943 ± 0.229
3.369SerAsp: 3.369 ± 0.594
4.717SerGlu: 4.717 ± 0.82
2.83SerPhe: 2.83 ± 0.669
4.447SerGly: 4.447 ± 0.486
0.809SerHis: 0.809 ± 0.272
2.156SerIle: 2.156 ± 0.561
2.965SerLys: 2.965 ± 0.661
4.987SerLeu: 4.987 ± 0.377
1.617SerMet: 1.617 ± 0.463
2.156SerAsn: 2.156 ± 0.217
1.617SerPro: 1.617 ± 0.444
1.213SerGln: 1.213 ± 0.541
3.235SerArg: 3.235 ± 0.366
3.1SerSer: 3.1 ± 0.663
4.043SerThr: 4.043 ± 0.4
3.639SerVal: 3.639 ± 0.493
0.809SerTrp: 0.809 ± 0.304
1.887SerTyr: 1.887 ± 0.431
0.27SerXaa: 0.27 ± 0.119
Thr
2.965ThrAla: 2.965 ± 0.422
0.404ThrCys: 0.404 ± 0.218
3.235ThrAsp: 3.235 ± 0.763
2.291ThrGlu: 2.291 ± 0.495
2.695ThrPhe: 2.695 ± 0.528
3.639ThrGly: 3.639 ± 0.714
0.674ThrHis: 0.674 ± 0.216
2.83ThrIle: 2.83 ± 0.601
2.022ThrLys: 2.022 ± 0.652
3.369ThrLeu: 3.369 ± 0.738
1.617ThrMet: 1.617 ± 0.442
1.213ThrAsn: 1.213 ± 0.488
1.887ThrPro: 1.887 ± 0.518
0.809ThrGln: 0.809 ± 0.233
2.156ThrArg: 2.156 ± 0.684
3.1ThrSer: 3.1 ± 0.763
2.156ThrThr: 2.156 ± 0.551
4.043ThrVal: 4.043 ± 0.63
0.943ThrTrp: 0.943 ± 0.285
2.156ThrTyr: 2.156 ± 0.359
0.0ThrXaa: 0.0 ± 0.0
Val
5.121ValAla: 5.121 ± 0.846
1.617ValCys: 1.617 ± 0.557
5.391ValAsp: 5.391 ± 0.654
4.178ValGlu: 4.178 ± 0.645
3.774ValPhe: 3.774 ± 0.544
5.66ValGly: 5.66 ± 0.965
1.482ValHis: 1.482 ± 0.447
4.178ValIle: 4.178 ± 0.43
3.369ValLys: 3.369 ± 0.718
6.334ValLeu: 6.334 ± 0.893
2.156ValMet: 2.156 ± 0.431
3.1ValAsn: 3.1 ± 0.94
4.717ValPro: 4.717 ± 0.943
2.291ValGln: 2.291 ± 0.561
6.469ValArg: 6.469 ± 0.604
4.987ValSer: 4.987 ± 0.468
3.504ValThr: 3.504 ± 0.503
5.526ValVal: 5.526 ± 0.925
1.078ValTrp: 1.078 ± 0.246
2.83ValTyr: 2.83 ± 0.421
0.0ValXaa: 0.0 ± 0.0
Trp
0.943TrpAla: 0.943 ± 0.287
0.404TrpCys: 0.404 ± 0.225
0.809TrpAsp: 0.809 ± 0.323
1.348TrpGlu: 1.348 ± 0.51
0.539TrpPhe: 0.539 ± 0.224
0.674TrpGly: 0.674 ± 0.213
0.135TrpHis: 0.135 ± 0.112
1.617TrpIle: 1.617 ± 0.52
0.539TrpLys: 0.539 ± 0.262
2.426TrpLeu: 2.426 ± 0.474
0.539TrpMet: 0.539 ± 0.181
0.404TrpAsn: 0.404 ± 0.142
0.27TrpPro: 0.27 ± 0.267
0.674TrpGln: 0.674 ± 0.242
1.887TrpArg: 1.887 ± 0.512
0.539TrpSer: 0.539 ± 0.259
0.943TrpThr: 0.943 ± 0.226
1.348TrpVal: 1.348 ± 0.323
0.404TrpTrp: 0.404 ± 0.171
0.674TrpTyr: 0.674 ± 0.274
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.617TyrAla: 1.617 ± 0.331
0.27TyrCys: 0.27 ± 0.173
2.965TyrAsp: 2.965 ± 0.551
2.561TyrGlu: 2.561 ± 0.312
1.078TyrPhe: 1.078 ± 0.354
3.908TyrGly: 3.908 ± 0.572
1.213TyrHis: 1.213 ± 0.364
2.83TyrIle: 2.83 ± 0.482
1.213TyrLys: 1.213 ± 0.355
2.426TyrLeu: 2.426 ± 0.51
0.943TyrMet: 0.943 ± 0.486
1.213TyrAsn: 1.213 ± 0.328
1.887TyrPro: 1.887 ± 0.435
1.348TyrGln: 1.348 ± 0.389
1.887TyrArg: 1.887 ± 0.513
1.078TyrSer: 1.078 ± 0.415
2.156TyrThr: 2.156 ± 0.458
2.022TyrVal: 2.022 ± 0.731
0.0TyrTrp: 0.0 ± 0.0
1.348TyrTyr: 1.348 ± 0.275
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.135XaaLys: 0.135 ± 0.106
0.0XaaLeu: 0.0 ± 0.0
0.135XaaMet: 0.135 ± 0.101
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.135XaaTrp: 0.135 ± 0.106
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (7421 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski