Amino acid dipeptide frequency for Cangyuan orthoreovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
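The counting and the comparison with the uniform expectation can be sketched as follows. This is only an illustrative sketch, not the code used to build the table: the function names, the toy sequences and the simple above/below threshold are assumptions, and sequences are written in one-letter code rather than the three-letter code used in the table.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MALLS", "LSLA"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With this rule, a table value such as 14.445‰ would be labelled "over" and 0.136‰ would be labelled "under".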

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
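As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the value 14.445 is the LeuSer entry from the table.

```python
def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


leu_ser = 14.445  # LeuSer frequency in per mille, from the table
print(permille_to_fraction(leu_ser))  # fraction of all dipeptides
print(permille_to_percent(leu_ser))   # the same value in percent
```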

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.722 ± 1.595
AlaCys: 1.09 ± 0.258
AlaAsp: 5.042 ± 0.736
AlaGlu: 2.453 ± 0.551
AlaPhe: 2.453 ± 0.617
AlaGly: 3.271 ± 0.543
AlaHis: 1.363 ± 0.52
AlaIle: 4.361 ± 0.602
AlaLys: 2.998 ± 0.568
AlaLeu: 7.495 ± 1.043
AlaMet: 2.589 ± 0.609
AlaAsn: 2.862 ± 0.601
AlaPro: 5.179 ± 0.857
AlaGln: 3.407 ± 0.774
AlaArg: 4.77 ± 0.88
AlaSer: 8.858 ± 0.772
AlaThr: 5.315 ± 0.842
AlaVal: 5.179 ± 0.611
AlaTrp: 1.772 ± 0.518
AlaTyr: 1.772 ± 0.319
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.09 ± 0.238
CysCys: 0.273 ± 0.3
CysAsp: 0.954 ± 0.304
CysGlu: 0.136 ± 0.112
CysPhe: 0.545 ± 0.198
CysGly: 0.818 ± 0.309
CysHis: 0.409 ± 0.365
CysIle: 0.273 ± 0.171
CysLys: 0.136 ± 0.123
CysLeu: 2.317 ± 0.532
CysMet: 0.273 ± 0.129
CysAsn: 0.681 ± 0.483
CysPro: 0.545 ± 0.222
CysGln: 0.409 ± 0.141
CysArg: 0.681 ± 0.168
CysSer: 1.772 ± 0.341
CysThr: 0.954 ± 0.32
CysVal: 0.954 ± 0.175
CysTrp: 0.273 ± 0.171
CysTyr: 0.545 ± 0.296
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.587 ± 0.893
AspCys: 0.681 ± 0.213
AspAsp: 4.77 ± 0.904
AspGlu: 1.908 ± 0.699
AspPhe: 3.271 ± 0.407
AspGly: 2.726 ± 0.582
AspHis: 1.635 ± 0.498
AspIle: 2.044 ± 0.538
AspLys: 1.908 ± 0.51
AspLeu: 5.451 ± 0.704
AspMet: 0.954 ± 0.337
AspAsn: 2.317 ± 0.436
AspPro: 3.134 ± 0.736
AspGln: 1.635 ± 0.629
AspArg: 2.862 ± 0.626
AspSer: 4.77 ± 0.878
AspThr: 2.317 ± 0.379
AspVal: 6.405 ± 0.933
AspTrp: 1.772 ± 0.771
AspTyr: 2.18 ± 1.195
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.772 ± 0.467
GluCys: 1.09 ± 0.298
GluAsp: 2.044 ± 0.607
GluGlu: 1.226 ± 0.729
GluPhe: 1.09 ± 0.318
GluGly: 1.09 ± 0.463
GluHis: 1.226 ± 0.618
GluIle: 1.363 ± 0.453
GluLys: 1.772 ± 0.626
GluLeu: 4.088 ± 0.949
GluMet: 1.09 ± 0.534
GluAsn: 1.09 ± 0.404
GluPro: 0.954 ± 0.327
GluGln: 0.681 ± 0.295
GluArg: 2.18 ± 0.581
GluSer: 3.407 ± 0.588
GluThr: 3.407 ± 1.055
GluVal: 2.044 ± 0.624
GluTrp: 0.681 ± 0.147
GluTyr: 1.908 ± 0.438
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.18 ± 0.553
PheCys: 0.818 ± 0.33
PheAsp: 2.317 ± 0.595
PheGlu: 1.908 ± 0.606
PhePhe: 1.226 ± 0.662
PheGly: 2.862 ± 0.561
PheHis: 0.409 ± 0.185
PheIle: 1.908 ± 0.497
PheLys: 1.499 ± 0.53
PheLeu: 3.816 ± 0.631
PheMet: 0.954 ± 0.262
PheAsn: 2.044 ± 0.613
PhePro: 3.407 ± 0.686
PheGln: 1.772 ± 0.51
PheArg: 1.635 ± 0.361
PheSer: 4.088 ± 0.745
PheThr: 3.271 ± 0.799
PheVal: 2.589 ± 0.47
PheTrp: 0.545 ± 0.291
PheTyr: 0.681 ± 0.249
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.543 ± 0.332
GlyCys: 0.409 ± 0.228
GlyAsp: 2.862 ± 0.569
GlyGlu: 0.681 ± 0.319
GlyPhe: 1.772 ± 0.356
GlyGly: 1.635 ± 0.323
GlyHis: 0.818 ± 0.361
GlyIle: 2.18 ± 0.469
GlyLys: 1.908 ± 0.422
GlyLeu: 5.451 ± 0.82
GlyMet: 1.635 ± 0.452
GlyAsn: 1.772 ± 0.479
GlyPro: 1.908 ± 0.426
GlyGln: 1.499 ± 0.522
GlyArg: 2.317 ± 0.493
GlySer: 5.451 ± 1.32
GlyThr: 2.453 ± 0.803
GlyVal: 5.042 ± 0.661
GlyTrp: 0.818 ± 0.391
GlyTyr: 1.772 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.818 ± 0.19
HisCys: 0.0 ± 0.0
HisAsp: 0.954 ± 0.459
HisGlu: 0.545 ± 0.233
HisPhe: 0.818 ± 0.263
HisGly: 1.635 ± 0.373
HisHis: 0.818 ± 0.368
HisIle: 0.409 ± 0.292
HisLys: 0.545 ± 0.168
HisLeu: 1.499 ± 0.397
HisMet: 0.409 ± 0.172
HisAsn: 0.136 ± 0.112
HisPro: 1.226 ± 0.395
HisGln: 1.226 ± 0.686
HisArg: 0.954 ± 0.435
HisSer: 2.589 ± 0.648
HisThr: 0.818 ± 0.422
HisVal: 2.726 ± 0.762
HisTrp: 0.273 ± 0.15
HisTyr: 0.681 ± 0.249
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.361 ± 0.829
IleCys: 0.818 ± 0.254
IleAsp: 3.543 ± 0.411
IleGlu: 1.09 ± 0.254
IlePhe: 1.772 ± 0.527
IleGly: 1.908 ± 0.509
IleHis: 1.226 ± 0.321
IleIle: 1.772 ± 0.568
IleLys: 1.09 ± 0.274
IleLeu: 3.271 ± 0.723
IleMet: 0.681 ± 0.299
IleAsn: 1.772 ± 0.406
IlePro: 2.18 ± 0.772
IleGln: 2.18 ± 0.596
IleArg: 4.088 ± 0.669
IleSer: 4.088 ± 0.604
IleThr: 3.407 ± 0.668
IleVal: 3.543 ± 0.586
IleTrp: 0.273 ± 0.173
IleTyr: 0.954 ± 0.441
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.453 ± 0.403
LysCys: 0.545 ± 0.227
LysAsp: 1.908 ± 0.431
LysGlu: 1.772 ± 0.653
LysPhe: 1.499 ± 0.461
LysGly: 2.044 ± 0.583
LysHis: 0.818 ± 0.222
LysIle: 1.09 ± 0.349
LysLys: 0.681 ± 0.283
LysLeu: 3.952 ± 0.747
LysMet: 0.954 ± 0.33
LysAsn: 1.09 ± 0.379
LysPro: 2.044 ± 0.591
LysGln: 1.226 ± 0.376
LysArg: 0.818 ± 0.345
LysSer: 2.453 ± 0.719
LysThr: 3.134 ± 0.553
LysVal: 2.589 ± 0.477
LysTrp: 0.545 ± 0.239
LysTyr: 1.499 ± 0.535
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.267 ± 1.134
LeuCys: 1.226 ± 0.324
LeuAsp: 5.315 ± 0.857
LeuGlu: 3.543 ± 0.609
LeuPhe: 4.088 ± 0.41
LeuGly: 3.271 ± 0.605
LeuHis: 0.954 ± 0.317
LeuIle: 3.271 ± 0.469
LeuLys: 3.816 ± 0.569
LeuLeu: 12.81 ± 1.422
LeuMet: 2.453 ± 0.381
LeuAsn: 5.724 ± 0.659
LeuPro: 6.405 ± 0.914
LeuGln: 3.679 ± 0.584
LeuArg: 6.541 ± 0.774
LeuSer: 14.445 ± 1.229
LeuThr: 9.403 ± 1.255
LeuVal: 6.269 ± 1.007
LeuTrp: 1.09 ± 0.314
LeuTyr: 1.772 ± 0.482
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.18 ± 0.527
MetCys: 0.273 ± 0.191
MetAsp: 1.499 ± 0.547
MetGlu: 0.818 ± 0.316
MetPhe: 1.226 ± 0.287
MetGly: 1.09 ± 0.333
MetHis: 0.409 ± 0.191
MetIle: 0.954 ± 0.379
MetLys: 0.818 ± 0.313
MetLeu: 2.317 ± 0.602
MetMet: 1.908 ± 0.556
MetAsn: 1.635 ± 0.429
MetPro: 1.499 ± 0.385
MetGln: 1.363 ± 0.363
MetArg: 1.226 ± 0.372
MetSer: 2.726 ± 0.604
MetThr: 2.317 ± 0.611
MetVal: 1.635 ± 0.359
MetTrp: 0.409 ± 0.228
MetTyr: 1.09 ± 0.375
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.407 ± 0.828
AsnCys: 0.681 ± 0.193
AsnAsp: 2.044 ± 0.466
AsnGlu: 1.09 ± 0.48
AsnPhe: 1.226 ± 0.286
AsnGly: 2.862 ± 0.493
AsnHis: 0.681 ± 0.23
AsnIle: 2.453 ± 0.838
AsnLys: 0.818 ± 0.431
AsnLeu: 3.134 ± 0.824
AsnMet: 0.954 ± 0.482
AsnAsn: 1.363 ± 0.51
AsnPro: 3.134 ± 0.836
AsnGln: 2.317 ± 0.7
AsnArg: 2.317 ± 0.573
AsnSer: 3.543 ± 0.811
AsnThr: 2.726 ± 0.749
AsnVal: 3.952 ± 0.785
AsnTrp: 0.409 ± 0.26
AsnTyr: 0.954 ± 0.475
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.361 ± 0.873
ProCys: 0.409 ± 0.141
ProAsp: 3.543 ± 0.674
ProGlu: 2.589 ± 0.801
ProPhe: 3.407 ± 0.588
ProGly: 2.726 ± 0.612
ProHis: 1.09 ± 0.577
ProIle: 3.679 ± 0.405
ProLys: 1.499 ± 0.559
ProLeu: 6.814 ± 0.805
ProMet: 1.772 ± 0.465
ProAsn: 2.726 ± 0.58
ProPro: 4.088 ± 0.98
ProGln: 1.499 ± 0.309
ProArg: 2.044 ± 0.623
ProSer: 8.04 ± 1.242
ProThr: 4.361 ± 0.583
ProVal: 5.451 ± 0.793
ProTrp: 1.363 ± 0.286
ProTyr: 1.499 ± 0.57
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.998 ± 0.924
GlnCys: 1.499 ± 0.379
GlnAsp: 1.226 ± 0.244
GlnGlu: 0.545 ± 0.166
GlnPhe: 1.499 ± 0.288
GlnGly: 1.363 ± 0.293
GlnHis: 0.681 ± 0.299
GlnIle: 1.635 ± 0.451
GlnLys: 1.499 ± 0.451
GlnLeu: 5.315 ± 0.571
GlnMet: 0.954 ± 0.242
GlnAsn: 1.772 ± 0.739
GlnPro: 1.908 ± 0.631
GlnGln: 1.908 ± 0.353
GlnArg: 2.18 ± 0.863
GlnSer: 4.088 ± 0.796
GlnThr: 2.589 ± 0.439
GlnVal: 3.543 ± 0.437
GlnTrp: 0.545 ± 0.241
GlnTyr: 1.226 ± 0.37
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.497 ± 0.541
ArgCys: 0.954 ± 0.453
ArgAsp: 3.134 ± 0.386
ArgGlu: 2.044 ± 0.765
ArgPhe: 1.908 ± 0.452
ArgGly: 2.18 ± 0.64
ArgHis: 1.499 ± 0.26
ArgIle: 2.726 ± 0.469
ArgLys: 1.09 ± 0.41
ArgLeu: 5.724 ± 0.895
ArgMet: 1.09 ± 0.545
ArgAsn: 1.499 ± 0.274
ArgPro: 2.862 ± 0.375
ArgGln: 2.726 ± 0.477
ArgArg: 4.633 ± 0.954
ArgSer: 5.315 ± 0.864
ArgThr: 2.589 ± 0.616
ArgVal: 4.906 ± 0.897
ArgTrp: 1.09 ± 0.303
ArgTyr: 1.499 ± 0.386
ArgXaa: 0.136 ± 0.144
Ser
SerAla: 9.403 ± 0.881
SerCys: 1.226 ± 0.432
SerAsp: 6.132 ± 0.713
SerGlu: 4.906 ± 0.841
SerPhe: 4.361 ± 0.709
SerGly: 5.587 ± 1.034
SerHis: 1.499 ± 0.632
SerIle: 4.906 ± 0.994
SerLys: 2.862 ± 0.544
SerLeu: 10.63 ± 1.687
SerMet: 2.453 ± 0.487
SerAsn: 3.271 ± 0.546
SerPro: 8.177 ± 1.337
SerGln: 4.361 ± 0.415
SerArg: 4.77 ± 0.78
SerSer: 11.856 ± 1.287
SerThr: 8.177 ± 0.923
SerVal: 8.313 ± 1.484
SerTrp: 1.772 ± 0.383
SerTyr: 3.134 ± 0.268
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.315 ± 0.78
ThrCys: 0.818 ± 0.356
ThrAsp: 3.407 ± 0.488
ThrGlu: 2.726 ± 0.719
ThrPhe: 3.407 ± 0.786
ThrGly: 3.134 ± 0.41
ThrHis: 1.499 ± 0.301
ThrIle: 2.453 ± 0.368
ThrLys: 3.679 ± 0.454
ThrLeu: 7.359 ± 1.431
ThrMet: 1.908 ± 0.588
ThrAsn: 2.726 ± 0.842
ThrPro: 5.724 ± 1.206
ThrGln: 2.726 ± 0.765
ThrArg: 2.317 ± 0.542
ThrSer: 7.904 ± 0.97
ThrThr: 4.906 ± 1.104
ThrVal: 6.678 ± 1.077
ThrTrp: 1.499 ± 0.355
ThrTyr: 0.818 ± 0.197
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.996 ± 0.952
ValCys: 0.954 ± 0.314
ValAsp: 4.906 ± 0.684
ValGlu: 2.589 ± 0.649
ValPhe: 2.18 ± 0.496
ValGly: 3.679 ± 0.401
ValHis: 1.635 ± 0.404
ValIle: 4.77 ± 0.964
ValLys: 2.726 ± 0.641
ValLeu: 8.177 ± 1.546
ValMet: 2.453 ± 0.499
ValAsn: 3.816 ± 0.516
ValPro: 6.132 ± 0.891
ValGln: 2.726 ± 0.411
ValArg: 5.724 ± 0.51
ValSer: 7.904 ± 0.881
ValThr: 6.269 ± 1.114
ValVal: 4.77 ± 0.385
ValTrp: 0.954 ± 0.33
ValTyr: 1.09 ± 0.297
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.363 ± 0.527
TrpCys: 0.136 ± 0.137
TrpAsp: 0.954 ± 0.244
TrpGlu: 0.681 ± 0.355
TrpPhe: 1.09 ± 0.282
TrpGly: 0.409 ± 0.245
TrpHis: 0.0 ± 0.0
TrpIle: 0.545 ± 0.271
TrpLys: 0.818 ± 0.456
TrpLeu: 2.044 ± 0.29
TrpMet: 1.09 ± 0.316
TrpAsn: 0.545 ± 0.219
TrpPro: 0.954 ± 0.223
TrpGln: 0.818 ± 0.231
TrpArg: 1.09 ± 0.343
TrpSer: 0.954 ± 0.325
TrpThr: 1.772 ± 0.669
TrpVal: 0.954 ± 0.287
TrpTrp: 0.136 ± 0.106
TrpTyr: 0.409 ± 0.207
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.635 ± 0.437
TyrCys: 0.136 ± 0.15
TyrAsp: 1.772 ± 0.47
TyrGlu: 0.818 ± 0.197
TyrPhe: 1.09 ± 0.42
TyrGly: 1.635 ± 0.415
TyrHis: 0.409 ± 0.203
TyrIle: 1.226 ± 0.456
TyrLys: 0.954 ± 0.303
TyrLeu: 3.679 ± 0.588
TyrMet: 0.681 ± 0.21
TyrAsn: 1.226 ± 0.337
TyrPro: 1.499 ± 0.347
TyrGln: 0.818 ± 0.253
TyrArg: 0.954 ± 0.429
TyrSer: 3.543 ± 0.586
TyrThr: 0.954 ± 0.438
TyrVal: 1.908 ± 0.49
TyrTrp: 0.545 ± 0.177
TyrTyr: 0.954 ± 0.271
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.136 ± 0.144
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (7339 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
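One way such a protein-level bootstrap could look is sketched below. The exact procedure used for this table is not described here, so this is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names and toy sequences (in one-letter code) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over the given sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the Cangyuan orthoreovirus sequences.
toy = ["MALLSLA", "MLSTV", "MAAALS", "MKVLSS"]
print(round(pair_permille(toy, "LS"), 3), "±", round(bootstrap_error(toy, "LS"), 3))
```

With only 10 proteins in this proteome, each replicate draws 10 proteins, which is consistent with the relatively large errors on rare dipeptides.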

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski