Amino acid dipeptide frequency for Parainfluenza virus 5 (strain W3) (PIV5) (Simian virus 5)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins randomly and with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
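As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs and compares each frequency with the uniform expectation of 2.5‰. It is a minimal sketch under assumptions: sequences use one-letter codes, pairs are not counted across protein boundaries, and the function names `dipeptide_per_mille` and `classify` are hypothetical. The exact counting convention used by the database is not stated on this page.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input, not real PIV5 sequences.
toy_proteins = ["MALLS", "LLK"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

Applied to values from the table, `classify(10.221)` (LeuLeu) would label the dipeptide as overrepresented and `classify(0.613)` (AlaCys) as underrepresented, assuming the colouring is indeed relative to the uniform 2.5‰ expectation.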

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
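The unit conversion can be written out explicitly. This is a minimal sketch; the helper names are hypothetical and only illustrate the arithmetic (‰ × 10^-3 gives a fraction, ‰ × 0.1 gives a percentage).

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_per_mille * 0.1


# LeuLeu is listed at 10.221 per mille in the table.
leu_leu_fraction = per_mille_to_fraction(10.221)
# The uniform expectation of 2.5 per mille corresponds to 0.25%.
expected_percent = per_mille_to_percent(2.5)
```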

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.972 ± 3.242
AlaCys: 0.613 ± 0.244
AlaAsp: 3.475 ± 0.953
AlaGlu: 3.271 ± 0.662
AlaPhe: 2.453 ± 0.678
AlaGly: 4.293 ± 0.616
AlaHis: 0.613 ± 0.371
AlaIle: 4.906 ± 1.316
AlaLys: 2.657 ± 1.532
AlaLeu: 7.155 ± 1.537
AlaMet: 1.226 ± 0.482
AlaAsn: 3.475 ± 1.096
AlaPro: 2.044 ± 0.58
AlaGln: 2.862 ± 1.03
AlaArg: 3.066 ± 1.26
AlaSer: 3.884 ± 1.444
AlaThr: 5.519 ± 1.851
AlaVal: 3.679 ± 1.211
AlaTrp: 1.226 ± 0.423
AlaTyr: 1.226 ± 0.423
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.818 ± 0.387
CysCys: 0.409 ± 0.247
CysAsp: 0.0 ± 0.0
CysGlu: 0.613 ± 0.342
CysPhe: 1.226 ± 0.482
CysGly: 0.818 ± 0.303
CysHis: 0.613 ± 0.342
CysIle: 1.431 ± 0.481
CysLys: 1.022 ± 0.535
CysLeu: 2.044 ± 0.612
CysMet: 1.022 ± 0.235
CysAsn: 1.226 ± 0.365
CysPro: 1.022 ± 0.545
CysGln: 0.613 ± 0.244
CysArg: 0.818 ± 0.47
CysSer: 2.453 ± 0.94
CysThr: 1.84 ± 0.508
CysVal: 1.022 ± 0.354
CysTrp: 0.0 ± 0.0
CysTyr: 0.613 ± 0.247
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.862 ± 0.692
AspCys: 0.613 ± 0.331
AspAsp: 2.862 ± 0.502
AspGlu: 2.657 ± 0.67
AspPhe: 1.635 ± 0.398
AspGly: 1.84 ± 0.157
AspHis: 0.818 ± 0.235
AspIle: 3.271 ± 0.838
AspLys: 2.044 ± 0.499
AspLeu: 6.337 ± 0.875
AspMet: 1.226 ± 0.477
AspAsn: 2.249 ± 0.938
AspPro: 3.884 ± 0.839
AspGln: 2.044 ± 0.642
AspArg: 1.635 ± 0.431
AspSer: 3.679 ± 0.507
AspThr: 3.271 ± 1.035
AspVal: 1.431 ± 0.746
AspTrp: 0.409 ± 0.247
AspTyr: 1.84 ± 0.769
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.249 ± 0.822
GluCys: 1.022 ± 0.598
GluAsp: 2.657 ± 0.716
GluGlu: 3.271 ± 1.284
GluPhe: 1.431 ± 0.724
GluGly: 3.066 ± 0.733
GluHis: 0.0 ± 0.0
GluIle: 4.497 ± 1.1
GluLys: 2.044 ± 0.616
GluLeu: 5.519 ± 1.15
GluMet: 1.022 ± 0.725
GluAsn: 3.475 ± 0.744
GluPro: 1.226 ± 0.44
GluGln: 2.453 ± 0.466
GluArg: 2.657 ± 0.674
GluSer: 5.315 ± 0.977
GluThr: 2.453 ± 0.464
GluVal: 1.84 ± 0.461
GluTrp: 0.613 ± 0.342
GluTyr: 2.453 ± 0.706
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.635 ± 0.602
PheCys: 0.613 ± 0.247
PheAsp: 1.022 ± 0.453
PheGlu: 2.453 ± 0.757
PhePhe: 1.635 ± 0.447
PheGly: 2.044 ± 0.664
PheHis: 1.022 ± 0.363
PheIle: 3.884 ± 0.858
PheLys: 1.431 ± 0.495
PheLeu: 3.679 ± 0.681
PheMet: 0.613 ± 0.371
PheAsn: 1.431 ± 0.453
PhePro: 1.431 ± 0.484
PheGln: 2.044 ± 0.623
PheArg: 1.84 ± 0.531
PheSer: 3.271 ± 0.942
PheThr: 3.066 ± 0.522
PheVal: 2.453 ± 0.545
PheTrp: 0.204 ± 0.124
PheTyr: 0.613 ± 0.244
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.271 ± 1.194
GlyCys: 2.044 ± 0.742
GlyAsp: 4.497 ± 1.213
GlyGlu: 2.249 ± 0.715
GlyPhe: 1.84 ± 0.527
GlyGly: 3.475 ± 0.968
GlyHis: 1.431 ± 0.543
GlyIle: 3.066 ± 0.638
GlyLys: 3.475 ± 1.534
GlyLeu: 5.724 ± 1.153
GlyMet: 1.84 ± 1.022
GlyAsn: 2.453 ± 0.77
GlyPro: 1.635 ± 0.732
GlyGln: 1.431 ± 0.536
GlyArg: 2.862 ± 0.693
GlySer: 5.519 ± 2.258
GlyThr: 3.884 ± 0.921
GlyVal: 3.884 ± 1.368
GlyTrp: 1.226 ± 0.44
GlyTyr: 1.226 ± 0.558
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.022 ± 0.311
HisCys: 0.0 ± 0.0
HisAsp: 0.818 ± 0.346
HisGlu: 0.818 ± 0.462
HisPhe: 0.613 ± 0.264
HisGly: 0.409 ± 0.247
HisHis: 1.226 ± 0.742
HisIle: 1.226 ± 0.438
HisLys: 0.409 ± 0.24
HisLeu: 2.453 ± 0.91
HisMet: 0.0 ± 0.0
HisAsn: 1.431 ± 0.475
HisPro: 0.818 ± 0.295
HisGln: 1.022 ± 0.602
HisArg: 0.818 ± 0.391
HisSer: 0.204 ± 0.124
HisThr: 2.453 ± 0.816
HisVal: 1.022 ± 0.279
HisTrp: 0.613 ± 0.244
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.679 ± 0.716
IleCys: 1.022 ± 0.383
IleAsp: 4.293 ± 1.048
IleGlu: 5.519 ± 0.878
IlePhe: 2.862 ± 0.903
IleGly: 2.453 ± 0.931
IleHis: 1.022 ± 0.311
IleIle: 7.972 ± 1.353
IleLys: 3.271 ± 0.644
IleLeu: 7.563 ± 1.659
IleMet: 1.84 ± 0.534
IleAsn: 6.541 ± 0.944
IlePro: 4.702 ± 0.801
IleGln: 3.475 ± 1.054
IleArg: 2.249 ± 0.282
IleSer: 5.519 ± 1.022
IleThr: 5.928 ± 1.36
IleVal: 5.519 ± 0.596
IleTrp: 0.613 ± 0.371
IleTyr: 1.431 ± 0.581
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.271 ± 0.853
LysCys: 1.635 ± 0.485
LysAsp: 0.818 ± 0.481
LysGlu: 2.044 ± 0.634
LysPhe: 1.84 ± 0.541
LysGly: 2.657 ± 0.879
LysHis: 0.818 ± 0.387
LysIle: 3.066 ± 1.631
LysLys: 2.249 ± 0.843
LysLeu: 6.541 ± 1.089
LysMet: 0.818 ± 0.34
LysAsn: 1.635 ± 0.517
LysPro: 2.862 ± 1.221
LysGln: 2.249 ± 0.787
LysArg: 2.249 ± 0.532
LysSer: 3.475 ± 0.986
LysThr: 3.271 ± 1.083
LysVal: 2.249 ± 0.611
LysTrp: 0.0 ± 0.0
LysTyr: 1.84 ± 0.625
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.155 ± 1.452
LeuCys: 2.044 ± 0.526
LeuAsp: 6.132 ± 0.775
LeuGlu: 6.746 ± 1.113
LeuPhe: 3.884 ± 1.002
LeuGly: 7.563 ± 0.66
LeuHis: 1.226 ± 0.588
LeuIle: 7.563 ± 1.444
LeuLys: 6.746 ± 1.692
LeuLeu: 10.221 ± 0.875
LeuMet: 3.475 ± 0.655
LeuAsn: 6.337 ± 0.775
LeuPro: 5.315 ± 0.663
LeuGln: 3.679 ± 1.35
LeuArg: 4.702 ± 1.159
LeuSer: 10.221 ± 1.577
LeuThr: 11.447 ± 1.452
LeuVal: 3.475 ± 0.926
LeuTrp: 1.635 ± 0.655
LeuTyr: 2.249 ± 0.65
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.044 ± 0.96
MetCys: 0.409 ± 0.225
MetAsp: 1.431 ± 0.588
MetGlu: 0.613 ± 0.308
MetPhe: 0.204 ± 0.249
MetGly: 1.022 ± 0.645
MetHis: 0.0 ± 0.0
MetIle: 1.635 ± 0.461
MetLys: 1.022 ± 0.335
MetLeu: 1.635 ± 0.357
MetMet: 0.204 ± 0.211
MetAsn: 1.022 ± 0.301
MetPro: 1.022 ± 0.326
MetGln: 1.022 ± 0.836
MetArg: 2.249 ± 0.607
MetSer: 1.635 ± 0.521
MetThr: 2.453 ± 0.602
MetVal: 2.249 ± 0.734
MetTrp: 0.204 ± 0.226
MetTyr: 2.453 ± 0.711
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.11 ± 0.874
AsnCys: 0.818 ± 0.581
AsnAsp: 2.862 ± 0.624
AsnGlu: 1.431 ± 0.31
AsnPhe: 1.635 ± 0.793
AsnGly: 2.657 ± 0.762
AsnHis: 1.84 ± 0.575
AsnIle: 3.475 ± 1.498
AsnLys: 1.635 ± 0.44
AsnLeu: 7.155 ± 1.786
AsnMet: 0.613 ± 0.376
AsnAsn: 2.453 ± 0.576
AsnPro: 4.702 ± 0.925
AsnGln: 3.679 ± 0.926
AsnArg: 3.475 ± 0.667
AsnSer: 2.862 ± 0.685
AsnThr: 2.862 ± 1.243
AsnVal: 2.657 ± 0.426
AsnTrp: 1.226 ± 0.607
AsnTyr: 1.226 ± 0.529
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.271 ± 1.558
ProCys: 0.0 ± 0.0
ProAsp: 2.657 ± 0.608
ProGlu: 3.066 ± 0.868
ProPhe: 2.044 ± 0.593
ProGly: 3.884 ± 1.315
ProHis: 1.431 ± 0.673
ProIle: 4.293 ± 1.192
ProLys: 2.044 ± 0.979
ProLeu: 5.11 ± 0.656
ProMet: 1.226 ± 0.441
ProAsn: 2.453 ± 0.403
ProPro: 3.066 ± 0.588
ProGln: 1.022 ± 0.331
ProArg: 3.271 ± 0.649
ProSer: 4.088 ± 1.419
ProThr: 5.315 ± 1.449
ProVal: 1.84 ± 0.487
ProTrp: 0.0 ± 0.0
ProTyr: 1.022 ± 0.618
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.884 ± 0.746
GlnCys: 1.022 ± 0.363
GlnAsp: 2.249 ± 0.353
GlnGlu: 2.044 ± 0.45
GlnPhe: 1.84 ± 0.686
GlnGly: 1.84 ± 0.448
GlnHis: 0.409 ± 0.24
GlnIle: 3.475 ± 1.306
GlnLys: 1.84 ± 0.643
GlnLeu: 5.519 ± 1.075
GlnMet: 1.022 ± 0.285
GlnAsn: 2.044 ± 0.699
GlnPro: 1.84 ± 1.077
GlnGln: 2.044 ± 0.744
GlnArg: 2.044 ± 0.636
GlnSer: 3.475 ± 1.516
GlnThr: 1.84 ± 0.516
GlnVal: 3.475 ± 0.696
GlnTrp: 0.204 ± 0.124
GlnTyr: 1.022 ± 0.618
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.818 ± 0.384
ArgCys: 0.409 ± 0.307
ArgAsp: 2.044 ± 0.621
ArgGlu: 2.453 ± 0.728
ArgPhe: 2.657 ± 0.857
ArgGly: 3.271 ± 0.936
ArgHis: 1.022 ± 0.397
ArgIle: 4.702 ± 0.731
ArgLys: 4.293 ± 1.161
ArgLeu: 4.088 ± 1.426
ArgMet: 1.431 ± 0.234
ArgAsn: 2.657 ± 0.543
ArgPro: 1.431 ± 0.58
ArgGln: 1.84 ± 0.406
ArgArg: 3.475 ± 0.967
ArgSer: 3.679 ± 0.625
ArgThr: 1.84 ± 0.468
ArgVal: 2.657 ± 0.796
ArgTrp: 0.204 ± 0.249
ArgTyr: 1.84 ± 0.892
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.497 ± 1.41
SerCys: 2.657 ± 0.793
SerAsp: 3.884 ± 0.943
SerGlu: 2.657 ± 0.488
SerPhe: 2.862 ± 0.77
SerGly: 5.315 ± 0.621
SerHis: 1.226 ± 0.558
SerIle: 5.315 ± 1.045
SerLys: 1.84 ± 0.62
SerLeu: 10.221 ± 0.959
SerMet: 1.84 ± 0.493
SerAsn: 4.702 ± 0.971
SerPro: 4.702 ± 1.383
SerGln: 3.271 ± 0.729
SerArg: 2.249 ± 0.691
SerSer: 7.359 ± 1.633
SerThr: 5.519 ± 1.477
SerVal: 4.702 ± 1.086
SerTrp: 1.226 ± 0.574
SerTyr: 3.271 ± 0.481
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.315 ± 0.611
ThrCys: 1.431 ± 0.7
ThrAsp: 2.249 ± 0.381
ThrGlu: 2.862 ± 0.762
ThrPhe: 3.066 ± 0.596
ThrGly: 5.11 ± 1.989
ThrHis: 1.635 ± 0.406
ThrIle: 5.724 ± 2.047
ThrLys: 2.044 ± 0.576
ThrLeu: 9.812 ± 1.665
ThrMet: 1.84 ± 0.476
ThrAsn: 3.475 ± 1.076
ThrPro: 2.862 ± 0.373
ThrGln: 4.906 ± 1.38
ThrArg: 4.088 ± 0.754
ThrSer: 5.519 ± 1.428
ThrThr: 4.497 ± 0.48
ThrVal: 5.11 ± 1.733
ThrTrp: 0.409 ± 0.247
ThrTyr: 2.657 ± 0.834
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.884 ± 0.891
ValCys: 0.613 ± 0.44
ValAsp: 1.84 ± 0.583
ValGlu: 2.453 ± 0.525
ValPhe: 0.818 ± 0.303
ValGly: 3.271 ± 0.857
ValHis: 0.613 ± 0.294
ValIle: 3.679 ± 0.898
ValLys: 3.475 ± 1.3
ValLeu: 5.11 ± 0.781
ValMet: 1.226 ± 0.434
ValAsn: 2.044 ± 0.344
ValPro: 4.293 ± 1.627
ValGln: 1.431 ± 1.013
ValArg: 2.453 ± 0.598
ValSer: 4.088 ± 0.759
ValThr: 5.928 ± 2.217
ValVal: 4.497 ± 2.259
ValTrp: 0.818 ± 0.288
ValTyr: 2.657 ± 0.782
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.818 ± 0.336
TrpCys: 0.613 ± 0.352
TrpAsp: 0.613 ± 0.264
TrpGlu: 0.409 ± 0.241
TrpPhe: 0.409 ± 0.191
TrpGly: 0.818 ± 0.462
TrpHis: 0.204 ± 0.124
TrpIle: 1.431 ± 0.673
TrpLys: 0.818 ± 0.354
TrpLeu: 0.818 ± 0.288
TrpMet: 0.409 ± 0.231
TrpAsn: 0.818 ± 0.336
TrpPro: 0.818 ± 0.239
TrpGln: 0.613 ± 0.294
TrpArg: 0.204 ± 0.124
TrpSer: 0.818 ± 0.46
TrpThr: 0.204 ± 0.124
TrpVal: 0.409 ± 0.408
TrpTrp: 0.204 ± 0.211
TrpTyr: 0.204 ± 0.124
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.044 ± 1.382
TyrCys: 1.635 ± 0.379
TyrAsp: 0.204 ± 0.124
TyrGlu: 1.635 ± 0.46
TyrPhe: 1.431 ± 0.808
TyrGly: 1.226 ± 0.586
TyrHis: 0.204 ± 0.263
TyrIle: 3.066 ± 1.106
TyrLys: 1.226 ± 0.425
TyrLeu: 5.11 ± 0.942
TyrMet: 1.431 ± 0.39
TyrAsn: 2.453 ± 0.736
TyrPro: 1.431 ± 0.383
TyrGln: 1.431 ± 0.479
TyrArg: 0.409 ± 0.225
TyrSer: 2.249 ± 0.611
TyrThr: 1.022 ± 0.257
TyrVal: 1.022 ± 0.448
TyrTrp: 0.613 ± 0.294
TyrTyr: 2.657 ± 0.793
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4893 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
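A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the error is taken as the standard deviation of those replicate values. The function names are hypothetical, sequences use one-letter codes, and the database's exact procedure is not described on this page.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real PIV5 sequences.
toy_proteins = ["MALLS", "LLK", "GGLLA"]
toy_error = bootstrap_error(toy_proteins, "LL")
```

With only 8 proteins in this proteome, resampling at the protein level can give fairly wide error estimates for dipeptides concentrated in one or two proteins. A dipeptide that never occurs yields 0.0 in every replicate, consistent with the `0.0 ± 0.0` entries in the table.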

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski