Amino acid dipepetide frequency for Bat Paramyxovirus Eid_hel/GH-M74a/GHA/2009

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.05AlaAla: 4.05 ± 0.794
0.736AlaCys: 0.736 ± 0.436
1.105AlaAsp: 1.105 ± 0.4
3.866AlaGlu: 3.866 ± 0.888
1.289AlaPhe: 1.289 ± 0.615
2.577AlaGly: 2.577 ± 0.9
0.736AlaHis: 0.736 ± 0.205
4.602AlaIle: 4.602 ± 1.091
1.657AlaLys: 1.657 ± 0.743
4.602AlaLeu: 4.602 ± 0.953
1.841AlaMet: 1.841 ± 0.566
2.209AlaAsn: 2.209 ± 0.663
2.025AlaPro: 2.025 ± 1.031
2.209AlaGln: 2.209 ± 0.577
2.946AlaArg: 2.946 ± 0.908
4.05AlaSer: 4.05 ± 1.493
2.761AlaThr: 2.761 ± 0.832
3.13AlaVal: 3.13 ± 0.492
0.368AlaTrp: 0.368 ± 0.225
1.473AlaTyr: 1.473 ± 0.309
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.184CysCys: 0.184 ± 0.109
0.736CysAsp: 0.736 ± 0.319
0.552CysGlu: 0.552 ± 0.331
0.552CysPhe: 0.552 ± 0.236
0.368CysGly: 0.368 ± 0.218
0.552CysHis: 0.552 ± 0.233
1.657CysIle: 1.657 ± 0.556
0.552CysLys: 0.552 ± 0.271
2.025CysLeu: 2.025 ± 0.726
0.0CysMet: 0.0 ± 0.0
0.368CysAsn: 0.368 ± 0.218
0.92CysPro: 0.92 ± 0.512
0.92CysGln: 0.92 ± 0.401
1.105CysArg: 1.105 ± 0.629
2.025CysSer: 2.025 ± 0.423
1.105CysThr: 1.105 ± 0.757
0.368CysVal: 0.368 ± 0.222
0.184CysTrp: 0.184 ± 0.188
1.289CysTyr: 1.289 ± 0.456
0.0CysXaa: 0.0 ± 0.0
Asp
1.105AspAla: 1.105 ± 0.491
0.552AspCys: 0.552 ± 0.355
4.418AspAsp: 4.418 ± 1.316
4.418AspGlu: 4.418 ± 0.803
2.577AspPhe: 2.577 ± 0.54
2.393AspGly: 2.393 ± 0.987
1.289AspHis: 1.289 ± 0.495
3.866AspIle: 3.866 ± 0.507
5.707AspLys: 5.707 ± 0.831
5.339AspLeu: 5.339 ± 0.806
0.552AspMet: 0.552 ± 0.232
3.682AspAsn: 3.682 ± 0.759
4.418AspPro: 4.418 ± 0.986
2.025AspGln: 2.025 ± 0.605
2.393AspArg: 2.393 ± 0.725
2.025AspSer: 2.025 ± 0.484
1.657AspThr: 1.657 ± 0.516
2.946AspVal: 2.946 ± 0.591
0.736AspTrp: 0.736 ± 0.205
1.841AspTyr: 1.841 ± 0.756
0.0AspXaa: 0.0 ± 0.0
Glu
1.841GluAla: 1.841 ± 0.485
1.841GluCys: 1.841 ± 0.51
3.13GluAsp: 3.13 ± 1.135
4.234GluGlu: 4.234 ± 0.83
2.209GluPhe: 2.209 ± 0.663
3.682GluGly: 3.682 ± 0.588
1.105GluHis: 1.105 ± 0.454
6.075GluIle: 6.075 ± 0.878
4.05GluLys: 4.05 ± 0.872
6.075GluLeu: 6.075 ± 0.778
1.473GluMet: 1.473 ± 1.111
4.418GluAsn: 4.418 ± 0.917
2.209GluPro: 2.209 ± 0.921
2.393GluGln: 2.393 ± 0.358
3.498GluArg: 3.498 ± 0.343
3.866GluSer: 3.866 ± 0.586
5.155GluThr: 5.155 ± 1.164
2.209GluVal: 2.209 ± 0.947
0.92GluTrp: 0.92 ± 0.658
1.657GluTyr: 1.657 ± 0.487
0.0GluXaa: 0.0 ± 0.0
Phe
2.393PheAla: 2.393 ± 0.952
1.105PheCys: 1.105 ± 0.654
0.368PheAsp: 0.368 ± 0.218
1.289PheGlu: 1.289 ± 0.448
2.025PhePhe: 2.025 ± 0.865
2.761PheGly: 2.761 ± 0.782
0.736PheHis: 0.736 ± 0.275
3.13PheIle: 3.13 ± 0.506
1.657PheLys: 1.657 ± 0.448
2.946PheLeu: 2.946 ± 1.039
1.289PheMet: 1.289 ± 0.748
1.657PheAsn: 1.657 ± 0.62
1.105PhePro: 1.105 ± 0.345
1.289PheGln: 1.289 ± 0.484
1.657PheArg: 1.657 ± 0.943
2.393PheSer: 2.393 ± 0.646
1.657PheThr: 1.657 ± 0.462
1.289PheVal: 1.289 ± 0.29
0.552PheTrp: 0.552 ± 0.236
1.289PheTyr: 1.289 ± 0.472
0.0PheXaa: 0.0 ± 0.0
Gly
2.577GlyAla: 2.577 ± 0.704
0.184GlyCys: 0.184 ± 0.109
2.025GlyAsp: 2.025 ± 0.442
2.393GlyGlu: 2.393 ± 0.704
1.657GlyPhe: 1.657 ± 0.694
4.05GlyGly: 4.05 ± 1.139
1.657GlyHis: 1.657 ± 0.592
3.866GlyIle: 3.866 ± 0.567
3.498GlyLys: 3.498 ± 0.729
4.05GlyLeu: 4.05 ± 0.687
1.473GlyMet: 1.473 ± 0.498
3.13GlyAsn: 3.13 ± 0.654
2.025GlyPro: 2.025 ± 0.429
0.92GlyGln: 0.92 ± 0.406
3.13GlyArg: 3.13 ± 1.167
4.971GlySer: 4.971 ± 0.896
2.393GlyThr: 2.393 ± 0.709
5.523GlyVal: 5.523 ± 1.483
0.184GlyTrp: 0.184 ± 0.233
1.841GlyTyr: 1.841 ± 0.717
0.0GlyXaa: 0.0 ± 0.0
His
0.368HisAla: 0.368 ± 0.222
0.184HisCys: 0.184 ± 0.109
0.736HisAsp: 0.736 ± 0.236
0.736HisGlu: 0.736 ± 0.309
0.0HisPhe: 0.0 ± 0.0
0.92HisGly: 0.92 ± 0.311
0.368HisHis: 0.368 ± 0.192
1.105HisIle: 1.105 ± 0.364
0.552HisLys: 0.552 ± 0.236
2.393HisLeu: 2.393 ± 0.702
0.92HisMet: 0.92 ± 0.409
1.105HisAsn: 1.105 ± 0.423
2.025HisPro: 2.025 ± 0.558
0.736HisGln: 0.736 ± 0.449
0.552HisArg: 0.552 ± 0.238
2.393HisSer: 2.393 ± 0.821
0.736HisThr: 0.736 ± 0.322
1.473HisVal: 1.473 ± 0.39
0.184HisTrp: 0.184 ± 0.109
1.105HisTyr: 1.105 ± 0.597
0.0HisXaa: 0.0 ± 0.0
Ile
4.234IleAla: 4.234 ± 1.09
1.473IleCys: 1.473 ± 0.497
5.155IleAsp: 5.155 ± 0.8
4.602IleGlu: 4.602 ± 1.376
1.473IlePhe: 1.473 ± 0.549
3.314IleGly: 3.314 ± 0.412
0.92IleHis: 0.92 ± 0.281
6.811IleIle: 6.811 ± 1.19
5.707IleLys: 5.707 ± 1.503
7.916IleLeu: 7.916 ± 1.704
2.209IleMet: 2.209 ± 0.53
5.707IleAsn: 5.707 ± 0.952
3.682IlePro: 3.682 ± 0.786
4.05IleGln: 4.05 ± 1.041
4.786IleArg: 4.786 ± 0.606
7.916IleSer: 7.916 ± 1.775
6.811IleThr: 6.811 ± 1.777
4.602IleVal: 4.602 ± 1.117
0.552IleTrp: 0.552 ± 0.266
2.577IleTyr: 2.577 ± 0.733
0.0IleXaa: 0.0 ± 0.0
Lys
2.946LysAla: 2.946 ± 0.57
0.92LysCys: 0.92 ± 0.412
3.682LysAsp: 3.682 ± 0.903
5.523LysGlu: 5.523 ± 1.316
2.577LysPhe: 2.577 ± 0.482
3.13LysGly: 3.13 ± 0.722
1.473LysHis: 1.473 ± 0.464
6.075LysIle: 6.075 ± 0.902
3.682LysLys: 3.682 ± 0.779
4.602LysLeu: 4.602 ± 0.904
2.577LysMet: 2.577 ± 0.773
3.498LysAsn: 3.498 ± 0.394
1.289LysPro: 1.289 ± 0.964
1.657LysGln: 1.657 ± 0.597
3.498LysArg: 3.498 ± 0.635
6.443LysSer: 6.443 ± 0.574
4.971LysThr: 4.971 ± 0.967
3.314LysVal: 3.314 ± 1.018
0.184LysTrp: 0.184 ± 0.109
1.473LysTyr: 1.473 ± 0.473
0.0LysXaa: 0.0 ± 0.0
Leu
2.761LeuAla: 2.761 ± 0.544
1.473LeuCys: 1.473 ± 0.472
6.075LeuAsp: 6.075 ± 0.729
7.364LeuGlu: 7.364 ± 1.835
3.682LeuPhe: 3.682 ± 0.943
4.234LeuGly: 4.234 ± 0.579
0.92LeuHis: 0.92 ± 0.311
8.284LeuIle: 8.284 ± 1.989
5.155LeuLys: 5.155 ± 1.36
8.1LeuLeu: 8.1 ± 1.308
1.657LeuMet: 1.657 ± 0.282
9.205LeuAsn: 9.205 ± 1.101
2.393LeuPro: 2.393 ± 0.484
2.761LeuGln: 2.761 ± 0.62
4.786LeuArg: 4.786 ± 1.376
7.548LeuSer: 7.548 ± 0.917
6.443LeuThr: 6.443 ± 1.134
5.523LeuVal: 5.523 ± 0.765
0.552LeuTrp: 0.552 ± 0.258
1.841LeuTyr: 1.841 ± 0.715
0.0LeuXaa: 0.0 ± 0.0
Met
2.209MetAla: 2.209 ± 1.386
0.0MetCys: 0.0 ± 0.0
1.105MetAsp: 1.105 ± 0.364
1.841MetGlu: 1.841 ± 1.087
1.105MetPhe: 1.105 ± 0.551
0.92MetGly: 0.92 ± 0.663
0.552MetHis: 0.552 ± 0.233
2.946MetIle: 2.946 ± 0.619
1.657MetLys: 1.657 ± 0.573
1.473MetLeu: 1.473 ± 0.566
0.552MetMet: 0.552 ± 0.229
0.92MetAsn: 0.92 ± 0.271
1.105MetPro: 1.105 ± 0.407
0.552MetGln: 0.552 ± 0.233
1.105MetArg: 1.105 ± 0.606
1.841MetSer: 1.841 ± 0.577
2.393MetThr: 2.393 ± 0.62
2.025MetVal: 2.025 ± 0.746
0.184MetTrp: 0.184 ± 0.109
1.105MetTyr: 1.105 ± 0.513
0.0MetXaa: 0.0 ± 0.0
Asn
3.13AsnAla: 3.13 ± 0.842
1.473AsnCys: 1.473 ± 0.792
3.866AsnAsp: 3.866 ± 0.781
2.761AsnGlu: 2.761 ± 0.883
1.841AsnPhe: 1.841 ± 0.69
2.761AsnGly: 2.761 ± 0.698
0.92AsnHis: 0.92 ± 0.401
6.259AsnIle: 6.259 ± 0.821
2.577AsnLys: 2.577 ± 0.807
6.811AsnLeu: 6.811 ± 1.585
0.736AsnMet: 0.736 ± 0.205
2.946AsnAsn: 2.946 ± 0.57
4.418AsnPro: 4.418 ± 1.514
3.866AsnGln: 3.866 ± 1.06
2.577AsnArg: 2.577 ± 1.226
4.234AsnSer: 4.234 ± 0.847
3.314AsnThr: 3.314 ± 1.093
2.209AsnVal: 2.209 ± 0.428
0.552AsnTrp: 0.552 ± 0.236
3.13AsnTyr: 3.13 ± 0.481
0.0AsnXaa: 0.0 ± 0.0
Pro
2.393ProAla: 2.393 ± 1.382
0.552ProCys: 0.552 ± 0.365
1.657ProAsp: 1.657 ± 0.773
3.682ProGlu: 3.682 ± 1.17
1.289ProPhe: 1.289 ± 0.481
2.577ProGly: 2.577 ± 0.642
0.368ProHis: 0.368 ± 0.184
2.577ProIle: 2.577 ± 0.727
3.314ProLys: 3.314 ± 1.376
3.866ProLeu: 3.866 ± 0.805
0.736ProMet: 0.736 ± 0.442
3.13ProAsn: 3.13 ± 1.778
1.841ProPro: 1.841 ± 1.248
1.657ProGln: 1.657 ± 0.434
4.234ProArg: 4.234 ± 0.482
5.339ProSer: 5.339 ± 0.873
3.866ProThr: 3.866 ± 1.174
2.025ProVal: 2.025 ± 0.831
0.552ProTrp: 0.552 ± 0.449
2.946ProTyr: 2.946 ± 0.624
0.0ProXaa: 0.0 ± 0.0
Gln
1.289GlnAla: 1.289 ± 0.474
0.736GlnCys: 0.736 ± 0.385
2.577GlnAsp: 2.577 ± 0.73
2.393GlnGlu: 2.393 ± 0.619
1.289GlnPhe: 1.289 ± 0.613
1.841GlnGly: 1.841 ± 1.123
0.368GlnHis: 0.368 ± 0.184
2.761GlnIle: 2.761 ± 1.159
2.946GlnLys: 2.946 ± 0.731
3.13GlnLeu: 3.13 ± 0.604
0.736GlnMet: 0.736 ± 0.236
1.473GlnAsn: 1.473 ± 0.435
2.946GlnPro: 2.946 ± 1.071
2.209GlnGln: 2.209 ± 0.744
2.393GlnArg: 2.393 ± 0.653
3.866GlnSer: 3.866 ± 0.589
1.657GlnThr: 1.657 ± 0.991
1.841GlnVal: 1.841 ± 0.689
0.184GlnTrp: 0.184 ± 0.351
1.289GlnTyr: 1.289 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
3.498ArgAla: 3.498 ± 1.242
0.552ArgCys: 0.552 ± 0.185
3.682ArgAsp: 3.682 ± 0.336
3.498ArgGlu: 3.498 ± 0.524
1.105ArgPhe: 1.105 ± 0.37
4.418ArgGly: 4.418 ± 0.712
0.92ArgHis: 0.92 ± 0.311
3.866ArgIle: 3.866 ± 0.982
2.209ArgLys: 2.209 ± 0.603
5.155ArgLeu: 5.155 ± 1.201
1.657ArgMet: 1.657 ± 0.913
3.13ArgAsn: 3.13 ± 1.042
2.209ArgPro: 2.209 ± 0.53
1.289ArgGln: 1.289 ± 0.701
3.13ArgArg: 3.13 ± 0.919
6.259ArgSer: 6.259 ± 0.746
2.577ArgThr: 2.577 ± 0.645
3.682ArgVal: 3.682 ± 0.625
0.552ArgTrp: 0.552 ± 0.439
1.657ArgTyr: 1.657 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
4.418SerAla: 4.418 ± 0.835
1.105SerCys: 1.105 ± 0.474
3.866SerAsp: 3.866 ± 0.47
5.523SerGlu: 5.523 ± 0.684
3.682SerPhe: 3.682 ± 0.482
4.234SerGly: 4.234 ± 1.558
2.209SerHis: 2.209 ± 0.663
6.075SerIle: 6.075 ± 1.352
6.443SerLys: 6.443 ± 0.791
6.627SerLeu: 6.627 ± 0.757
2.761SerMet: 2.761 ± 0.321
5.707SerAsn: 5.707 ± 0.994
2.946SerPro: 2.946 ± 0.751
3.682SerGln: 3.682 ± 1.094
4.971SerArg: 4.971 ± 0.952
5.891SerSer: 5.891 ± 0.913
5.523SerThr: 5.523 ± 0.744
3.682SerVal: 3.682 ± 0.719
0.736SerTrp: 0.736 ± 0.34
3.13SerTyr: 3.13 ± 0.807
0.0SerXaa: 0.0 ± 0.0
Thr
4.786ThrAla: 4.786 ± 0.493
0.552ThrCys: 0.552 ± 0.458
3.682ThrAsp: 3.682 ± 0.622
1.841ThrGlu: 1.841 ± 0.483
1.289ThrPhe: 1.289 ± 0.427
3.682ThrGly: 3.682 ± 1.051
0.92ThrHis: 0.92 ± 0.318
5.707ThrIle: 5.707 ± 0.871
5.707ThrLys: 5.707 ± 0.901
5.707ThrLeu: 5.707 ± 0.937
1.473ThrMet: 1.473 ± 0.666
3.314ThrAsn: 3.314 ± 0.815
4.602ThrPro: 4.602 ± 1.118
1.657ThrGln: 1.657 ± 0.402
3.13ThrArg: 3.13 ± 1.135
4.602ThrSer: 4.602 ± 0.822
4.234ThrThr: 4.234 ± 0.691
4.786ThrVal: 4.786 ± 0.99
1.289ThrTrp: 1.289 ± 0.431
2.393ThrTyr: 2.393 ± 0.602
0.0ThrXaa: 0.0 ± 0.0
Val
2.577ValAla: 2.577 ± 0.696
0.736ValCys: 0.736 ± 0.478
3.866ValAsp: 3.866 ± 0.826
3.682ValGlu: 3.682 ± 0.822
1.105ValPhe: 1.105 ± 0.377
2.025ValGly: 2.025 ± 0.941
1.841ValHis: 1.841 ± 0.653
5.339ValIle: 5.339 ± 1.055
2.761ValLys: 2.761 ± 0.758
4.786ValLeu: 4.786 ± 1.143
1.657ValMet: 1.657 ± 0.568
2.025ValAsn: 2.025 ± 0.645
4.05ValPro: 4.05 ± 0.393
1.657ValGln: 1.657 ± 0.517
2.761ValArg: 2.761 ± 0.905
3.866ValSer: 3.866 ± 0.553
5.339ValThr: 5.339 ± 0.452
1.841ValVal: 1.841 ± 0.363
0.552ValTrp: 0.552 ± 0.327
2.946ValTyr: 2.946 ± 0.329
0.0ValXaa: 0.0 ± 0.0
Trp
0.368TrpAla: 0.368 ± 0.218
0.184TrpCys: 0.184 ± 0.188
0.552TrpAsp: 0.552 ± 0.236
0.92TrpGlu: 0.92 ± 0.387
0.368TrpPhe: 0.368 ± 0.218
0.184TrpGly: 0.184 ± 0.109
0.184TrpHis: 0.184 ± 0.188
1.105TrpIle: 1.105 ± 0.513
0.736TrpLys: 0.736 ± 0.275
0.736TrpLeu: 0.736 ± 0.615
0.184TrpMet: 0.184 ± 0.109
0.368TrpAsn: 0.368 ± 0.22
0.184TrpPro: 0.184 ± 0.109
0.0TrpGln: 0.0 ± 0.0
0.552TrpArg: 0.552 ± 0.258
1.105TrpSer: 1.105 ± 0.305
0.368TrpThr: 0.368 ± 0.188
0.368TrpVal: 0.368 ± 0.403
0.184TrpTrp: 0.184 ± 0.109
0.552TrpTyr: 0.552 ± 0.343
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.289TyrAla: 1.289 ± 0.408
0.92TyrCys: 0.92 ± 0.458
1.657TyrAsp: 1.657 ± 0.429
0.92TyrGlu: 0.92 ± 0.311
1.473TyrPhe: 1.473 ± 0.736
1.473TyrGly: 1.473 ± 0.479
0.552TyrHis: 0.552 ± 0.236
2.393TyrIle: 2.393 ± 0.656
2.946TyrLys: 2.946 ± 0.823
4.418TyrLeu: 4.418 ± 1.235
0.92TyrMet: 0.92 ± 0.462
2.393TyrAsn: 2.393 ± 0.8
2.209TyrPro: 2.209 ± 0.511
2.393TyrGln: 2.393 ± 0.62
1.841TyrArg: 1.841 ± 0.592
2.577TyrSer: 2.577 ± 0.507
2.393TyrThr: 2.393 ± 0.726
2.577TyrVal: 2.577 ± 0.398
0.0TyrTrp: 0.0 ± 0.0
2.761TyrTyr: 2.761 ± 0.997
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (5433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski