Amino acid dipeptide frequency for Simian-Human immunodeficiency virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
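As a rough illustration of the idea, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It assumes one-letter amino acid codes and that pairs are not counted across protein boundaries; the function name and the toy sequences are hypothetical, and the database's exact counting procedure is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping windows of length 2 inside each sequence,
    never spanning two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Windows (0,1), (1,2), ... give len(seq) - 1 dipeptides per protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral proteins): 4 + 2 = 6 dipeptides in total.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With real data, `sequences` would be the protein sequences of the proteome, and three-letter labels such as AlaCys would correspond to one-letter pairs such as AC.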

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
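The unit conversion and the comparison against the uniform expectation can be sketched as follows. This is only an illustration: the helper names are hypothetical, and it assumes that "more than expected" simply means above 1/400 (2.5‰), which may differ from the exact rule used to color the table.

```python
N_STANDARD = 20  # standard amino acids; Xaa is ignored in this sketch

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: GluGlu and AlaThr.
print(permille_to_fraction(11.019), classify(11.019))
print(permille_to_fraction(0.394), classify(0.394))
```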

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.542AlaAla: 3.542 ± 2.034
0.787AlaCys: 0.787 ± 0.647
2.361AlaAsp: 2.361 ± 0.546
6.297AlaGlu: 6.297 ± 1.394
1.181AlaPhe: 1.181 ± 0.618
3.935AlaGly: 3.935 ± 0.738
1.968AlaHis: 1.968 ± 0.726
4.329AlaIle: 4.329 ± 1.629
3.542AlaLys: 3.542 ± 0.943
6.69AlaLeu: 6.69 ± 1.832
2.361AlaMet: 2.361 ± 0.494
3.542AlaAsn: 3.542 ± 1.165
3.935AlaPro: 3.935 ± 1.978
2.361AlaGln: 2.361 ± 0.762
3.148AlaArg: 3.148 ± 1.149
1.968AlaSer: 1.968 ± 0.993
0.394AlaThr: 0.394 ± 0.267
3.542AlaVal: 3.542 ± 0.748
1.968AlaTrp: 1.968 ± 0.385
1.574AlaTyr: 1.574 ± 1.191
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.787CysCys: 0.787 ± 0.745
0.394CysAsp: 0.394 ± 0.267
0.787CysGlu: 0.787 ± 0.4
1.181CysPhe: 1.181 ± 1.233
1.574CysGly: 1.574 ± 0.778
0.394CysHis: 0.394 ± 0.267
0.787CysIle: 0.787 ± 0.645
1.574CysLys: 1.574 ± 0.677
0.787CysLeu: 0.787 ± 0.958
0.394CysMet: 0.394 ± 0.267
0.787CysAsn: 0.787 ± 0.542
0.787CysPro: 0.787 ± 0.338
1.181CysGln: 1.181 ± 0.915
1.574CysArg: 1.574 ± 0.542
0.787CysSer: 0.787 ± 0.4
1.181CysThr: 1.181 ± 0.516
1.181CysVal: 1.181 ± 1.168
0.787CysTrp: 0.787 ± 0.778
1.574CysTyr: 1.574 ± 0.628
0.0CysXaa: 0.0 ± 0.0
Asp
1.181AspAla: 1.181 ± 0.468
1.574AspCys: 1.574 ± 1.051
2.755AspAsp: 2.755 ± 1.571
3.542AspGlu: 3.542 ± 1.539
1.181AspPhe: 1.181 ± 0.802
1.574AspGly: 1.574 ± 0.741
1.181AspHis: 1.181 ± 0.678
3.935AspIle: 3.935 ± 0.772
3.148AspLys: 3.148 ± 1.149
3.542AspLeu: 3.542 ± 1.03
0.787AspMet: 0.787 ± 0.4
1.181AspAsn: 1.181 ± 0.515
3.542AspPro: 3.542 ± 1.562
0.787AspGln: 0.787 ± 0.338
3.935AspArg: 3.935 ± 1.127
2.361AspSer: 2.361 ± 0.988
1.968AspThr: 1.968 ± 0.969
2.361AspVal: 2.361 ± 0.883
0.787AspTrp: 0.787 ± 0.524
1.181AspTyr: 1.181 ± 0.532
0.0AspXaa: 0.0 ± 0.0
Glu
7.871GluAla: 7.871 ± 1.692
0.0GluCys: 0.0 ± 0.0
3.542GluAsp: 3.542 ± 1.164
11.019GluGlu: 11.019 ± 2.557
0.787GluPhe: 0.787 ± 0.4
7.871GluGly: 7.871 ± 0.926
1.574GluHis: 1.574 ± 0.741
2.755GluIle: 2.755 ± 1.241
6.297GluLys: 6.297 ± 1.276
6.297GluLeu: 6.297 ± 1.924
3.148GluMet: 3.148 ± 1.111
1.968GluAsn: 1.968 ± 0.85
3.935GluPro: 3.935 ± 1.004
3.542GluGln: 3.542 ± 0.839
5.116GluArg: 5.116 ± 2.091
2.755GluSer: 2.755 ± 1.13
3.542GluThr: 3.542 ± 0.757
6.297GluVal: 6.297 ± 1.129
1.181GluTrp: 1.181 ± 0.727
1.574GluTyr: 1.574 ± 0.74
0.0GluXaa: 0.0 ± 0.0
Phe
1.968PheAla: 1.968 ± 0.927
0.0PheCys: 0.0 ± 0.0
0.787PheAsp: 0.787 ± 0.715
0.394PheGlu: 0.394 ± 0.479
0.394PhePhe: 0.394 ± 0.267
3.148PheGly: 3.148 ± 0.924
0.787PheHis: 0.787 ± 0.51
1.968PheIle: 1.968 ± 0.808
1.574PheLys: 1.574 ± 0.717
1.574PheLeu: 1.574 ± 0.684
0.787PheMet: 0.787 ± 0.805
0.394PheAsn: 0.394 ± 0.267
2.755PhePro: 2.755 ± 1.545
2.755PheGln: 2.755 ± 0.863
2.361PheArg: 2.361 ± 1.208
1.181PheSer: 1.181 ± 0.802
1.181PheThr: 1.181 ± 0.515
0.0PheVal: 0.0 ± 0.0
0.394PheTrp: 0.394 ± 0.444
0.394PheTyr: 0.394 ± 0.389
0.0PheXaa: 0.0 ± 0.0
Gly
3.542GlyAla: 3.542 ± 0.628
2.755GlyCys: 2.755 ± 1.015
4.329GlyAsp: 4.329 ± 0.639
6.297GlyGlu: 6.297 ± 2.268
2.755GlyPhe: 2.755 ± 1.141
7.871GlyGly: 7.871 ± 1.255
1.181GlyHis: 1.181 ± 0.824
5.51GlyIle: 5.51 ± 2.007
7.477GlyLys: 7.477 ± 2.572
5.903GlyLeu: 5.903 ± 2.371
1.181GlyMet: 1.181 ± 0.618
3.148GlyAsn: 3.148 ± 1.615
5.51GlyPro: 5.51 ± 0.924
4.329GlyGln: 4.329 ± 0.91
3.542GlyArg: 3.542 ± 1.04
3.935GlySer: 3.935 ± 0.51
4.329GlyThr: 4.329 ± 2.55
3.935GlyVal: 3.935 ± 0.715
1.968GlyTrp: 1.968 ± 1.007
1.574GlyTyr: 1.574 ± 0.548
0.0GlyXaa: 0.0 ± 0.0
His
1.181HisAla: 1.181 ± 0.468
1.181HisCys: 1.181 ± 0.707
1.181HisAsp: 1.181 ± 0.618
0.0HisGlu: 0.0 ± 0.0
1.574HisPhe: 1.574 ± 1.382
1.574HisGly: 1.574 ± 0.743
0.787HisHis: 0.787 ± 0.448
0.787HisIle: 0.787 ± 0.491
0.787HisLys: 0.787 ± 0.4
4.723HisLeu: 4.723 ± 0.991
0.394HisMet: 0.394 ± 0.267
0.0HisAsn: 0.0 ± 0.0
1.574HisPro: 1.574 ± 0.665
2.361HisGln: 2.361 ± 1.031
0.394HisArg: 0.394 ± 0.424
2.361HisSer: 2.361 ± 1.208
1.574HisThr: 1.574 ± 0.684
1.968HisVal: 1.968 ± 0.771
0.0HisTrp: 0.0 ± 0.0
0.787HisTyr: 0.787 ± 0.4
0.0HisXaa: 0.0 ± 0.0
Ile
2.361IleAla: 2.361 ± 0.949
0.394IleCys: 0.394 ± 0.267
2.755IleAsp: 2.755 ± 1.065
4.723IleGlu: 4.723 ± 1.998
1.574IlePhe: 1.574 ± 0.605
3.542IleGly: 3.542 ± 1.222
1.968IleHis: 1.968 ± 0.684
5.116IleIle: 5.116 ± 1.341
4.723IleLys: 4.723 ± 1.494
5.116IleLeu: 5.116 ± 1.312
0.394IleMet: 0.394 ± 0.267
2.361IleAsn: 2.361 ± 0.727
3.935IlePro: 3.935 ± 0.5
3.148IleGln: 3.148 ± 0.949
3.542IleArg: 3.542 ± 0.674
1.968IleSer: 1.968 ± 1.073
2.755IleThr: 2.755 ± 1.106
5.116IleVal: 5.116 ± 2.051
0.787IleTrp: 0.787 ± 0.338
2.755IleTyr: 2.755 ± 0.7
0.0IleXaa: 0.0 ± 0.0
Lys
3.935LysAla: 3.935 ± 1.105
2.361LysCys: 2.361 ± 1.45
3.542LysAsp: 3.542 ± 1.975
7.871LysGlu: 7.871 ± 1.13
2.755LysPhe: 2.755 ± 1.028
4.723LysGly: 4.723 ± 1.008
1.968LysHis: 1.968 ± 0.894
5.903LysIle: 5.903 ± 2.33
5.116LysLys: 5.116 ± 1.554
3.935LysLeu: 3.935 ± 1.008
1.574LysMet: 1.574 ± 0.684
3.935LysAsn: 3.935 ± 1.4
2.755LysPro: 2.755 ± 0.953
4.723LysGln: 4.723 ± 1.457
2.755LysArg: 2.755 ± 1.106
1.968LysSer: 1.968 ± 0.582
2.361LysThr: 2.361 ± 0.621
5.51LysVal: 5.51 ± 1.72
0.787LysTrp: 0.787 ± 0.534
2.755LysTyr: 2.755 ± 1.336
0.0LysXaa: 0.0 ± 0.0
Leu
5.51LeuAla: 5.51 ± 1.775
1.181LeuCys: 1.181 ± 0.618
3.148LeuAsp: 3.148 ± 0.747
8.658LeuGlu: 8.658 ± 1.162
1.968LeuPhe: 1.968 ± 0.639
6.69LeuGly: 6.69 ± 1.325
1.574LeuHis: 1.574 ± 0.413
5.51LeuIle: 5.51 ± 1.762
5.903LeuLys: 5.903 ± 1.472
6.69LeuLeu: 6.69 ± 0.941
2.361LeuMet: 2.361 ± 0.706
3.542LeuAsn: 3.542 ± 0.834
3.148LeuPro: 3.148 ± 1.046
3.935LeuGln: 3.935 ± 1.32
3.542LeuArg: 3.542 ± 1.498
5.116LeuSer: 5.116 ± 1.855
3.148LeuThr: 3.148 ± 0.693
6.297LeuVal: 6.297 ± 1.012
2.361LeuTrp: 2.361 ± 1.228
0.787LeuTyr: 0.787 ± 0.652
0.0LeuXaa: 0.0 ± 0.0
Met
3.542MetAla: 3.542 ± 0.775
0.0MetCys: 0.0 ± 0.0
1.574MetAsp: 1.574 ± 0.399
1.968MetGlu: 1.968 ± 0.822
0.0MetPhe: 0.0 ± 0.0
2.755MetGly: 2.755 ± 0.763
1.181MetHis: 1.181 ± 0.949
0.787MetIle: 0.787 ± 0.534
0.0MetLys: 0.0 ± 0.0
2.361MetLeu: 2.361 ± 0.976
0.0MetMet: 0.0 ± 0.0
1.181MetAsn: 1.181 ± 0.532
1.181MetPro: 1.181 ± 0.678
1.574MetGln: 1.574 ± 0.735
0.787MetArg: 0.787 ± 0.4
1.968MetSer: 1.968 ± 1.239
1.574MetThr: 1.574 ± 0.684
0.394MetVal: 0.394 ± 0.267
0.394MetTrp: 0.394 ± 0.267
0.787MetTyr: 0.787 ± 0.778
0.0MetXaa: 0.0 ± 0.0
Asn
1.181AsnAla: 1.181 ± 0.678
1.574AsnCys: 1.574 ± 0.758
0.394AsnAsp: 0.394 ± 0.267
1.968AsnGlu: 1.968 ± 1.262
2.755AsnPhe: 2.755 ± 0.849
1.574AsnGly: 1.574 ± 0.684
1.181AsnHis: 1.181 ± 0.727
1.574AsnIle: 1.574 ± 0.675
1.574AsnLys: 1.574 ± 0.684
1.181AsnLeu: 1.181 ± 0.468
1.181AsnMet: 1.181 ± 0.78
0.394AsnAsn: 0.394 ± 0.267
4.723AsnPro: 4.723 ± 1.432
2.755AsnGln: 2.755 ± 1.028
2.361AsnArg: 2.361 ± 1.404
3.542AsnSer: 3.542 ± 0.598
1.968AsnThr: 1.968 ± 0.562
1.181AsnVal: 1.181 ± 0.532
0.787AsnTrp: 0.787 ± 0.338
1.574AsnTyr: 1.574 ± 1.214
0.0AsnXaa: 0.0 ± 0.0
Pro
3.542ProAla: 3.542 ± 0.937
0.787ProCys: 0.787 ± 0.4
1.574ProAsp: 1.574 ± 0.658
3.148ProGlu: 3.148 ± 1.101
1.574ProPhe: 1.574 ± 0.675
5.51ProGly: 5.51 ± 1.548
1.574ProHis: 1.574 ± 0.717
3.148ProIle: 3.148 ± 0.973
4.329ProLys: 4.329 ± 1.174
4.329ProLeu: 4.329 ± 1.026
0.394ProMet: 0.394 ± 0.389
1.181ProAsn: 1.181 ± 0.809
6.69ProPro: 6.69 ± 3.956
3.935ProGln: 3.935 ± 1.095
5.51ProArg: 5.51 ± 1.479
3.148ProSer: 3.148 ± 0.781
6.297ProThr: 6.297 ± 1.72
3.935ProVal: 3.935 ± 0.768
2.755ProTrp: 2.755 ± 1.335
1.574ProTyr: 1.574 ± 0.675
0.0ProXaa: 0.0 ± 0.0
Gln
3.542GlnAla: 3.542 ± 0.88
0.394GlnCys: 0.394 ± 0.389
1.181GlnAsp: 1.181 ± 0.532
4.329GlnGlu: 4.329 ± 2.098
1.181GlnPhe: 1.181 ± 0.802
5.51GlnGly: 5.51 ± 1.302
0.394GlnHis: 0.394 ± 0.389
4.329GlnIle: 4.329 ± 1.372
4.329GlnLys: 4.329 ± 0.681
3.148GlnLeu: 3.148 ± 0.909
1.574GlnMet: 1.574 ± 0.633
1.968GlnAsn: 1.968 ± 1.131
2.361GlnPro: 2.361 ± 1.444
2.755GlnGln: 2.755 ± 1.727
5.51GlnArg: 5.51 ± 2.007
3.148GlnSer: 3.148 ± 0.956
2.755GlnThr: 2.755 ± 0.646
2.755GlnVal: 2.755 ± 0.593
3.148GlnTrp: 3.148 ± 0.849
2.361GlnTyr: 2.361 ± 0.87
0.0GlnXaa: 0.0 ± 0.0
Arg
3.148ArgAla: 3.148 ± 1.829
0.394ArgCys: 0.394 ± 0.479
1.181ArgAsp: 1.181 ± 0.417
6.69ArgGlu: 6.69 ± 1.369
1.181ArgPhe: 1.181 ± 0.863
6.297ArgGly: 6.297 ± 1.449
1.574ArgHis: 1.574 ± 0.568
4.329ArgIle: 4.329 ± 1.537
3.935ArgLys: 3.935 ± 1.094
7.084ArgLeu: 7.084 ± 1.67
1.181ArgMet: 1.181 ± 0.483
1.968ArgAsn: 1.968 ± 0.778
3.148ArgPro: 3.148 ± 0.915
5.903ArgGln: 5.903 ± 1.533
7.477ArgArg: 7.477 ± 3.391
2.361ArgSer: 2.361 ± 1.735
1.968ArgThr: 1.968 ± 0.729
1.574ArgVal: 1.574 ± 0.717
1.968ArgTrp: 1.968 ± 0.968
1.574ArgTyr: 1.574 ± 0.917
0.0ArgXaa: 0.0 ± 0.0
Ser
2.755SerAla: 2.755 ± 0.82
1.574SerCys: 1.574 ± 0.548
2.755SerAsp: 2.755 ± 0.668
2.755SerGlu: 2.755 ± 0.674
0.0SerPhe: 0.0 ± 0.0
5.51SerGly: 5.51 ± 1.85
1.181SerHis: 1.181 ± 0.535
1.968SerIle: 1.968 ± 0.816
3.542SerLys: 3.542 ± 1.29
3.935SerLeu: 3.935 ± 1.41
1.181SerMet: 1.181 ± 0.606
0.787SerAsn: 0.787 ± 0.542
3.148SerPro: 3.148 ± 1.035
4.329SerGln: 4.329 ± 1.53
3.148SerArg: 3.148 ± 1.529
2.361SerSer: 2.361 ± 0.621
1.574SerThr: 1.574 ± 0.8
2.361SerVal: 2.361 ± 0.906
0.394SerTrp: 0.394 ± 0.479
1.968SerTyr: 1.968 ± 0.819
0.0SerXaa: 0.0 ± 0.0
Thr
4.329ThrAla: 4.329 ± 0.686
0.787ThrCys: 0.787 ± 0.4
3.542ThrAsp: 3.542 ± 1.516
3.148ThrGlu: 3.148 ± 1.231
1.181ThrPhe: 1.181 ± 0.532
4.329ThrGly: 4.329 ± 1.657
1.574ThrHis: 1.574 ± 0.775
1.181ThrIle: 1.181 ± 0.618
2.755ThrLys: 2.755 ± 1.009
3.935ThrLeu: 3.935 ± 0.769
1.181ThrMet: 1.181 ± 0.381
1.968ThrAsn: 1.968 ± 0.624
3.935ThrPro: 3.935 ± 1.071
1.181ThrGln: 1.181 ± 0.546
0.787ThrArg: 0.787 ± 0.72
2.361ThrSer: 2.361 ± 1.381
1.574ThrThr: 1.574 ± 0.675
2.755ThrVal: 2.755 ± 1.066
2.361ThrTrp: 2.361 ± 1.2
2.361ThrTyr: 2.361 ± 1.316
0.0ThrXaa: 0.0 ± 0.0
Val
3.935ValAla: 3.935 ± 1.965
1.181ValCys: 1.181 ± 0.801
3.148ValAsp: 3.148 ± 0.974
5.116ValGlu: 5.116 ± 1.181
0.787ValPhe: 0.787 ± 0.534
4.329ValGly: 4.329 ± 0.716
1.574ValHis: 1.574 ± 0.675
3.148ValIle: 3.148 ± 0.918
3.935ValLys: 3.935 ± 1.317
6.69ValLeu: 6.69 ± 1.433
0.787ValMet: 0.787 ± 0.338
2.361ValAsn: 2.361 ± 0.576
5.116ValPro: 5.116 ± 1.167
2.361ValGln: 2.361 ± 0.64
4.329ValArg: 4.329 ± 1.672
1.574ValSer: 1.574 ± 0.879
2.755ValThr: 2.755 ± 1.009
4.329ValVal: 4.329 ± 1.102
1.574ValTrp: 1.574 ± 0.687
0.394ValTyr: 0.394 ± 0.267
0.0ValXaa: 0.0 ± 0.0
Trp
1.574TrpAla: 1.574 ± 0.413
0.394TrpCys: 0.394 ± 0.389
1.968TrpAsp: 1.968 ± 0.757
1.181TrpGlu: 1.181 ± 0.618
0.0TrpPhe: 0.0 ± 0.0
1.181TrpGly: 1.181 ± 0.381
1.181TrpHis: 1.181 ± 1.014
1.181TrpIle: 1.181 ± 0.417
3.935TrpLys: 3.935 ± 0.907
1.181TrpLeu: 1.181 ± 0.827
1.181TrpMet: 1.181 ± 0.678
0.394TrpAsn: 0.394 ± 0.389
1.181TrpPro: 1.181 ± 0.802
1.574TrpGln: 1.574 ± 0.643
2.755TrpArg: 2.755 ± 1.079
1.181TrpSer: 1.181 ± 0.559
1.968TrpThr: 1.968 ± 0.883
1.181TrpVal: 1.181 ± 0.504
1.181TrpTrp: 1.181 ± 0.515
0.394TrpTyr: 0.394 ± 0.444
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.787TyrAla: 0.787 ± 0.888
0.394TyrCys: 0.394 ± 0.551
0.394TyrAsp: 0.394 ± 0.389
1.181TyrGlu: 1.181 ± 0.532
0.787TyrPhe: 0.787 ± 0.4
1.574TyrGly: 1.574 ± 0.708
0.394TyrHis: 0.394 ± 0.267
0.787TyrIle: 0.787 ± 0.534
2.755TyrLys: 2.755 ± 0.487
2.361TyrLeu: 2.361 ± 1.576
1.574TyrMet: 1.574 ± 0.658
2.361TyrAsn: 2.361 ± 0.602
1.574TyrPro: 1.574 ± 0.733
1.181TyrGln: 1.181 ± 0.667
2.361TyrArg: 2.361 ± 0.869
1.181TyrSer: 1.181 ± 0.996
2.361TyrThr: 2.361 ± 1.064
2.755TyrVal: 2.755 ± 0.582
1.181TyrTrp: 1.181 ± 0.572
1.574TyrTyr: 1.574 ± 0.741
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2542 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
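A protein-level bootstrap of this kind might look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is taken as the error. The function names, the use of the population standard deviation as the error measure, and the fixed seed are assumptions for illustration; the page does not state these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (per mille) over the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level resamples (assumed: pstdev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences (not real viral proteins).
toy = ["MKKLA", "AKK", "GGKKG", "LLAAL"]
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.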

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski