Amino acid dipepetide frequency for Simian immunodeficiency virus (isolate PBj14/BCL-3) (SIV-sm) (Simian immunodeficiency virus sooty mangabey monkey)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.723AlaAla: 3.723 ± 0.943
1.064AlaCys: 1.064 ± 0.498
3.723AlaAsp: 3.723 ± 1.046
6.915AlaGlu: 6.915 ± 1.504
1.862AlaPhe: 1.862 ± 0.611
4.255AlaGly: 4.255 ± 0.947
1.064AlaHis: 1.064 ± 0.709
3.989AlaIle: 3.989 ± 1.178
3.723AlaLys: 3.723 ± 0.885
4.255AlaLeu: 4.255 ± 1.027
2.394AlaMet: 2.394 ± 0.604
2.926AlaAsn: 2.926 ± 0.799
4.521AlaPro: 4.521 ± 0.758
2.394AlaGln: 2.394 ± 0.825
3.457AlaArg: 3.457 ± 0.612
3.457AlaSer: 3.457 ± 1.105
2.128AlaThr: 2.128 ± 0.543
2.66AlaVal: 2.66 ± 1.015
3.191AlaTrp: 3.191 ± 0.737
1.33AlaTyr: 1.33 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
0.798CysAla: 0.798 ± 0.743
0.532CysCys: 0.532 ± 0.497
0.532CysAsp: 0.532 ± 0.198
1.064CysGlu: 1.064 ± 0.569
0.532CysPhe: 0.532 ± 0.497
1.33CysGly: 1.33 ± 0.656
0.798CysHis: 0.798 ± 0.409
1.064CysIle: 1.064 ± 0.367
1.862CysLys: 1.862 ± 0.615
1.064CysLeu: 1.064 ± 0.766
0.266CysMet: 0.266 ± 0.183
1.33CysAsn: 1.33 ± 0.889
0.798CysPro: 0.798 ± 0.362
1.862CysGln: 1.862 ± 0.759
2.128CysArg: 2.128 ± 0.561
0.532CysSer: 0.532 ± 0.355
1.596CysThr: 1.596 ± 0.497
2.128CysVal: 2.128 ± 0.634
1.33CysTrp: 1.33 ± 0.442
1.862CysTyr: 1.862 ± 1.378
0.0CysXaa: 0.0 ± 0.0
Asp
2.394AspAla: 2.394 ± 0.437
0.798AspCys: 0.798 ± 0.388
1.064AspAsp: 1.064 ± 0.603
2.926AspGlu: 2.926 ± 1.144
0.798AspPhe: 0.798 ± 0.55
1.596AspGly: 1.596 ± 0.558
0.266AspHis: 0.266 ± 0.183
1.862AspIle: 1.862 ± 0.601
2.66AspLys: 2.66 ± 0.439
3.723AspLeu: 3.723 ± 1.051
0.532AspMet: 0.532 ± 0.302
1.064AspAsn: 1.064 ± 0.395
2.926AspPro: 2.926 ± 0.927
0.798AspGln: 0.798 ± 0.216
3.457AspArg: 3.457 ± 0.791
2.128AspSer: 2.128 ± 0.593
2.926AspThr: 2.926 ± 0.725
2.394AspVal: 2.394 ± 0.34
1.596AspTrp: 1.596 ± 0.551
1.596AspTyr: 1.596 ± 0.47
0.0AspXaa: 0.0 ± 0.0
Glu
7.447GluAla: 7.447 ± 1.271
0.0GluCys: 0.0 ± 0.0
2.394GluAsp: 2.394 ± 0.72
9.309GluGlu: 9.309 ± 2.02
0.798GluPhe: 0.798 ± 0.299
6.383GluGly: 6.383 ± 1.368
1.862GluHis: 1.862 ± 0.609
2.926GluIle: 2.926 ± 0.521
5.851GluLys: 5.851 ± 1.159
6.915GluLeu: 6.915 ± 1.136
1.33GluMet: 1.33 ± 0.566
2.394GluAsn: 2.394 ± 0.567
3.457GluPro: 3.457 ± 0.858
4.787GluGln: 4.787 ± 1.12
2.926GluArg: 2.926 ± 0.734
2.66GluSer: 2.66 ± 1.117
5.585GluThr: 5.585 ± 1.572
5.585GluVal: 5.585 ± 0.638
1.064GluTrp: 1.064 ± 0.431
1.596GluTyr: 1.596 ± 0.45
0.0GluXaa: 0.0 ± 0.0
Phe
2.128PheAla: 2.128 ± 0.405
0.266PheCys: 0.266 ± 0.248
1.33PheAsp: 1.33 ± 0.758
0.266PheGlu: 0.266 ± 0.248
0.532PhePhe: 0.532 ± 0.302
3.457PheGly: 3.457 ± 0.604
0.532PheHis: 0.532 ± 0.198
1.064PheIle: 1.064 ± 0.459
0.798PheLys: 0.798 ± 0.55
2.66PheLeu: 2.66 ± 1.24
0.798PheMet: 0.798 ± 0.533
0.798PheAsn: 0.798 ± 0.409
1.862PhePro: 1.862 ± 1.425
2.394PheGln: 2.394 ± 1.055
2.128PheArg: 2.128 ± 0.655
1.596PheSer: 1.596 ± 0.471
0.798PheThr: 0.798 ± 0.55
1.064PheVal: 1.064 ± 0.458
0.266PheTrp: 0.266 ± 0.343
1.33PheTyr: 1.33 ± 0.376
0.0PheXaa: 0.0 ± 0.0
Gly
3.457GlyAla: 3.457 ± 0.806
3.191GlyCys: 3.191 ± 0.728
2.394GlyAsp: 2.394 ± 0.947
5.585GlyGlu: 5.585 ± 1.416
3.191GlyPhe: 3.191 ± 0.685
7.979GlyGly: 7.979 ± 1.135
1.064GlyHis: 1.064 ± 0.571
5.319GlyIle: 5.319 ± 0.97
6.383GlyLys: 6.383 ± 2.617
6.915GlyLeu: 6.915 ± 1.689
0.532GlyMet: 0.532 ± 0.307
4.521GlyAsn: 4.521 ± 0.793
4.521GlyPro: 4.521 ± 1.68
3.723GlyGln: 3.723 ± 1.139
3.191GlyArg: 3.191 ± 0.521
3.457GlySer: 3.457 ± 0.717
3.723GlyThr: 3.723 ± 0.605
2.926GlyVal: 2.926 ± 0.41
1.596GlyTrp: 1.596 ± 0.772
1.33GlyTyr: 1.33 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.532HisAla: 0.532 ± 0.285
1.33HisCys: 1.33 ± 0.729
0.532HisAsp: 0.532 ± 0.307
0.532HisGlu: 0.532 ± 0.367
1.064HisPhe: 1.064 ± 0.712
1.596HisGly: 1.596 ± 0.488
0.532HisHis: 0.532 ± 0.355
0.798HisIle: 0.798 ± 0.216
1.33HisLys: 1.33 ± 0.72
4.255HisLeu: 4.255 ± 1.459
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.596HisPro: 1.596 ± 0.509
1.33HisGln: 1.33 ± 0.508
0.0HisArg: 0.0 ± 0.0
1.862HisSer: 1.862 ± 0.513
1.596HisThr: 1.596 ± 0.538
1.33HisVal: 1.33 ± 0.547
0.0HisTrp: 0.0 ± 0.0
0.532HisTyr: 0.532 ± 0.382
0.0HisXaa: 0.0 ± 0.0
Ile
2.394IleAla: 2.394 ± 0.647
0.532IleCys: 0.532 ± 0.332
1.064IleAsp: 1.064 ± 0.316
3.723IleGlu: 3.723 ± 1.075
1.33IlePhe: 1.33 ± 0.356
2.66IleGly: 2.66 ± 0.804
1.862IleHis: 1.862 ± 0.426
4.255IleIle: 4.255 ± 1.314
5.053IleLys: 5.053 ± 1.172
3.723IleLeu: 3.723 ± 0.763
0.798IleMet: 0.798 ± 0.299
3.723IleAsn: 3.723 ± 0.768
3.989IlePro: 3.989 ± 0.382
4.787IleGln: 4.787 ± 0.754
4.255IleArg: 4.255 ± 0.729
0.532IleSer: 0.532 ± 0.43
2.394IleThr: 2.394 ± 0.979
4.521IleVal: 4.521 ± 1.375
0.532IleTrp: 0.532 ± 0.367
2.926IleTyr: 2.926 ± 0.398
0.0IleXaa: 0.0 ± 0.0
Lys
4.787LysAla: 4.787 ± 1.168
2.926LysCys: 2.926 ± 1.042
3.191LysAsp: 3.191 ± 1.031
6.915LysGlu: 6.915 ± 1.309
2.66LysPhe: 2.66 ± 0.463
4.521LysGly: 4.521 ± 1.384
2.128LysHis: 2.128 ± 0.361
5.319LysIle: 5.319 ± 2.347
5.319LysLys: 5.319 ± 1.527
4.787LysLeu: 4.787 ± 0.918
2.128LysMet: 2.128 ± 0.54
3.457LysAsn: 3.457 ± 0.596
1.064LysPro: 1.064 ± 0.246
2.66LysGln: 2.66 ± 0.561
2.926LysArg: 2.926 ± 0.843
1.862LysSer: 1.862 ± 0.527
1.596LysThr: 1.596 ± 0.444
4.255LysVal: 4.255 ± 1.3
0.798LysTrp: 0.798 ± 0.449
1.862LysTyr: 1.862 ± 0.756
0.0LysXaa: 0.0 ± 0.0
Leu
7.181LeuAla: 7.181 ± 1.879
0.798LeuCys: 0.798 ± 0.288
3.457LeuAsp: 3.457 ± 0.819
7.979LeuGlu: 7.979 ± 0.964
1.862LeuPhe: 1.862 ± 0.484
5.851LeuGly: 5.851 ± 1.222
1.33LeuHis: 1.33 ± 0.511
4.787LeuIle: 4.787 ± 0.663
6.117LeuLys: 6.117 ± 1.286
7.979LeuLeu: 7.979 ± 1.336
1.33LeuMet: 1.33 ± 0.35
3.723LeuAsn: 3.723 ± 0.633
4.255LeuPro: 4.255 ± 0.672
5.053LeuGln: 5.053 ± 1.077
4.255LeuArg: 4.255 ± 0.762
5.585LeuSer: 5.585 ± 1.021
5.585LeuThr: 5.585 ± 1.779
5.585LeuVal: 5.585 ± 1.293
2.394LeuTrp: 2.394 ± 0.704
1.862LeuTyr: 1.862 ± 0.678
0.0LeuXaa: 0.0 ± 0.0
Met
2.128MetAla: 2.128 ± 0.896
0.0MetCys: 0.0 ± 0.0
0.798MetAsp: 0.798 ± 0.396
1.064MetGlu: 1.064 ± 0.459
0.532MetPhe: 0.532 ± 0.497
2.66MetGly: 2.66 ± 0.693
0.532MetHis: 0.532 ± 0.497
0.532MetIle: 0.532 ± 0.367
0.266MetLys: 0.266 ± 0.281
1.596MetLeu: 1.596 ± 0.494
0.266MetMet: 0.266 ± 0.248
1.33MetAsn: 1.33 ± 0.406
0.798MetPro: 0.798 ± 0.536
0.532MetGln: 0.532 ± 0.285
0.798MetArg: 0.798 ± 0.409
2.128MetSer: 2.128 ± 0.987
2.394MetThr: 2.394 ± 0.336
0.532MetVal: 0.532 ± 0.198
0.266MetTrp: 0.266 ± 0.248
1.064MetTyr: 1.064 ± 0.571
0.0MetXaa: 0.0 ± 0.0
Asn
1.862AsnAla: 1.862 ± 0.601
2.66AsnCys: 2.66 ± 0.688
1.064AsnAsp: 1.064 ± 0.646
2.394AsnGlu: 2.394 ± 0.759
1.862AsnPhe: 1.862 ± 0.606
1.862AsnGly: 1.862 ± 0.783
1.064AsnHis: 1.064 ± 0.424
2.394AsnIle: 2.394 ± 0.494
3.191AsnLys: 3.191 ± 0.794
2.66AsnLeu: 2.66 ± 0.368
1.064AsnMet: 1.064 ± 0.458
1.064AsnAsn: 1.064 ± 0.477
4.255AsnPro: 4.255 ± 1.33
2.66AsnGln: 2.66 ± 0.495
2.128AsnArg: 2.128 ± 0.845
3.191AsnSer: 3.191 ± 0.479
3.191AsnThr: 3.191 ± 0.521
1.33AsnVal: 1.33 ± 0.406
2.128AsnTrp: 2.128 ± 0.705
2.926AsnTyr: 2.926 ± 0.738
0.0AsnXaa: 0.0 ± 0.0
Pro
4.255ProAla: 4.255 ± 0.59
1.064ProCys: 1.064 ± 0.788
2.394ProAsp: 2.394 ± 0.684
2.926ProGlu: 2.926 ± 0.747
1.596ProPhe: 1.596 ± 0.777
6.383ProGly: 6.383 ± 1.275
0.798ProHis: 0.798 ± 0.449
3.723ProIle: 3.723 ± 0.916
2.66ProLys: 2.66 ± 0.631
4.787ProLeu: 4.787 ± 1.115
1.33ProMet: 1.33 ± 0.404
1.33ProAsn: 1.33 ± 0.71
5.851ProPro: 5.851 ± 2.547
2.926ProGln: 2.926 ± 1.14
5.053ProArg: 5.053 ± 0.919
1.33ProSer: 1.33 ± 0.527
5.053ProThr: 5.053 ± 0.95
3.457ProVal: 3.457 ± 0.739
1.33ProTrp: 1.33 ± 0.756
1.596ProTyr: 1.596 ± 0.631
0.0ProXaa: 0.0 ± 0.0
Gln
3.723GlnAla: 3.723 ± 0.689
0.798GlnCys: 0.798 ± 0.216
0.798GlnAsp: 0.798 ± 0.396
5.053GlnGlu: 5.053 ± 1.463
1.064GlnPhe: 1.064 ± 0.539
5.053GlnGly: 5.053 ± 1.398
1.064GlnHis: 1.064 ± 0.45
5.053GlnIle: 5.053 ± 0.861
5.053GlnLys: 5.053 ± 1.235
4.255GlnLeu: 4.255 ± 0.665
1.596GlnMet: 1.596 ± 0.692
2.66GlnAsn: 2.66 ± 0.676
1.862GlnPro: 1.862 ± 0.377
5.053GlnGln: 5.053 ± 1.188
4.521GlnArg: 4.521 ± 1.422
2.926GlnSer: 2.926 ± 0.51
3.723GlnThr: 3.723 ± 0.609
2.128GlnVal: 2.128 ± 0.461
2.394GlnTrp: 2.394 ± 0.965
1.862GlnTyr: 1.862 ± 0.588
0.0GlnXaa: 0.0 ± 0.0
Arg
3.191ArgAla: 3.191 ± 0.652
1.33ArgCys: 1.33 ± 0.874
1.862ArgAsp: 1.862 ± 1.053
6.117ArgGlu: 6.117 ± 1.164
1.596ArgPhe: 1.596 ± 0.494
4.521ArgGly: 4.521 ± 1.033
1.33ArgHis: 1.33 ± 0.712
2.926ArgIle: 2.926 ± 1.167
1.064ArgLys: 1.064 ± 0.536
6.117ArgLeu: 6.117 ± 1.139
1.33ArgMet: 1.33 ± 0.338
3.723ArgAsn: 3.723 ± 0.98
3.457ArgPro: 3.457 ± 0.894
6.383ArgGln: 6.383 ± 0.711
8.777ArgArg: 8.777 ± 3.518
1.064ArgSer: 1.064 ± 0.624
2.394ArgThr: 2.394 ± 0.795
1.862ArgVal: 1.862 ± 0.626
1.862ArgTrp: 1.862 ± 0.617
2.394ArgTyr: 2.394 ± 0.754
0.0ArgXaa: 0.0 ± 0.0
Ser
3.191SerAla: 3.191 ± 1.579
2.128SerCys: 2.128 ± 0.829
2.128SerAsp: 2.128 ± 0.686
2.66SerGlu: 2.66 ± 0.617
0.532SerPhe: 0.532 ± 0.537
3.989SerGly: 3.989 ± 0.585
0.798SerHis: 0.798 ± 0.452
1.596SerIle: 1.596 ± 0.582
1.596SerLys: 1.596 ± 0.585
5.053SerLeu: 5.053 ± 1.089
0.266SerMet: 0.266 ± 0.267
1.33SerAsn: 1.33 ± 0.71
2.128SerPro: 2.128 ± 0.296
3.723SerGln: 3.723 ± 1.104
4.521SerArg: 4.521 ± 1.352
3.191SerSer: 3.191 ± 1.988
2.66SerThr: 2.66 ± 0.403
2.394SerVal: 2.394 ± 0.525
1.33SerTrp: 1.33 ± 1.007
1.064SerTyr: 1.064 ± 0.474
0.0SerXaa: 0.0 ± 0.0
Thr
5.053ThrAla: 5.053 ± 0.629
0.798ThrCys: 0.798 ± 0.34
2.66ThrAsp: 2.66 ± 0.505
3.989ThrGlu: 3.989 ± 0.752
1.064ThrPhe: 1.064 ± 0.443
3.723ThrGly: 3.723 ± 0.93
1.596ThrHis: 1.596 ± 0.661
1.596ThrIle: 1.596 ± 0.435
2.394ThrLys: 2.394 ± 0.561
5.851ThrLeu: 5.851 ± 1.192
1.064ThrMet: 1.064 ± 0.646
2.926ThrAsn: 2.926 ± 0.471
5.585ThrPro: 5.585 ± 1.015
2.926ThrGln: 2.926 ± 0.786
0.798ThrArg: 0.798 ± 0.743
4.521ThrSer: 4.521 ± 1.642
3.723ThrThr: 3.723 ± 2.083
4.521ThrVal: 4.521 ± 1.027
3.191ThrTrp: 3.191 ± 1.431
1.596ThrTyr: 1.596 ± 1.037
0.0ThrXaa: 0.0 ± 0.0
Val
2.66ValAla: 2.66 ± 1.252
1.064ValCys: 1.064 ± 0.367
3.191ValAsp: 3.191 ± 1.03
3.191ValGlu: 3.191 ± 0.477
1.596ValPhe: 1.596 ± 0.817
3.989ValGly: 3.989 ± 0.9
0.532ValHis: 0.532 ± 0.367
2.66ValIle: 2.66 ± 0.794
4.521ValLys: 4.521 ± 0.954
6.915ValLeu: 6.915 ± 1.123
0.798ValMet: 0.798 ± 0.388
2.394ValAsn: 2.394 ± 0.587
4.255ValPro: 4.255 ± 0.858
4.255ValGln: 4.255 ± 0.503
2.394ValArg: 2.394 ± 0.932
2.128ValSer: 2.128 ± 0.717
4.521ValThr: 4.521 ± 1.292
4.255ValVal: 4.255 ± 0.652
1.33ValTrp: 1.33 ± 0.465
0.532ValTyr: 0.532 ± 0.198
0.0ValXaa: 0.0 ± 0.0
Trp
1.33TrpAla: 1.33 ± 0.547
0.798TrpCys: 0.798 ± 0.216
2.128TrpAsp: 2.128 ± 0.501
1.33TrpGlu: 1.33 ± 0.538
1.064TrpPhe: 1.064 ± 0.99
1.862TrpGly: 1.862 ± 0.836
1.064TrpHis: 1.064 ± 0.624
1.596TrpIle: 1.596 ± 0.312
2.926TrpLys: 2.926 ± 0.562
1.33TrpLeu: 1.33 ± 0.735
1.33TrpMet: 1.33 ± 0.656
1.33TrpAsn: 1.33 ± 0.406
1.33TrpPro: 1.33 ± 0.613
1.596TrpGln: 1.596 ± 0.469
2.926TrpArg: 2.926 ± 1.063
0.266TrpSer: 0.266 ± 0.183
1.862TrpThr: 1.862 ± 0.683
1.33TrpVal: 1.33 ± 0.295
1.064TrpTrp: 1.064 ± 0.367
0.532TrpTyr: 0.532 ± 0.404
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.798TyrAla: 0.798 ± 0.707
1.33TyrCys: 1.33 ± 0.861
0.798TyrAsp: 0.798 ± 0.397
1.33TyrGlu: 1.33 ± 0.354
0.532TyrPhe: 0.532 ± 0.355
1.33TyrGly: 1.33 ± 0.674
0.532TyrHis: 0.532 ± 0.332
1.596TyrIle: 1.596 ± 0.566
2.394TyrLys: 2.394 ± 0.637
2.394TyrLeu: 2.394 ± 1.091
0.798TyrMet: 0.798 ± 0.216
2.926TyrAsn: 2.926 ± 0.324
1.33TyrPro: 1.33 ± 0.452
0.798TyrGln: 0.798 ± 0.388
2.66TyrArg: 2.66 ± 0.54
1.862TyrSer: 1.862 ± 1.105
2.128TyrThr: 2.128 ± 0.366
2.926TyrVal: 2.926 ± 0.602
1.33TyrTrp: 1.33 ± 0.494
1.064TyrTyr: 1.064 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3761 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski