Amino acid dipeptide frequency for Simian immunodeficiency virus (isolate MB66) (SIV-cpz) (Chimpanzee immunodeficiency virus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
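As an illustration of how such a table can be built, here is a minimal sketch in Python that counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the database's own pipeline is not described on this page and may differ in detail.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    overlapping pairs counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs are taken within a protein, never across two.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only (not the SIV-cpz proteome).
freqs = dipeptide_per_mille(["MAAKL", "AAB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input, the non-standard letter `B` is folded into `X`, so the second sequence contributes the pairs `AA` and `AX`.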

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
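The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical. Taking 1/400 (2.5‰) as the threshold follows the 0.25% figure given above, but whether the page's red/blue colouring uses exactly this cut-off, or takes the error into account, is an assumption.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, n_dipeptides=400):
    """Compare a frequency with the uniform expectation of 1/n_dipeptides."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table: AlaAla = 3.865, CysCys = 0.276.
print(per_mille_to_fraction(3.865))
print(classify(3.865))
print(classify(0.276))
```

Passing `n_dipeptides=441` gives the expectation for the variant that counts Xaa as a 21st amino acid.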

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.865AlaAla: 3.865 ± 0.878
1.933AlaCys: 1.933 ± 0.473
1.38AlaAsp: 1.38 ± 0.306
5.522AlaGlu: 5.522 ± 1.38
2.209AlaPhe: 2.209 ± 0.605
3.865AlaGly: 3.865 ± 0.851
0.828AlaHis: 0.828 ± 0.534
3.037AlaIle: 3.037 ± 0.862
2.485AlaLys: 2.485 ± 0.447
6.626AlaLeu: 6.626 ± 1.036
2.485AlaMet: 2.485 ± 0.549
3.313AlaAsn: 3.313 ± 0.922
1.38AlaPro: 1.38 ± 0.526
3.589AlaGln: 3.589 ± 1.01
4.97AlaArg: 4.97 ± 1.296
3.037AlaSer: 3.037 ± 0.675
3.037AlaThr: 3.037 ± 1.125
3.037AlaVal: 3.037 ± 0.746
1.933AlaTrp: 1.933 ± 0.391
0.828AlaTyr: 0.828 ± 0.534
0.0AlaXaa: 0.0 ± 0.0
Cys
0.552CysAla: 0.552 ± 0.49
0.276CysCys: 0.276 ± 0.501
0.276CysAsp: 0.276 ± 0.178
0.552CysGlu: 0.552 ± 0.399
1.38CysPhe: 1.38 ± 0.693
1.657CysGly: 1.657 ± 0.499
0.276CysHis: 0.276 ± 0.389
0.552CysIle: 0.552 ± 0.49
2.761CysLys: 2.761 ± 0.797
0.552CysLeu: 0.552 ± 0.59
0.276CysMet: 0.276 ± 0.343
1.657CysAsn: 1.657 ± 0.714
1.104CysPro: 1.104 ± 0.518
1.38CysGln: 1.38 ± 0.512
0.552CysArg: 0.552 ± 0.49
1.38CysSer: 1.38 ± 0.892
2.761CysThr: 2.761 ± 0.884
1.657CysVal: 1.657 ± 0.521
0.828CysTrp: 0.828 ± 0.377
0.552CysTyr: 0.552 ± 1.003
0.0CysXaa: 0.0 ± 0.0
Asp
1.104AspAla: 1.104 ± 0.256
1.933AspCys: 1.933 ± 0.693
1.38AspAsp: 1.38 ± 0.481
1.657AspGlu: 1.657 ± 0.643
0.828AspPhe: 0.828 ± 0.534
1.657AspGly: 1.657 ± 0.684
0.276AspHis: 0.276 ± 0.363
4.141AspIle: 4.141 ± 0.697
2.761AspLys: 2.761 ± 0.962
3.865AspLeu: 3.865 ± 1.228
0.552AspMet: 0.552 ± 0.49
2.209AspAsn: 2.209 ± 1.017
2.209AspPro: 2.209 ± 1.323
2.485AspGln: 2.485 ± 0.583
4.97AspArg: 4.97 ± 1.288
2.209AspSer: 2.209 ± 0.652
3.589AspThr: 3.589 ± 0.654
0.828AspVal: 0.828 ± 0.298
0.552AspTrp: 0.552 ± 0.546
1.104AspTyr: 1.104 ± 0.518
0.0AspXaa: 0.0 ± 0.0
Glu
5.522GluAla: 5.522 ± 1.1
0.0GluCys: 0.0 ± 0.0
2.209GluAsp: 2.209 ± 1.44
7.178GluGlu: 7.178 ± 1.294
0.828GluPhe: 0.828 ± 0.298
7.454GluGly: 7.454 ± 1.431
0.552GluHis: 0.552 ± 0.356
3.865GluIle: 3.865 ± 1.079
4.141GluLys: 4.141 ± 0.869
7.178GluLeu: 7.178 ± 1.323
0.828GluMet: 0.828 ± 0.294
2.485GluAsn: 2.485 ± 0.421
5.246GluPro: 5.246 ± 1.028
2.761GluGln: 2.761 ± 0.842
3.865GluArg: 3.865 ± 1.556
3.313GluSer: 3.313 ± 1.287
6.074GluThr: 6.074 ± 0.973
6.902GluVal: 6.902 ± 1.717
1.38GluTrp: 1.38 ± 0.646
1.104GluTyr: 1.104 ± 0.439
0.0GluXaa: 0.0 ± 0.0
Phe
2.209PheAla: 2.209 ± 0.618
0.552PheCys: 0.552 ± 0.49
1.104PheAsp: 1.104 ± 0.794
0.0PheGlu: 0.0 ± 0.0
0.828PhePhe: 0.828 ± 0.344
1.657PheGly: 1.657 ± 0.583
0.0PheHis: 0.0 ± 0.0
1.38PheIle: 1.38 ± 0.508
1.933PheLys: 1.933 ± 0.648
3.037PheLeu: 3.037 ± 1.173
0.552PheMet: 0.552 ± 0.284
2.485PheAsn: 2.485 ± 0.687
2.209PhePro: 2.209 ± 0.919
1.657PheGln: 1.657 ± 0.531
2.485PheArg: 2.485 ± 1.19
0.828PheSer: 0.828 ± 0.377
1.933PheThr: 1.933 ± 0.652
0.0PheVal: 0.0 ± 0.0
0.276PheTrp: 0.276 ± 0.178
1.933PheTyr: 1.933 ± 0.485
0.0PheXaa: 0.0 ± 0.0
Gly
4.97GlyAla: 4.97 ± 0.747
1.933GlyCys: 1.933 ± 0.516
3.865GlyAsp: 3.865 ± 0.884
2.209GlyGlu: 2.209 ± 0.695
2.209GlyPhe: 2.209 ± 0.735
7.731GlyGly: 7.731 ± 1.076
2.761GlyHis: 2.761 ± 1.398
8.559GlyIle: 8.559 ± 2.003
5.522GlyLys: 5.522 ± 2.069
4.417GlyLeu: 4.417 ± 1.163
0.276GlyMet: 0.276 ± 0.178
3.037GlyAsn: 3.037 ± 1.267
5.522GlyPro: 5.522 ± 1.27
5.522GlyGln: 5.522 ± 0.904
2.485GlyArg: 2.485 ± 0.573
3.865GlySer: 3.865 ± 1.75
4.417GlyThr: 4.417 ± 2.465
2.209GlyVal: 2.209 ± 0.587
1.38GlyTrp: 1.38 ± 0.734
1.657GlyTyr: 1.657 ± 0.596
0.0GlyXaa: 0.0 ± 0.0
His
0.828HisAla: 0.828 ± 0.228
0.552HisCys: 0.552 ± 0.547
0.0HisAsp: 0.0 ± 0.0
0.552HisGlu: 0.552 ± 0.207
0.552HisPhe: 0.552 ± 0.546
1.657HisGly: 1.657 ± 0.695
0.828HisHis: 0.828 ± 0.968
1.657HisIle: 1.657 ± 1.095
1.104HisLys: 1.104 ± 0.451
2.761HisLeu: 2.761 ± 0.571
0.552HisMet: 0.552 ± 0.673
1.104HisAsn: 1.104 ± 0.332
1.933HisPro: 1.933 ± 0.807
3.037HisGln: 3.037 ± 1.159
1.104HisArg: 1.104 ± 0.478
1.104HisSer: 1.104 ± 0.666
0.828HisThr: 0.828 ± 0.446
0.276HisVal: 0.276 ± 0.178
0.0HisTrp: 0.0 ± 0.0
0.828HisTyr: 0.828 ± 0.645
0.0HisXaa: 0.0 ± 0.0
Ile
2.209IleAla: 2.209 ± 0.531
1.38IleCys: 1.38 ± 0.448
2.209IleAsp: 2.209 ± 0.695
4.417IleGlu: 4.417 ± 1.076
1.933IlePhe: 1.933 ± 0.811
4.694IleGly: 4.694 ± 1.115
1.933IleHis: 1.933 ± 0.589
6.35IleIle: 6.35 ± 1.303
3.865IleLys: 3.865 ± 0.778
6.902IleLeu: 6.902 ± 1.284
0.276IleMet: 0.276 ± 0.363
1.657IleAsn: 1.657 ± 0.874
5.246IlePro: 5.246 ± 0.911
3.037IleGln: 3.037 ± 1.708
4.694IleArg: 4.694 ± 1.175
3.313IleSer: 3.313 ± 0.603
2.209IleThr: 2.209 ± 1.338
7.178IleVal: 7.178 ± 1.45
1.38IleTrp: 1.38 ± 0.509
2.209IleTyr: 2.209 ± 0.592
0.0IleXaa: 0.0 ± 0.0
Lys
5.246LysAla: 5.246 ± 0.988
2.209LysCys: 2.209 ± 0.527
1.657LysAsp: 1.657 ± 0.377
7.178LysGlu: 7.178 ± 1.496
0.276LysPhe: 0.276 ± 0.178
5.798LysGly: 5.798 ± 1.026
1.38LysHis: 1.38 ± 0.542
4.417LysIle: 4.417 ± 1.238
4.694LysLys: 4.694 ± 1.373
5.798LysLeu: 5.798 ± 1.114
0.828LysMet: 0.828 ± 0.377
1.933LysAsn: 1.933 ± 0.624
1.38LysPro: 1.38 ± 0.4
4.97LysGln: 4.97 ± 1.66
4.141LysArg: 4.141 ± 0.838
2.761LysSer: 2.761 ± 0.786
3.313LysThr: 3.313 ± 0.645
4.417LysVal: 4.417 ± 0.96
1.38LysTrp: 1.38 ± 0.474
1.657LysTyr: 1.657 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
4.417LeuAla: 4.417 ± 1.174
1.657LeuCys: 1.657 ± 0.874
4.417LeuAsp: 4.417 ± 0.747
7.731LeuGlu: 7.731 ± 1.128
3.313LeuPhe: 3.313 ± 1.387
6.074LeuGly: 6.074 ± 1.25
2.209LeuHis: 2.209 ± 0.732
4.417LeuIle: 4.417 ± 1.848
6.626LeuLys: 6.626 ± 1.386
6.902LeuLeu: 6.902 ± 2.708
1.104LeuMet: 1.104 ± 0.457
4.97LeuAsn: 4.97 ± 1.174
2.485LeuPro: 2.485 ± 1.105
4.141LeuGln: 4.141 ± 0.696
5.798LeuArg: 5.798 ± 1.239
3.865LeuSer: 3.865 ± 1.524
4.141LeuThr: 4.141 ± 0.448
5.522LeuVal: 5.522 ± 1.472
3.589LeuTrp: 3.589 ± 0.758
2.209LeuTyr: 2.209 ± 0.946
0.0LeuXaa: 0.0 ± 0.0
Met
1.657MetAla: 1.657 ± 0.697
0.0MetCys: 0.0 ± 0.0
1.104MetAsp: 1.104 ± 0.44
2.485MetGlu: 2.485 ± 0.672
0.828MetPhe: 0.828 ± 0.228
1.38MetGly: 1.38 ± 0.347
0.552MetHis: 0.552 ± 0.503
0.552MetIle: 0.552 ± 0.352
0.828MetLys: 0.828 ± 0.228
1.104MetLeu: 1.104 ± 0.567
0.552MetMet: 0.552 ± 0.284
0.828MetAsn: 0.828 ± 0.321
0.0MetPro: 0.0 ± 0.0
0.828MetGln: 0.828 ± 0.228
1.933MetArg: 1.933 ± 0.495
0.552MetSer: 0.552 ± 0.284
2.209MetThr: 2.209 ± 0.466
0.828MetVal: 0.828 ± 0.228
0.276MetTrp: 0.276 ± 0.245
1.104MetTyr: 1.104 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
2.485AsnAla: 2.485 ± 0.685
4.141AsnCys: 4.141 ± 1.24
2.485AsnAsp: 2.485 ± 0.687
4.417AsnGlu: 4.417 ± 1.166
3.037AsnPhe: 3.037 ± 1.093
1.38AsnGly: 1.38 ± 0.643
0.828AsnHis: 0.828 ± 1.096
2.761AsnIle: 2.761 ± 0.784
2.761AsnLys: 2.761 ± 0.61
2.209AsnLeu: 2.209 ± 0.974
1.657AsnMet: 1.657 ± 0.859
3.313AsnAsn: 3.313 ± 1.249
3.865AsnPro: 3.865 ± 0.576
1.104AsnGln: 1.104 ± 0.607
2.209AsnArg: 2.209 ± 0.84
1.38AsnSer: 1.38 ± 1.35
4.694AsnThr: 4.694 ± 1.068
1.933AsnVal: 1.933 ± 1.067
1.933AsnTrp: 1.933 ± 0.305
0.552AsnTyr: 0.552 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
3.313ProAla: 3.313 ± 0.956
0.552ProCys: 0.552 ± 0.49
2.209ProAsp: 2.209 ± 0.502
4.97ProGlu: 4.97 ± 1.064
0.552ProPhe: 0.552 ± 0.356
4.417ProGly: 4.417 ± 1.568
0.276ProHis: 0.276 ± 0.178
6.074ProIle: 6.074 ± 1.013
2.209ProLys: 2.209 ± 1.091
4.141ProLeu: 4.141 ± 1.281
0.828ProMet: 0.828 ± 0.426
0.828ProAsn: 0.828 ± 0.916
3.589ProPro: 3.589 ± 1.043
2.485ProGln: 2.485 ± 0.829
3.589ProArg: 3.589 ± 1.074
3.313ProSer: 3.313 ± 0.676
1.38ProThr: 1.38 ± 0.645
6.074ProVal: 6.074 ± 1.193
0.552ProTrp: 0.552 ± 0.547
1.104ProTyr: 1.104 ± 0.631
0.0ProXaa: 0.0 ± 0.0
Gln
4.97GlnAla: 4.97 ± 0.851
0.552GlnCys: 0.552 ± 0.547
2.761GlnAsp: 2.761 ± 0.813
5.246GlnGlu: 5.246 ± 1.544
1.104GlnPhe: 1.104 ± 0.481
5.522GlnGly: 5.522 ± 1.296
1.38GlnHis: 1.38 ± 0.491
3.865GlnIle: 3.865 ± 0.942
4.694GlnLys: 4.694 ± 1.309
7.178GlnLeu: 7.178 ± 1.239
3.037GlnMet: 3.037 ± 1.294
2.209GlnAsn: 2.209 ± 0.686
1.933GlnPro: 1.933 ± 1.453
4.694GlnGln: 4.694 ± 1.035
2.485GlnArg: 2.485 ± 1.331
1.38GlnSer: 1.38 ± 0.54
1.933GlnThr: 1.933 ± 0.914
2.209GlnVal: 2.209 ± 0.826
0.828GlnTrp: 0.828 ± 0.298
1.933GlnTyr: 1.933 ± 0.715
0.0GlnXaa: 0.0 ± 0.0
Arg
5.246ArgAla: 5.246 ± 0.97
0.276ArgCys: 0.276 ± 0.363
4.417ArgAsp: 4.417 ± 0.934
5.798ArgGlu: 5.798 ± 1.206
1.657ArgPhe: 1.657 ± 0.499
2.485ArgGly: 2.485 ± 0.915
1.104ArgHis: 1.104 ± 0.746
4.417ArgIle: 4.417 ± 1.735
3.865ArgLys: 3.865 ± 1.396
3.313ArgLeu: 3.313 ± 1.191
0.828ArgMet: 0.828 ± 0.321
4.141ArgAsn: 4.141 ± 0.827
3.037ArgPro: 3.037 ± 1.029
5.246ArgGln: 5.246 ± 1.558
5.522ArgArg: 5.522 ± 3.371
1.933ArgSer: 1.933 ± 0.521
3.037ArgThr: 3.037 ± 0.645
2.761ArgVal: 2.761 ± 0.752
2.485ArgTrp: 2.485 ± 0.994
0.828ArgTyr: 0.828 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
1.933SerAla: 1.933 ± 0.541
0.276SerCys: 0.276 ± 0.178
2.209SerAsp: 2.209 ± 0.819
3.865SerGlu: 3.865 ± 1.055
1.38SerPhe: 1.38 ± 0.526
1.933SerGly: 1.933 ± 0.794
0.828SerHis: 0.828 ± 0.426
2.761SerIle: 2.761 ± 0.61
1.933SerLys: 1.933 ± 0.828
5.798SerLeu: 5.798 ± 1.342
0.828SerMet: 0.828 ± 0.461
3.313SerAsn: 3.313 ± 0.571
3.037SerPro: 3.037 ± 1.046
3.589SerGln: 3.589 ± 1.859
3.589SerArg: 3.589 ± 1.322
2.485SerSer: 2.485 ± 0.834
4.694SerThr: 4.694 ± 1.268
2.209SerVal: 2.209 ± 0.418
0.828SerTrp: 0.828 ± 0.298
0.828SerTyr: 0.828 ± 0.656
0.0SerXaa: 0.0 ± 0.0
Thr
3.037ThrAla: 3.037 ± 0.482
0.552ThrCys: 0.552 ± 0.515
3.037ThrAsp: 3.037 ± 0.623
4.141ThrGlu: 4.141 ± 0.994
1.933ThrPhe: 1.933 ± 0.995
5.522ThrGly: 5.522 ± 2.822
0.552ThrHis: 0.552 ± 0.207
3.589ThrIle: 3.589 ± 1.006
2.761ThrLys: 2.761 ± 0.754
6.35ThrLeu: 6.35 ± 1.266
0.828ThrMet: 0.828 ± 0.228
3.589ThrAsn: 3.589 ± 1.021
2.761ThrPro: 2.761 ± 0.749
3.037ThrGln: 3.037 ± 0.763
1.104ThrArg: 1.104 ± 0.449
4.97ThrSer: 4.97 ± 1.078
5.246ThrThr: 5.246 ± 1.022
6.074ThrVal: 6.074 ± 1.777
1.657ThrTrp: 1.657 ± 0.59
1.104ThrTyr: 1.104 ± 0.581
0.0ThrXaa: 0.0 ± 0.0
Val
3.313ValAla: 3.313 ± 1.061
0.0ValCys: 0.0 ± 0.0
1.657ValAsp: 1.657 ± 0.662
2.761ValGlu: 2.761 ± 0.471
0.552ValPhe: 0.552 ± 0.207
6.626ValGly: 6.626 ± 0.988
3.313ValHis: 3.313 ± 0.588
3.589ValIle: 3.589 ± 1.214
6.074ValLys: 6.074 ± 1.503
4.97ValLeu: 4.97 ± 0.906
0.276ValMet: 0.276 ± 0.363
2.209ValAsn: 2.209 ± 0.891
3.037ValPro: 3.037 ± 0.692
3.313ValGln: 3.313 ± 0.671
3.589ValArg: 3.589 ± 1.401
5.246ValSer: 5.246 ± 1.163
3.037ValThr: 3.037 ± 1.083
3.589ValVal: 3.589 ± 0.992
2.209ValTrp: 2.209 ± 0.83
1.933ValTyr: 1.933 ± 0.791
0.0ValXaa: 0.0 ± 0.0
Trp
1.933TrpAla: 1.933 ± 0.389
0.828TrpCys: 0.828 ± 0.426
1.104TrpAsp: 1.104 ± 0.379
1.657TrpGlu: 1.657 ± 0.531
0.552TrpPhe: 0.552 ± 0.399
1.657TrpGly: 1.657 ± 0.574
0.276TrpHis: 0.276 ± 0.363
0.828TrpIle: 0.828 ± 0.377
1.657TrpLys: 1.657 ± 0.842
0.828TrpLeu: 0.828 ± 0.56
1.657TrpMet: 1.657 ± 0.499
1.657TrpAsn: 1.657 ± 1.074
1.104TrpPro: 1.104 ± 0.468
1.933TrpGln: 1.933 ± 0.693
1.657TrpArg: 1.657 ± 0.588
0.276TrpSer: 0.276 ± 0.348
2.209TrpThr: 2.209 ± 0.613
1.657TrpVal: 1.657 ± 0.499
0.552TrpTrp: 0.552 ± 0.356
0.828TrpTyr: 0.828 ± 0.417
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.552TyrAla: 0.552 ± 0.356
1.38TyrCys: 1.38 ± 0.557
0.828TyrAsp: 0.828 ± 0.298
0.552TyrGlu: 0.552 ± 0.352
1.104TyrPhe: 1.104 ± 0.332
1.38TyrGly: 1.38 ± 0.652
1.104TyrHis: 1.104 ± 0.53
0.276TyrIle: 0.276 ± 0.245
2.761TyrLys: 2.761 ± 0.614
1.657TyrLeu: 1.657 ± 0.323
1.104TyrMet: 1.104 ± 0.518
2.209TyrAsn: 2.209 ± 0.805
1.38TyrPro: 1.38 ± 0.776
1.38TyrGln: 1.38 ± 0.72
1.657TyrArg: 1.657 ± 0.913
1.104TyrSer: 1.104 ± 0.478
1.104TyrThr: 1.104 ± 0.499
1.933TyrVal: 1.933 ± 0.807
0.828TyrTrp: 0.828 ± 0.321
1.38TyrTyr: 1.38 ± 0.419
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3623 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
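Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is a minimal sketch under stated assumptions, not the database's actual code. It assumes that "x100" means 100 resampling rounds and that the reported error is the standard deviation of the replicates; the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only.
toy = ["MAAKL", "AAGG", "KLMA"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so neighbouring residues stay paired in every replicate.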

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski