Amino acid dipepetide frequency for Simian immunodeficiency virus (isolate CPZ GAB1) (SIV-cpz) (Chimpanzee immunodeficiency virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.379AlaAla: 3.379 ± 0.96
2.253AlaCys: 2.253 ± 0.775
1.971AlaAsp: 1.971 ± 0.597
6.195AlaGlu: 6.195 ± 1.867
1.971AlaPhe: 1.971 ± 0.397
5.351AlaGly: 5.351 ± 1.386
1.408AlaHis: 1.408 ± 0.587
3.379AlaIle: 3.379 ± 1.244
1.971AlaLys: 1.971 ± 0.639
5.632AlaLeu: 5.632 ± 1.29
1.69AlaMet: 1.69 ± 0.816
2.534AlaAsn: 2.534 ± 0.496
3.379AlaPro: 3.379 ± 1.114
1.408AlaGln: 1.408 ± 0.409
4.787AlaArg: 4.787 ± 1.029
5.914AlaSer: 5.914 ± 1.526
3.098AlaThr: 3.098 ± 0.82
4.787AlaVal: 4.787 ± 1.035
1.69AlaTrp: 1.69 ± 0.805
0.845AlaTyr: 0.845 ± 0.365
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.451
0.282CysCys: 0.282 ± 0.606
0.563CysAsp: 0.563 ± 0.315
0.563CysGlu: 0.563 ± 0.458
1.69CysPhe: 1.69 ± 1.225
1.408CysGly: 1.408 ± 0.544
0.282CysHis: 0.282 ± 0.251
0.845CysIle: 0.845 ± 0.973
1.971CysLys: 1.971 ± 0.713
1.126CysLeu: 1.126 ± 0.79
0.0CysMet: 0.0 ± 0.0
1.69CysAsn: 1.69 ± 1.29
1.126CysPro: 1.126 ± 0.322
1.971CysGln: 1.971 ± 0.819
0.282CysArg: 0.282 ± 0.251
1.126CysSer: 1.126 ± 0.688
1.971CysThr: 1.971 ± 0.574
1.408CysVal: 1.408 ± 0.756
0.845CysTrp: 0.845 ± 0.408
0.845CysTyr: 0.845 ± 1.192
0.0CysXaa: 0.0 ± 0.0
Asp
1.408AspAla: 1.408 ± 0.756
2.253AspCys: 2.253 ± 0.837
1.69AspAsp: 1.69 ± 0.583
0.563AspGlu: 0.563 ± 0.457
1.408AspPhe: 1.408 ± 0.722
0.845AspGly: 0.845 ± 0.597
0.563AspHis: 0.563 ± 0.458
4.224AspIle: 4.224 ± 1.203
3.379AspLys: 3.379 ± 1.07
2.816AspLeu: 2.816 ± 1.512
0.282AspMet: 0.282 ± 0.401
1.408AspAsn: 1.408 ± 0.774
2.816AspPro: 2.816 ± 1.239
2.816AspGln: 2.816 ± 0.786
3.943AspArg: 3.943 ± 1.093
1.971AspSer: 1.971 ± 1.014
2.816AspThr: 2.816 ± 0.82
3.098AspVal: 3.098 ± 0.81
0.563AspTrp: 0.563 ± 0.457
1.126AspTyr: 1.126 ± 0.461
0.0AspXaa: 0.0 ± 0.0
Glu
5.632GluAla: 5.632 ± 0.916
0.0GluCys: 0.0 ± 0.0
2.534GluAsp: 2.534 ± 1.251
6.195GluGlu: 6.195 ± 1.874
0.845GluPhe: 0.845 ± 0.365
7.322GluGly: 7.322 ± 1.57
0.563GluHis: 0.563 ± 0.398
3.098GluIle: 3.098 ± 0.998
4.224GluLys: 4.224 ± 1.116
6.477GluLeu: 6.477 ± 2.018
1.408GluMet: 1.408 ± 0.363
2.253GluAsn: 2.253 ± 0.962
4.506GluPro: 4.506 ± 1.114
4.787GluGln: 4.787 ± 1.412
4.787GluArg: 4.787 ± 1.511
3.098GluSer: 3.098 ± 1.16
3.379GluThr: 3.379 ± 1.354
5.069GluVal: 5.069 ± 1.424
1.126GluTrp: 1.126 ± 0.535
0.845GluTyr: 0.845 ± 0.632
0.0GluXaa: 0.0 ± 0.0
Phe
1.69PheAla: 1.69 ± 0.608
0.282PheCys: 0.282 ± 0.251
1.69PheAsp: 1.69 ± 1.105
0.282PheGlu: 0.282 ± 0.251
1.408PhePhe: 1.408 ± 0.363
1.408PheGly: 1.408 ± 0.473
0.0PheHis: 0.0 ± 0.0
1.408PheIle: 1.408 ± 0.904
0.563PheLys: 0.563 ± 0.422
2.816PheLeu: 2.816 ± 0.646
0.0PheMet: 0.0 ± 0.0
1.971PheAsn: 1.971 ± 0.688
1.69PhePro: 1.69 ± 1.432
1.408PheGln: 1.408 ± 0.455
3.098PheArg: 3.098 ± 1.006
2.253PheSer: 2.253 ± 0.661
1.69PheThr: 1.69 ± 0.674
0.282PheVal: 0.282 ± 0.333
0.282PheTrp: 0.282 ± 0.199
1.69PheTyr: 1.69 ± 0.682
0.0PheXaa: 0.0 ± 0.0
Gly
6.195GlyAla: 6.195 ± 1.816
1.971GlyCys: 1.971 ± 0.598
2.534GlyAsp: 2.534 ± 0.579
4.506GlyGlu: 4.506 ± 0.423
1.69GlyPhe: 1.69 ± 0.782
7.04GlyGly: 7.04 ± 1.407
2.816GlyHis: 2.816 ± 1.297
5.914GlyIle: 5.914 ± 1.532
5.351GlyLys: 5.351 ± 1.365
4.787GlyLeu: 4.787 ± 1.465
0.845GlyMet: 0.845 ± 0.391
4.224GlyAsn: 4.224 ± 1.407
5.351GlyPro: 5.351 ± 1.88
3.943GlyGln: 3.943 ± 1.465
5.069GlyArg: 5.069 ± 1.287
4.787GlySer: 4.787 ± 2.025
2.816GlyThr: 2.816 ± 0.947
2.816GlyVal: 2.816 ± 0.541
1.971GlyTrp: 1.971 ± 0.992
2.253GlyTyr: 2.253 ± 0.88
0.0GlyXaa: 0.0 ± 0.0
His
0.563HisAla: 0.563 ± 0.315
0.845HisCys: 0.845 ± 0.543
0.282HisAsp: 0.282 ± 0.251
0.563HisGlu: 0.563 ± 0.242
1.126HisPhe: 1.126 ± 1.237
2.253HisGly: 2.253 ± 0.77
1.126HisHis: 1.126 ± 0.416
1.126HisIle: 1.126 ± 1.049
1.126HisLys: 1.126 ± 0.559
3.098HisLeu: 3.098 ± 0.553
0.563HisMet: 0.563 ± 0.505
1.126HisAsn: 1.126 ± 0.856
2.253HisPro: 2.253 ± 1.102
2.816HisGln: 2.816 ± 1.161
1.408HisArg: 1.408 ± 1.14
1.971HisSer: 1.971 ± 0.511
1.126HisThr: 1.126 ± 0.42
0.282HisVal: 0.282 ± 0.199
0.0HisTrp: 0.0 ± 0.0
0.563HisTyr: 0.563 ± 0.529
0.0HisXaa: 0.0 ± 0.0
Ile
2.816IleAla: 2.816 ± 0.705
0.845IleCys: 0.845 ± 0.609
1.408IleAsp: 1.408 ± 0.61
3.661IleGlu: 3.661 ± 0.999
1.69IlePhe: 1.69 ± 0.726
3.661IleGly: 3.661 ± 1.394
2.253IleHis: 2.253 ± 1.157
6.759IleIle: 6.759 ± 2.404
3.943IleLys: 3.943 ± 1.025
6.195IleLeu: 6.195 ± 1.227
0.282IleMet: 0.282 ± 0.251
1.971IleAsn: 1.971 ± 0.609
4.506IlePro: 4.506 ± 0.991
2.534IleGln: 2.534 ± 1.193
3.661IleArg: 3.661 ± 0.58
4.224IleSer: 4.224 ± 0.7
3.661IleThr: 3.661 ± 1.951
3.098IleVal: 3.098 ± 1.087
2.816IleTrp: 2.816 ± 1.278
1.971IleTyr: 1.971 ± 0.769
0.0IleXaa: 0.0 ± 0.0
Lys
7.322LysAla: 7.322 ± 0.631
1.408LysCys: 1.408 ± 0.409
2.534LysAsp: 2.534 ± 0.726
6.195LysGlu: 6.195 ± 1.82
0.563LysPhe: 0.563 ± 0.398
4.506LysGly: 4.506 ± 1.253
1.408LysHis: 1.408 ± 0.736
5.069LysIle: 5.069 ± 1.654
5.351LysLys: 5.351 ± 1.319
5.069LysLeu: 5.069 ± 1.165
0.282LysMet: 0.282 ± 0.199
3.379LysAsn: 3.379 ± 0.937
1.408LysPro: 1.408 ± 0.527
3.943LysGln: 3.943 ± 1.259
3.379LysArg: 3.379 ± 0.945
1.69LysSer: 1.69 ± 0.882
3.943LysThr: 3.943 ± 0.852
5.351LysVal: 5.351 ± 1.638
1.69LysTrp: 1.69 ± 0.576
1.126LysTyr: 1.126 ± 0.458
0.0LysXaa: 0.0 ± 0.0
Leu
5.069LeuAla: 5.069 ± 0.58
1.69LeuCys: 1.69 ± 0.539
3.379LeuAsp: 3.379 ± 0.68
7.04LeuGlu: 7.04 ± 1.612
1.971LeuPhe: 1.971 ± 0.783
7.04LeuGly: 7.04 ± 2.011
1.69LeuHis: 1.69 ± 0.573
4.506LeuIle: 4.506 ± 2.406
7.04LeuLys: 7.04 ± 1.204
7.603LeuLeu: 7.603 ± 2.528
1.126LeuMet: 1.126 ± 0.604
3.379LeuAsn: 3.379 ± 0.91
3.661LeuPro: 3.661 ± 0.652
4.224LeuGln: 4.224 ± 0.988
4.506LeuArg: 4.506 ± 1.148
3.661LeuSer: 3.661 ± 0.988
5.069LeuThr: 5.069 ± 1.723
6.759LeuVal: 6.759 ± 1.877
3.661LeuTrp: 3.661 ± 0.969
1.126LeuTyr: 1.126 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
1.408MetAla: 1.408 ± 0.526
0.0MetCys: 0.0 ± 0.0
1.126MetAsp: 1.126 ± 0.732
1.69MetGlu: 1.69 ± 0.795
1.126MetPhe: 1.126 ± 0.446
1.408MetGly: 1.408 ± 0.363
0.563MetHis: 0.563 ± 0.242
0.563MetIle: 0.563 ± 0.458
1.971MetLys: 1.971 ± 0.688
0.563MetLeu: 0.563 ± 0.315
1.126MetMet: 1.126 ± 0.629
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.563MetGln: 0.563 ± 0.315
0.563MetArg: 0.563 ± 0.529
0.563MetSer: 0.563 ± 0.315
2.534MetThr: 2.534 ± 0.496
1.408MetVal: 1.408 ± 0.544
0.282MetTrp: 0.282 ± 0.251
0.563MetTyr: 0.563 ± 0.315
0.0MetXaa: 0.0 ± 0.0
Asn
2.253AsnAla: 2.253 ± 1.258
3.098AsnCys: 3.098 ± 0.756
0.845AsnAsp: 0.845 ± 0.365
3.098AsnGlu: 3.098 ± 0.853
1.69AsnPhe: 1.69 ± 0.589
2.253AsnGly: 2.253 ± 0.845
0.563AsnHis: 0.563 ± 0.926
3.379AsnIle: 3.379 ± 1.743
1.971AsnLys: 1.971 ± 0.597
3.098AsnLeu: 3.098 ± 1.301
1.126AsnMet: 1.126 ± 0.854
3.098AsnAsn: 3.098 ± 1.756
2.816AsnPro: 2.816 ± 0.896
1.971AsnGln: 1.971 ± 0.617
2.816AsnArg: 2.816 ± 0.625
2.253AsnSer: 2.253 ± 1.123
3.661AsnThr: 3.661 ± 0.842
1.971AsnVal: 1.971 ± 1.136
1.408AsnTrp: 1.408 ± 0.455
1.69AsnTyr: 1.69 ± 0.666
0.0AsnXaa: 0.0 ± 0.0
Pro
2.253ProAla: 2.253 ± 0.63
0.845ProCys: 0.845 ± 0.752
2.534ProAsp: 2.534 ± 1.058
3.661ProGlu: 3.661 ± 1.47
1.69ProPhe: 1.69 ± 0.863
5.914ProGly: 5.914 ± 0.995
0.563ProHis: 0.563 ± 0.398
4.506ProIle: 4.506 ± 0.925
3.098ProLys: 3.098 ± 0.716
4.506ProLeu: 4.506 ± 0.783
1.126ProMet: 1.126 ± 0.641
0.845ProAsn: 0.845 ± 0.453
3.379ProPro: 3.379 ± 0.731
3.098ProGln: 3.098 ± 0.891
3.379ProArg: 3.379 ± 1.214
2.534ProSer: 2.534 ± 0.762
3.379ProThr: 3.379 ± 0.842
4.506ProVal: 4.506 ± 1.059
0.845ProTrp: 0.845 ± 0.747
1.126ProTyr: 1.126 ± 0.573
0.0ProXaa: 0.0 ± 0.0
Gln
4.224GlnAla: 4.224 ± 0.747
0.563GlnCys: 0.563 ± 0.442
2.816GlnAsp: 2.816 ± 0.737
5.069GlnGlu: 5.069 ± 1.232
0.845GlnPhe: 0.845 ± 0.659
6.759GlnGly: 6.759 ± 2.037
1.971GlnHis: 1.971 ± 1.252
3.943GlnIle: 3.943 ± 0.963
3.943GlnLys: 3.943 ± 1.309
5.914GlnLeu: 5.914 ± 0.832
2.534GlnMet: 2.534 ± 0.895
3.098GlnAsn: 3.098 ± 1.12
1.408GlnPro: 1.408 ± 1.194
3.098GlnGln: 3.098 ± 1.23
1.69GlnArg: 1.69 ± 0.612
3.098GlnSer: 3.098 ± 0.653
1.408GlnThr: 1.408 ± 0.632
5.069GlnVal: 5.069 ± 1.624
1.408GlnTrp: 1.408 ± 0.536
1.408GlnTyr: 1.408 ± 0.542
0.0GlnXaa: 0.0 ± 0.0
Arg
3.943ArgAla: 3.943 ± 0.984
0.845ArgCys: 0.845 ± 0.463
2.816ArgAsp: 2.816 ± 0.635
5.914ArgGlu: 5.914 ± 1.617
1.69ArgPhe: 1.69 ± 0.944
2.534ArgGly: 2.534 ± 0.603
2.534ArgHis: 2.534 ± 1.613
3.661ArgIle: 3.661 ± 1.61
4.224ArgLys: 4.224 ± 1.666
5.351ArgLeu: 5.351 ± 1.747
0.845ArgMet: 0.845 ± 0.463
2.253ArgAsn: 2.253 ± 0.606
3.098ArgPro: 3.098 ± 1.152
5.351ArgGln: 5.351 ± 1.015
6.477ArgArg: 6.477 ± 3.317
3.098ArgSer: 3.098 ± 1.267
3.098ArgThr: 3.098 ± 1.718
1.971ArgVal: 1.971 ± 0.625
2.253ArgTrp: 2.253 ± 0.967
1.69ArgTyr: 1.69 ± 0.481
0.0ArgXaa: 0.0 ± 0.0
Ser
3.379SerAla: 3.379 ± 1.127
0.282SerCys: 0.282 ± 0.199
2.534SerAsp: 2.534 ± 0.83
3.661SerGlu: 3.661 ± 1.088
0.845SerPhe: 0.845 ± 0.752
3.661SerGly: 3.661 ± 1.032
0.845SerHis: 0.845 ± 0.472
3.379SerIle: 3.379 ± 0.685
3.379SerLys: 3.379 ± 1.707
6.477SerLeu: 6.477 ± 2.55
1.126SerMet: 1.126 ± 0.51
3.379SerAsn: 3.379 ± 1.197
3.098SerPro: 3.098 ± 0.519
4.787SerGln: 4.787 ± 1.209
3.661SerArg: 3.661 ± 1.565
2.253SerSer: 2.253 ± 0.631
2.816SerThr: 2.816 ± 0.757
1.971SerVal: 1.971 ± 0.575
1.126SerTrp: 1.126 ± 0.484
1.126SerTyr: 1.126 ± 0.851
0.0SerXaa: 0.0 ± 0.0
Thr
4.787ThrAla: 4.787 ± 0.744
0.282ThrCys: 0.282 ± 0.251
3.943ThrAsp: 3.943 ± 1.364
3.379ThrGlu: 3.379 ± 0.814
0.563ThrPhe: 0.563 ± 0.501
3.943ThrGly: 3.943 ± 0.71
1.408ThrHis: 1.408 ± 0.409
1.408ThrIle: 1.408 ± 0.587
3.098ThrLys: 3.098 ± 1.083
6.477ThrLeu: 6.477 ± 1.302
0.845ThrMet: 0.845 ± 0.284
2.534ThrAsn: 2.534 ± 0.862
3.379ThrPro: 3.379 ± 1.3
2.534ThrGln: 2.534 ± 0.704
1.971ThrArg: 1.971 ± 1.408
4.224ThrSer: 4.224 ± 1.476
4.506ThrThr: 4.506 ± 0.646
4.787ThrVal: 4.787 ± 1.506
1.408ThrTrp: 1.408 ± 0.662
1.971ThrTyr: 1.971 ± 0.885
0.0ThrXaa: 0.0 ± 0.0
Val
3.379ValAla: 3.379 ± 1.358
0.563ValCys: 0.563 ± 0.612
2.816ValAsp: 2.816 ± 0.977
4.224ValGlu: 4.224 ± 1.008
1.69ValPhe: 1.69 ± 0.475
5.914ValGly: 5.914 ± 1.342
1.971ValHis: 1.971 ± 0.52
3.379ValIle: 3.379 ± 1.377
4.787ValLys: 4.787 ± 1.217
4.506ValLeu: 4.506 ± 1.207
0.563ValMet: 0.563 ± 0.529
2.534ValAsn: 2.534 ± 0.988
3.098ValPro: 3.098 ± 0.943
5.069ValGln: 5.069 ± 1.405
3.943ValArg: 3.943 ± 0.936
2.816ValSer: 2.816 ± 0.646
3.661ValThr: 3.661 ± 1.868
3.943ValVal: 3.943 ± 1.15
2.253ValTrp: 2.253 ± 0.567
2.253ValTyr: 2.253 ± 0.686
0.0ValXaa: 0.0 ± 0.0
Trp
2.534TrpAla: 2.534 ± 0.479
0.563TrpCys: 0.563 ± 0.535
1.408TrpAsp: 1.408 ± 0.649
1.408TrpGlu: 1.408 ± 0.734
0.563TrpPhe: 0.563 ± 0.529
2.253TrpGly: 2.253 ± 1.06
0.563TrpHis: 0.563 ± 0.529
0.563TrpIle: 0.563 ± 0.242
1.971TrpLys: 1.971 ± 0.875
0.845TrpLeu: 0.845 ± 0.684
1.126TrpMet: 1.126 ± 0.322
1.971TrpAsn: 1.971 ± 1.468
1.408TrpPro: 1.408 ± 0.363
1.971TrpGln: 1.971 ± 0.935
2.253TrpArg: 2.253 ± 0.606
0.563TrpSer: 0.563 ± 0.457
1.408TrpThr: 1.408 ± 0.734
2.534TrpVal: 2.534 ± 0.723
0.563TrpTrp: 0.563 ± 0.398
0.563TrpTyr: 0.563 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.563TyrAla: 0.563 ± 0.242
1.971TyrCys: 1.971 ± 1.218
0.845TyrAsp: 0.845 ± 0.659
0.282TyrGlu: 0.282 ± 0.199
0.845TyrPhe: 0.845 ± 0.509
1.408TyrGly: 1.408 ± 0.722
1.126TyrHis: 1.126 ± 0.917
0.845TyrIle: 0.845 ± 0.426
2.253TyrLys: 2.253 ± 1.051
0.845TyrLeu: 0.845 ± 0.378
0.845TyrMet: 0.845 ± 0.408
1.408TyrAsn: 1.408 ± 0.563
1.408TyrPro: 1.408 ± 0.628
1.971TyrGln: 1.971 ± 0.497
1.971TyrArg: 1.971 ± 1.022
1.69TyrSer: 1.69 ± 0.811
1.408TyrThr: 1.408 ± 0.482
2.253TyrVal: 2.253 ± 0.68
0.845TyrTrp: 0.845 ± 0.453
1.408TyrTyr: 1.408 ± 0.528
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3552 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski