Amino acid dipepetide frequency for Simian immunodeficiency virus (isolate EK505) (SIV-cpz) (Chimpanzee immunodeficiency virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.774AlaAla: 4.774 ± 1.428
2.247AlaCys: 2.247 ± 0.48
0.562AlaAsp: 0.562 ± 0.412
7.02AlaGlu: 7.02 ± 1.774
1.404AlaPhe: 1.404 ± 0.268
4.212AlaGly: 4.212 ± 0.641
1.404AlaHis: 1.404 ± 0.378
3.651AlaIle: 3.651 ± 1.458
2.247AlaLys: 2.247 ± 0.689
5.897AlaLeu: 5.897 ± 0.96
1.404AlaMet: 1.404 ± 0.591
3.089AlaAsn: 3.089 ± 0.911
2.527AlaPro: 2.527 ± 0.731
2.527AlaGln: 2.527 ± 0.703
4.212AlaArg: 4.212 ± 1.142
5.055AlaSer: 5.055 ± 1.173
3.37AlaThr: 3.37 ± 0.716
4.493AlaVal: 4.493 ± 0.936
1.966AlaTrp: 1.966 ± 0.518
1.404AlaTyr: 1.404 ± 0.525
0.0AlaXaa: 0.0 ± 0.0
Cys
1.123CysAla: 1.123 ± 0.393
0.562CysCys: 0.562 ± 0.48
0.281CysAsp: 0.281 ± 0.175
0.562CysGlu: 0.562 ± 0.469
1.404CysPhe: 1.404 ± 0.775
1.404CysGly: 1.404 ± 0.448
0.281CysHis: 0.281 ± 0.459
0.0CysIle: 0.0 ± 0.0
1.685CysLys: 1.685 ± 0.624
0.842CysLeu: 0.842 ± 0.679
0.281CysMet: 0.281 ± 0.303
1.966CysAsn: 1.966 ± 0.83
0.842CysPro: 0.842 ± 0.307
1.404CysGln: 1.404 ± 0.585
0.842CysArg: 0.842 ± 0.243
1.123CysSer: 1.123 ± 0.624
1.404CysThr: 1.404 ± 0.525
2.247CysVal: 2.247 ± 0.701
0.842CysTrp: 0.842 ± 0.352
0.562CysTyr: 0.562 ± 0.48
0.0CysXaa: 0.0 ± 0.0
Asp
1.685AspAla: 1.685 ± 0.613
1.966AspCys: 1.966 ± 0.531
0.562AspAsp: 0.562 ± 0.35
1.123AspGlu: 1.123 ± 0.476
1.404AspPhe: 1.404 ± 0.525
1.404AspGly: 1.404 ± 0.727
0.562AspHis: 0.562 ± 0.412
3.931AspIle: 3.931 ± 1.087
2.808AspLys: 2.808 ± 0.936
3.089AspLeu: 3.089 ± 1.272
0.842AspMet: 0.842 ± 0.409
2.247AspAsn: 2.247 ± 0.673
1.966AspPro: 1.966 ± 1.348
2.808AspGln: 2.808 ± 0.909
4.212AspArg: 4.212 ± 1.234
1.685AspSer: 1.685 ± 0.926
3.089AspThr: 3.089 ± 0.799
0.281AspVal: 0.281 ± 0.175
0.562AspTrp: 0.562 ± 0.296
1.123AspTyr: 1.123 ± 0.287
0.0AspXaa: 0.0 ± 0.0
Glu
5.616GluAla: 5.616 ± 1.14
0.281GluCys: 0.281 ± 0.226
3.089GluAsp: 3.089 ± 1.513
5.897GluGlu: 5.897 ± 2.08
1.123GluPhe: 1.123 ± 0.334
5.616GluGly: 5.616 ± 1.499
0.842GluHis: 0.842 ± 0.534
2.808GluIle: 2.808 ± 1.21
5.055GluLys: 5.055 ± 1.406
8.144GluLeu: 8.144 ± 1.155
1.404GluMet: 1.404 ± 0.394
1.685GluAsn: 1.685 ± 0.431
5.616GluPro: 5.616 ± 1.303
4.774GluGln: 4.774 ± 0.807
4.493GluArg: 4.493 ± 0.526
3.089GluSer: 3.089 ± 0.791
4.774GluThr: 4.774 ± 1.305
5.055GluVal: 5.055 ± 0.851
1.123GluTrp: 1.123 ± 0.635
0.562GluTyr: 0.562 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
1.123PheAla: 1.123 ± 0.287
0.281PheCys: 0.281 ± 0.226
1.404PheAsp: 1.404 ± 0.837
0.281PheGlu: 0.281 ± 0.226
0.842PhePhe: 0.842 ± 0.307
1.404PheGly: 1.404 ± 0.417
0.0PheHis: 0.0 ± 0.0
1.966PheIle: 1.966 ± 0.595
1.123PheLys: 1.123 ± 0.352
2.527PheLeu: 2.527 ± 0.686
1.123PheMet: 1.123 ± 0.516
2.527PheAsn: 2.527 ± 0.703
1.966PhePro: 1.966 ± 0.798
0.281PheGln: 0.281 ± 0.175
2.527PheArg: 2.527 ± 1.056
0.281PheSer: 0.281 ± 0.175
1.685PheThr: 1.685 ± 0.584
0.562PheVal: 0.562 ± 0.35
0.281PheTrp: 0.281 ± 0.175
2.247PheTyr: 2.247 ± 0.753
0.0PheXaa: 0.0 ± 0.0
Gly
5.897GlyAla: 5.897 ± 1.169
2.247GlyCys: 2.247 ± 0.558
3.931GlyAsp: 3.931 ± 0.703
3.651GlyGlu: 3.651 ± 0.912
2.247GlyPhe: 2.247 ± 0.639
5.616GlyGly: 5.616 ± 1.178
2.808GlyHis: 2.808 ± 1.243
7.02GlyIle: 7.02 ± 1.257
5.616GlyLys: 5.616 ± 1.523
6.178GlyLeu: 6.178 ± 1.415
0.842GlyMet: 0.842 ± 0.307
2.808GlyAsn: 2.808 ± 0.893
4.774GlyPro: 4.774 ± 0.87
4.493GlyGln: 4.493 ± 1.268
3.089GlyArg: 3.089 ± 0.57
3.931GlySer: 3.931 ± 0.946
2.808GlyThr: 2.808 ± 1.509
3.37GlyVal: 3.37 ± 1.286
1.123GlyTrp: 1.123 ± 0.698
1.404GlyTyr: 1.404 ± 0.408
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.562HisCys: 0.562 ± 0.917
0.0HisAsp: 0.0 ± 0.0
0.281HisGlu: 0.281 ± 0.175
0.281HisPhe: 0.281 ± 0.454
1.685HisGly: 1.685 ± 0.595
0.562HisHis: 0.562 ± 0.89
0.562HisIle: 0.562 ± 0.469
1.685HisLys: 1.685 ± 0.423
2.808HisLeu: 2.808 ± 0.87
0.842HisMet: 0.842 ± 0.475
1.123HisAsn: 1.123 ± 0.287
2.247HisPro: 2.247 ± 0.979
2.527HisGln: 2.527 ± 0.937
1.123HisArg: 1.123 ± 0.456
1.123HisSer: 1.123 ± 0.398
1.685HisThr: 1.685 ± 0.431
0.562HisVal: 0.562 ± 0.218
0.0HisTrp: 0.0 ± 0.0
0.562HisTyr: 0.562 ± 0.653
0.0HisXaa: 0.0 ± 0.0
Ile
1.685IleAla: 1.685 ± 0.423
0.562IleCys: 0.562 ± 0.218
1.685IleAsp: 1.685 ± 0.747
3.37IleGlu: 3.37 ± 0.783
1.404IlePhe: 1.404 ± 0.431
5.055IleGly: 5.055 ± 1.246
2.247IleHis: 2.247 ± 0.814
6.178IleIle: 6.178 ± 2.299
6.178IleLys: 6.178 ± 0.658
5.055IleLeu: 5.055 ± 0.98
0.562IleMet: 0.562 ± 0.453
2.808IleAsn: 2.808 ± 1.409
5.336IlePro: 5.336 ± 1.21
2.247IleGln: 2.247 ± 1.399
5.055IleArg: 5.055 ± 1.162
3.931IleSer: 3.931 ± 0.814
2.527IleThr: 2.527 ± 0.99
4.493IleVal: 4.493 ± 1.15
1.123IleTrp: 1.123 ± 0.624
3.931IleTyr: 3.931 ± 0.58
0.0IleXaa: 0.0 ± 0.0
Lys
5.336LysAla: 5.336 ± 1.604
1.966LysCys: 1.966 ± 0.531
1.685LysAsp: 1.685 ± 0.423
7.02LysGlu: 7.02 ± 1.98
0.562LysPhe: 0.562 ± 0.35
4.774LysGly: 4.774 ± 0.839
1.123LysHis: 1.123 ± 0.473
5.336LysIle: 5.336 ± 0.683
5.897LysLys: 5.897 ± 1.531
5.897LysLeu: 5.897 ± 1.49
1.123LysMet: 1.123 ± 0.492
3.089LysAsn: 3.089 ± 0.618
1.123LysPro: 1.123 ± 0.437
3.37LysGln: 3.37 ± 0.62
1.966LysArg: 1.966 ± 0.623
2.247LysSer: 2.247 ± 0.534
3.931LysThr: 3.931 ± 1.3
5.055LysVal: 5.055 ± 1.008
1.123LysTrp: 1.123 ± 0.473
1.685LysTyr: 1.685 ± 0.439
0.0LysXaa: 0.0 ± 0.0
Leu
6.178LeuAla: 6.178 ± 1.143
0.562LeuCys: 0.562 ± 0.218
4.493LeuAsp: 4.493 ± 0.766
7.582LeuGlu: 7.582 ± 0.96
2.808LeuPhe: 2.808 ± 0.896
5.616LeuGly: 5.616 ± 1.87
1.685LeuHis: 1.685 ± 0.769
3.931LeuIle: 3.931 ± 1.586
4.774LeuLys: 4.774 ± 1.429
7.582LeuLeu: 7.582 ± 2.541
1.966LeuMet: 1.966 ± 0.772
4.493LeuAsn: 4.493 ± 0.937
2.527LeuPro: 2.527 ± 1.479
3.651LeuGln: 3.651 ± 0.733
6.74LeuArg: 6.74 ± 1.159
5.336LeuSer: 5.336 ± 1.858
4.493LeuThr: 4.493 ± 0.906
6.459LeuVal: 6.459 ± 1.111
3.931LeuTrp: 3.931 ± 1.577
2.247LeuTyr: 2.247 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
1.966MetAla: 1.966 ± 0.77
0.281MetCys: 0.281 ± 0.313
1.123MetAsp: 1.123 ± 0.492
1.685MetGlu: 1.685 ± 0.915
0.562MetPhe: 0.562 ± 0.258
2.247MetGly: 2.247 ± 0.417
0.0MetHis: 0.0 ± 0.0
0.562MetIle: 0.562 ± 0.218
1.966MetLys: 1.966 ± 0.689
1.404MetLeu: 1.404 ± 0.578
1.123MetMet: 1.123 ± 0.516
0.281MetAsn: 0.281 ± 0.175
0.281MetPro: 0.281 ± 0.445
1.966MetGln: 1.966 ± 0.613
0.842MetArg: 0.842 ± 0.481
0.281MetSer: 0.281 ± 0.226
3.651MetThr: 3.651 ± 0.809
1.685MetVal: 1.685 ± 0.651
0.281MetTrp: 0.281 ± 0.226
0.842MetTyr: 0.842 ± 0.475
0.0MetXaa: 0.0 ± 0.0
Asn
1.685AsnAla: 1.685 ± 0.774
3.37AsnCys: 3.37 ± 0.814
1.404AsnAsp: 1.404 ± 0.983
2.808AsnGlu: 2.808 ± 0.844
3.089AsnPhe: 3.089 ± 1.327
2.247AsnGly: 2.247 ± 0.741
0.562AsnHis: 0.562 ± 0.48
4.212AsnIle: 4.212 ± 1.491
1.966AsnLys: 1.966 ± 0.479
2.527AsnLeu: 2.527 ± 1.19
0.562AsnMet: 0.562 ± 0.453
5.055AsnAsn: 5.055 ± 1.849
4.774AsnPro: 4.774 ± 0.973
1.685AsnGln: 1.685 ± 0.614
2.527AsnArg: 2.527 ± 0.823
1.966AsnSer: 1.966 ± 1.065
4.774AsnThr: 4.774 ± 0.809
1.966AsnVal: 1.966 ± 0.802
2.247AsnTrp: 2.247 ± 0.487
1.123AsnTyr: 1.123 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
3.651ProAla: 3.651 ± 1.13
0.281ProCys: 0.281 ± 0.226
3.37ProAsp: 3.37 ± 0.933
5.055ProGlu: 5.055 ± 1.204
0.562ProPhe: 0.562 ± 0.35
6.178ProGly: 6.178 ± 0.76
0.281ProHis: 0.281 ± 0.175
4.212ProIle: 4.212 ± 1.188
2.527ProLys: 2.527 ± 1.268
5.897ProLeu: 5.897 ± 1.879
1.123ProMet: 1.123 ± 0.586
1.123ProAsn: 1.123 ± 0.804
4.212ProPro: 4.212 ± 0.766
3.651ProGln: 3.651 ± 1.519
2.808ProArg: 2.808 ± 0.767
1.404ProSer: 1.404 ± 0.593
1.966ProThr: 1.966 ± 0.568
4.493ProVal: 4.493 ± 0.723
1.123ProTrp: 1.123 ± 0.93
1.123ProTyr: 1.123 ± 0.812
0.0ProXaa: 0.0 ± 0.0
Gln
6.459GlnAla: 6.459 ± 0.728
0.562GlnCys: 0.562 ± 0.453
3.651GlnAsp: 3.651 ± 0.9
4.774GlnGlu: 4.774 ± 0.76
0.281GlnPhe: 0.281 ± 0.175
5.336GlnGly: 5.336 ± 0.929
1.123GlnHis: 1.123 ± 0.557
4.493GlnIle: 4.493 ± 1.079
3.931GlnLys: 3.931 ± 1.052
5.897GlnLeu: 5.897 ± 1.158
2.808GlnMet: 2.808 ± 0.934
3.931GlnAsn: 3.931 ± 0.896
1.404GlnPro: 1.404 ± 0.851
2.808GlnGln: 2.808 ± 0.933
3.089GlnArg: 3.089 ± 1.815
1.685GlnSer: 1.685 ± 0.663
2.527GlnThr: 2.527 ± 0.485
3.37GlnVal: 3.37 ± 1.335
1.404GlnTrp: 1.404 ± 0.408
1.966GlnTyr: 1.966 ± 0.558
0.0GlnXaa: 0.0 ± 0.0
Arg
4.212ArgAla: 4.212 ± 1.654
0.0ArgCys: 0.0 ± 0.0
2.527ArgAsp: 2.527 ± 0.515
6.459ArgGlu: 6.459 ± 1.764
1.685ArgPhe: 1.685 ± 0.417
3.931ArgGly: 3.931 ± 0.761
1.123ArgHis: 1.123 ± 0.991
4.212ArgIle: 4.212 ± 1.989
4.774ArgLys: 4.774 ± 1.073
1.966ArgLeu: 1.966 ± 0.745
1.123ArgMet: 1.123 ± 0.52
2.808ArgAsn: 2.808 ± 0.527
3.089ArgPro: 3.089 ± 1.327
5.897ArgGln: 5.897 ± 1.981
3.931ArgArg: 3.931 ± 2.693
3.089ArgSer: 3.089 ± 0.78
2.808ArgThr: 2.808 ± 1.078
2.808ArgVal: 2.808 ± 0.544
1.685ArgTrp: 1.685 ± 0.908
1.123ArgTyr: 1.123 ± 0.334
0.0ArgXaa: 0.0 ± 0.0
Ser
0.842SerAla: 0.842 ± 0.397
1.123SerCys: 1.123 ± 0.476
1.966SerAsp: 1.966 ± 0.472
3.089SerGlu: 3.089 ± 0.88
0.842SerPhe: 0.842 ± 0.53
4.212SerGly: 4.212 ± 1.169
0.842SerHis: 0.842 ± 0.683
3.651SerIle: 3.651 ± 1.026
1.123SerLys: 1.123 ± 0.6
5.897SerLeu: 5.897 ± 1.746
1.123SerMet: 1.123 ± 0.605
2.247SerAsn: 2.247 ± 0.679
2.808SerPro: 2.808 ± 0.759
3.37SerGln: 3.37 ± 1.341
2.247SerArg: 2.247 ± 0.957
2.808SerSer: 2.808 ± 1.056
3.37SerThr: 3.37 ± 0.678
3.651SerVal: 3.651 ± 0.825
0.562SerTrp: 0.562 ± 0.218
0.842SerTyr: 0.842 ± 0.545
0.0SerXaa: 0.0 ± 0.0
Thr
5.055ThrAla: 5.055 ± 1.04
0.0ThrCys: 0.0 ± 0.0
2.247ThrAsp: 2.247 ± 0.645
5.336ThrGlu: 5.336 ± 0.744
1.404ThrPhe: 1.404 ± 0.417
3.931ThrGly: 3.931 ± 0.773
0.842ThrHis: 0.842 ± 0.325
2.247ThrIle: 2.247 ± 0.449
3.089ThrLys: 3.089 ± 0.666
6.74ThrLeu: 6.74 ± 0.813
1.404ThrMet: 1.404 ± 0.393
2.527ThrAsn: 2.527 ± 1.209
3.37ThrPro: 3.37 ± 0.817
4.493ThrGln: 4.493 ± 0.779
2.527ThrArg: 2.527 ± 1.239
2.808ThrSer: 2.808 ± 0.626
4.774ThrThr: 4.774 ± 1.196
3.931ThrVal: 3.931 ± 1.242
2.808ThrTrp: 2.808 ± 0.59
1.404ThrTyr: 1.404 ± 0.681
0.0ThrXaa: 0.0 ± 0.0
Val
3.651ValAla: 3.651 ± 1.136
0.842ValCys: 0.842 ± 0.575
1.966ValAsp: 1.966 ± 0.568
1.966ValGlu: 1.966 ± 0.802
0.842ValPhe: 0.842 ± 0.352
5.336ValGly: 5.336 ± 0.936
2.527ValHis: 2.527 ± 0.852
5.336ValIle: 5.336 ± 0.728
4.212ValLys: 4.212 ± 1.293
5.616ValLeu: 5.616 ± 0.969
0.562ValMet: 0.562 ± 0.726
3.089ValAsn: 3.089 ± 0.897
2.808ValPro: 2.808 ± 0.607
5.616ValGln: 5.616 ± 1.244
2.527ValArg: 2.527 ± 0.603
2.247ValSer: 2.247 ± 0.867
4.212ValThr: 4.212 ± 1.554
2.808ValVal: 2.808 ± 0.559
3.37ValTrp: 3.37 ± 0.955
1.685ValTyr: 1.685 ± 0.824
0.0ValXaa: 0.0 ± 0.0
Trp
1.966TrpAla: 1.966 ± 0.557
0.281TrpCys: 0.281 ± 0.313
0.842TrpAsp: 0.842 ± 0.243
2.527TrpGlu: 2.527 ± 0.495
0.562TrpPhe: 0.562 ± 0.469
2.247TrpGly: 2.247 ± 0.998
0.281TrpHis: 0.281 ± 0.445
0.842TrpIle: 0.842 ± 0.352
2.247TrpLys: 2.247 ± 0.886
1.123TrpLeu: 1.123 ± 0.704
1.404TrpMet: 1.404 ± 0.591
1.685TrpAsn: 1.685 ± 1.227
1.123TrpPro: 1.123 ± 0.352
1.966TrpGln: 1.966 ± 0.761
1.966TrpArg: 1.966 ± 0.846
1.123TrpSer: 1.123 ± 0.744
1.404TrpThr: 1.404 ± 0.636
2.247TrpVal: 2.247 ± 0.576
0.562TrpTrp: 0.562 ± 0.35
0.562TrpTyr: 0.562 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.562TyrAla: 0.562 ± 0.218
1.685TyrCys: 1.685 ± 0.647
0.562TyrAsp: 0.562 ± 0.218
0.562TyrGlu: 0.562 ± 0.412
1.123TyrPhe: 1.123 ± 0.575
1.404TyrGly: 1.404 ± 0.653
1.123TyrHis: 1.123 ± 0.465
0.281TyrIle: 0.281 ± 0.226
1.404TyrLys: 1.404 ± 0.417
1.966TyrLeu: 1.966 ± 0.42
1.123TyrMet: 1.123 ± 0.287
1.966TyrAsn: 1.966 ± 0.849
2.247TyrPro: 2.247 ± 0.708
2.247TyrGln: 2.247 ± 0.793
2.527TyrArg: 2.527 ± 1.409
1.404TyrSer: 1.404 ± 0.593
1.685TyrThr: 1.685 ± 0.603
1.685TyrVal: 1.685 ± 0.636
0.842TyrTrp: 0.842 ± 0.397
1.685TyrTyr: 1.685 ± 0.489
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3562 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski