Amino acid dipepetide frequency for Simian retrovirus Y

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.432AlaAla: 7.432 ± 3.1
0.826AlaCys: 0.826 ± 0.422
2.89AlaAsp: 2.89 ± 0.86
2.064AlaGlu: 2.064 ± 0.563
2.064AlaPhe: 2.064 ± 0.488
5.367AlaGly: 5.367 ± 1.002
2.064AlaHis: 2.064 ± 1.496
4.542AlaIle: 4.542 ± 1.014
2.477AlaLys: 2.477 ± 1.009
8.258AlaLeu: 8.258 ± 2.254
0.826AlaMet: 0.826 ± 0.332
2.89AlaAsn: 2.89 ± 0.737
3.716AlaPro: 3.716 ± 1.284
5.78AlaGln: 5.78 ± 1.332
4.129AlaArg: 4.129 ± 0.453
4.955AlaSer: 4.955 ± 1.092
4.542AlaThr: 4.542 ± 1.145
2.89AlaVal: 2.89 ± 1.128
1.239AlaTrp: 1.239 ± 0.593
2.89AlaTyr: 2.89 ± 0.829
0.0AlaXaa: 0.0 ± 0.0
Cys
2.477CysAla: 2.477 ± 1.778
0.413CysCys: 0.413 ± 0.378
0.826CysAsp: 0.826 ± 0.755
0.413CysGlu: 0.413 ± 0.378
2.064CysPhe: 2.064 ± 0.488
0.826CysGly: 0.826 ± 0.422
0.0CysHis: 0.0 ± 0.0
0.413CysIle: 0.413 ± 0.299
2.064CysLys: 2.064 ± 0.796
1.652CysLeu: 1.652 ± 1.511
0.0CysMet: 0.0 ± 0.0
0.413CysAsn: 0.413 ± 0.299
1.652CysPro: 1.652 ± 0.648
1.652CysGln: 1.652 ± 0.665
0.413CysArg: 0.413 ± 0.332
1.652CysSer: 1.652 ± 0.351
0.413CysThr: 0.413 ± 0.378
1.239CysVal: 1.239 ± 0.551
0.826CysTrp: 0.826 ± 0.755
1.239CysTyr: 1.239 ± 0.551
0.0CysXaa: 0.0 ± 0.0
Asp
2.477AspAla: 2.477 ± 0.732
1.652AspCys: 1.652 ± 0.497
4.129AspAsp: 4.129 ± 1.334
0.826AspGlu: 0.826 ± 0.664
2.477AspPhe: 2.477 ± 1.186
2.477AspGly: 2.477 ± 1.011
1.239AspHis: 1.239 ± 0.659
4.542AspIle: 4.542 ± 0.311
2.477AspLys: 2.477 ± 0.953
7.432AspLeu: 7.432 ± 1.252
0.413AspMet: 0.413 ± 0.379
4.129AspAsn: 4.129 ± 0.846
5.367AspPro: 5.367 ± 1.756
3.303AspGln: 3.303 ± 1.104
1.239AspArg: 1.239 ± 0.505
2.477AspSer: 2.477 ± 0.671
2.477AspThr: 2.477 ± 0.469
2.064AspVal: 2.064 ± 1.092
2.064AspTrp: 2.064 ± 0.896
1.239AspTyr: 1.239 ± 0.794
0.0AspXaa: 0.0 ± 0.0
Glu
3.303GluAla: 3.303 ± 1.157
0.413GluCys: 0.413 ± 0.332
1.652GluAsp: 1.652 ± 1.097
2.477GluGlu: 2.477 ± 1.588
1.239GluPhe: 1.239 ± 0.593
1.239GluGly: 1.239 ± 0.794
0.0GluHis: 0.0 ± 0.0
0.826GluIle: 0.826 ± 0.621
4.129GluLys: 4.129 ± 0.886
5.367GluLeu: 5.367 ± 1.151
0.826GluMet: 0.826 ± 0.621
1.652GluAsn: 1.652 ± 1.05
2.477GluPro: 2.477 ± 0.732
2.064GluGln: 2.064 ± 0.649
1.652GluArg: 1.652 ± 0.665
1.652GluSer: 1.652 ± 0.497
3.303GluThr: 3.303 ± 1.086
2.064GluVal: 2.064 ± 1.056
0.413GluTrp: 0.413 ± 0.299
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.064PheAla: 2.064 ± 0.488
0.826PheCys: 0.826 ± 0.598
2.064PheAsp: 2.064 ± 0.896
0.413PheGlu: 0.413 ± 0.332
1.239PhePhe: 1.239 ± 0.593
2.89PheGly: 2.89 ± 0.786
0.413PheHis: 0.413 ± 0.299
3.303PheIle: 3.303 ± 1.213
1.652PheLys: 1.652 ± 0.805
2.89PheLeu: 2.89 ± 0.871
0.413PheMet: 0.413 ± 0.241
2.064PheAsn: 2.064 ± 1.406
4.129PhePro: 4.129 ± 0.583
2.064PheGln: 2.064 ± 0.882
0.413PheArg: 0.413 ± 0.589
2.89PheSer: 2.89 ± 0.598
3.303PheThr: 3.303 ± 0.627
2.064PheVal: 2.064 ± 0.902
1.239PheTrp: 1.239 ± 0.551
0.413PheTyr: 0.413 ± 0.378
0.0PheXaa: 0.0 ± 0.0
Gly
3.303GlyAla: 3.303 ± 0.859
1.239GlyCys: 1.239 ± 0.729
3.303GlyAsp: 3.303 ± 0.451
2.064GlyGlu: 2.064 ± 0.728
2.89GlyPhe: 2.89 ± 0.242
3.303GlyGly: 3.303 ± 1.44
1.239GlyHis: 1.239 ± 0.538
4.955GlyIle: 4.955 ± 2.217
4.955GlyLys: 4.955 ± 1.797
7.432GlyLeu: 7.432 ± 3.201
0.413GlyMet: 0.413 ± 0.299
4.542GlyAsn: 4.542 ± 1.399
5.78GlyPro: 5.78 ± 1.484
4.955GlyGln: 4.955 ± 1.862
2.477GlyArg: 2.477 ± 0.835
3.716GlySer: 3.716 ± 0.566
2.89GlyThr: 2.89 ± 0.806
2.477GlyVal: 2.477 ± 1.186
0.826GlyTrp: 0.826 ± 0.664
2.064GlyTyr: 2.064 ± 0.286
0.0GlyXaa: 0.0 ± 0.0
His
1.239HisAla: 1.239 ± 0.551
0.413HisCys: 0.413 ± 0.299
0.413HisAsp: 0.413 ± 0.332
0.413HisGlu: 0.413 ± 0.299
0.413HisPhe: 0.413 ± 0.332
0.826HisGly: 0.826 ± 0.366
0.826HisHis: 0.826 ± 0.332
1.652HisIle: 1.652 ± 0.808
0.413HisLys: 0.413 ± 0.378
3.303HisLeu: 3.303 ± 1.135
0.413HisMet: 0.413 ± 0.378
1.652HisAsn: 1.652 ± 0.844
1.239HisPro: 1.239 ± 0.283
1.652HisGln: 1.652 ± 0.954
1.652HisArg: 1.652 ± 0.497
1.652HisSer: 1.652 ± 0.808
1.652HisThr: 1.652 ± 0.551
2.064HisVal: 2.064 ± 1.089
0.826HisTrp: 0.826 ± 0.664
1.652HisTyr: 1.652 ± 0.808
0.0HisXaa: 0.0 ± 0.0
Ile
4.129IleAla: 4.129 ± 2.102
1.652IleCys: 1.652 ± 1.038
4.129IleAsp: 4.129 ± 1.305
1.652IleGlu: 1.652 ± 0.401
2.477IlePhe: 2.477 ± 0.64
2.064IleGly: 2.064 ± 0.286
1.652IleHis: 1.652 ± 0.731
5.367IleIle: 5.367 ± 2.515
8.258IleLys: 8.258 ± 1.634
4.129IleLeu: 4.129 ± 1.427
1.239IleMet: 1.239 ± 1.04
2.064IleAsn: 2.064 ± 1.496
4.129IlePro: 4.129 ± 1.504
3.303IleGln: 3.303 ± 0.991
1.652IleArg: 1.652 ± 0.665
2.89IleSer: 2.89 ± 0.364
4.955IleThr: 4.955 ± 1.612
3.716IleVal: 3.716 ± 0.968
1.239IleTrp: 1.239 ± 0.551
1.239IleTyr: 1.239 ± 1.111
0.0IleXaa: 0.0 ± 0.0
Lys
4.542LysAla: 4.542 ± 1.399
0.826LysCys: 0.826 ± 0.422
3.716LysAsp: 3.716 ± 1.146
3.716LysGlu: 3.716 ± 1.182
1.652LysPhe: 1.652 ± 0.805
4.542LysGly: 4.542 ± 2.004
2.477LysHis: 2.477 ± 0.64
4.129LysIle: 4.129 ± 0.835
5.78LysLys: 5.78 ± 0.72
5.78LysLeu: 5.78 ± 1.682
0.0LysMet: 0.0 ± 0.0
2.477LysAsn: 2.477 ± 0.72
2.064LysPro: 2.064 ± 0.928
3.716LysGln: 3.716 ± 0.254
4.129LysArg: 4.129 ± 0.826
5.367LysSer: 5.367 ± 0.32
4.542LysThr: 4.542 ± 1.838
2.89LysVal: 2.89 ± 0.86
1.239LysTrp: 1.239 ± 0.551
1.652LysTyr: 1.652 ± 0.561
0.0LysXaa: 0.0 ± 0.0
Leu
9.909LeuAla: 9.909 ± 2.117
2.89LeuCys: 2.89 ± 0.687
4.542LeuAsp: 4.542 ± 0.717
3.716LeuGlu: 3.716 ± 0.849
3.303LeuPhe: 3.303 ± 1.157
5.367LeuGly: 5.367 ± 2.002
2.477LeuHis: 2.477 ± 1.379
9.083LeuIle: 9.083 ± 0.488
5.78LeuLys: 5.78 ± 1.976
10.735LeuLeu: 10.735 ± 2.403
1.239LeuMet: 1.239 ± 0.679
4.129LeuAsn: 4.129 ± 1.169
7.432LeuPro: 7.432 ± 0.898
7.432LeuGln: 7.432 ± 1.522
2.477LeuArg: 2.477 ± 0.968
5.367LeuSer: 5.367 ± 1.509
7.845LeuThr: 7.845 ± 1.282
5.78LeuVal: 5.78 ± 1.689
1.652LeuTrp: 1.652 ± 1.732
0.826LeuTyr: 0.826 ± 0.422
0.0LeuXaa: 0.0 ± 0.0
Met
1.652MetAla: 1.652 ± 0.648
0.413MetCys: 0.413 ± 0.589
1.239MetAsp: 1.239 ± 0.679
0.413MetGlu: 0.413 ± 0.378
0.413MetPhe: 0.413 ± 0.299
1.652MetGly: 1.652 ± 0.351
0.0MetHis: 0.0 ± 0.0
0.413MetIle: 0.413 ± 0.299
1.239MetLys: 1.239 ± 1.111
1.239MetLeu: 1.239 ± 0.507
0.413MetMet: 0.413 ± 0.589
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.413MetGln: 0.413 ± 0.332
0.0MetArg: 0.0 ± 0.0
0.826MetSer: 0.826 ± 0.755
0.0MetThr: 0.0 ± 0.0
0.826MetVal: 0.826 ± 0.598
0.413MetTrp: 0.413 ± 0.299
0.413MetTyr: 0.413 ± 0.299
0.0MetXaa: 0.0 ± 0.0
Asn
2.477AsnAla: 2.477 ± 0.732
2.064AsnCys: 2.064 ± 0.654
2.89AsnAsp: 2.89 ± 0.786
1.652AsnGlu: 1.652 ± 0.721
2.477AsnPhe: 2.477 ± 0.748
3.716AsnGly: 3.716 ± 1.189
0.826AsnHis: 0.826 ± 0.598
2.477AsnIle: 2.477 ± 0.732
3.716AsnLys: 3.716 ± 0.772
4.955AsnLeu: 4.955 ± 1.687
0.413AsnMet: 0.413 ± 0.299
4.542AsnAsn: 4.542 ± 1.571
4.955AsnPro: 4.955 ± 1.246
1.652AsnGln: 1.652 ± 0.401
1.239AsnArg: 1.239 ± 0.68
2.477AsnSer: 2.477 ± 1.102
4.129AsnThr: 4.129 ± 0.967
1.239AsnVal: 1.239 ± 1.133
2.064AsnTrp: 2.064 ± 0.459
0.826AsnTyr: 0.826 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
1.239ProAla: 1.239 ± 0.794
1.239ProCys: 1.239 ± 0.68
4.129ProAsp: 4.129 ± 1.207
2.89ProGlu: 2.89 ± 1.048
2.89ProPhe: 2.89 ± 1.185
4.955ProGly: 4.955 ± 1.862
2.064ProHis: 2.064 ± 0.796
2.89ProIle: 2.89 ± 1.014
3.716ProLys: 3.716 ± 1.302
6.606ProLeu: 6.606 ± 1.644
0.826ProMet: 0.826 ± 0.598
4.129ProAsn: 4.129 ± 1.998
4.955ProPro: 4.955 ± 1.909
4.955ProGln: 4.955 ± 0.949
3.303ProArg: 3.303 ± 0.509
5.367ProSer: 5.367 ± 1.072
5.78ProThr: 5.78 ± 1.282
5.367ProVal: 5.367 ± 0.49
0.826ProTrp: 0.826 ± 0.332
4.129ProTyr: 4.129 ± 1.015
0.0ProXaa: 0.0 ± 0.0
Gln
5.78GlnAla: 5.78 ± 1.253
1.652GlnCys: 1.652 ± 0.808
3.303GlnAsp: 3.303 ± 1.158
4.129GlnGlu: 4.129 ± 1.509
2.477GlnPhe: 2.477 ± 0.748
4.542GlnGly: 4.542 ± 1.165
1.239GlnHis: 1.239 ± 0.995
3.716GlnIle: 3.716 ± 1.413
4.129GlnLys: 4.129 ± 1.169
5.78GlnLeu: 5.78 ± 0.53
2.064GlnMet: 2.064 ± 1.017
2.89GlnAsn: 2.89 ± 0.907
3.716GlnPro: 3.716 ± 0.813
4.129GlnGln: 4.129 ± 2.034
1.239GlnArg: 1.239 ± 0.897
3.303GlnSer: 3.303 ± 2.067
2.477GlnThr: 2.477 ± 0.566
4.955GlnVal: 4.955 ± 1.633
0.826GlnTrp: 0.826 ± 0.598
1.239GlnTyr: 1.239 ± 0.995
0.0GlnXaa: 0.0 ± 0.0
Arg
2.477ArgAla: 2.477 ± 0.469
0.413ArgCys: 0.413 ± 0.332
2.477ArgAsp: 2.477 ± 0.968
2.477ArgGlu: 2.477 ± 0.86
1.652ArgPhe: 1.652 ± 0.401
4.542ArgGly: 4.542 ± 0.532
0.413ArgHis: 0.413 ± 0.332
1.652ArgIle: 1.652 ± 0.561
2.477ArgLys: 2.477 ± 0.284
4.542ArgLeu: 4.542 ± 1.014
0.0ArgMet: 0.0 ± 0.0
1.652ArgAsn: 1.652 ± 0.497
2.064ArgPro: 2.064 ± 0.488
2.064ArgGln: 2.064 ± 0.286
1.239ArgArg: 1.239 ± 0.729
2.064ArgSer: 2.064 ± 0.996
0.413ArgThr: 0.413 ± 0.332
1.652ArgVal: 1.652 ± 0.648
0.826ArgTrp: 0.826 ± 0.332
0.826ArgTyr: 0.826 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
4.955SerAla: 4.955 ± 1.423
0.826SerCys: 0.826 ± 0.755
3.716SerAsp: 3.716 ± 0.254
2.064SerGlu: 2.064 ± 0.623
2.064SerPhe: 2.064 ± 1.017
4.542SerGly: 4.542 ± 0.311
0.826SerHis: 0.826 ± 0.621
4.129SerIle: 4.129 ± 0.968
3.716SerLys: 3.716 ± 0.372
6.193SerLeu: 6.193 ± 1.216
0.0SerMet: 0.0 ± 0.0
3.303SerAsn: 3.303 ± 0.938
4.542SerPro: 4.542 ± 0.935
4.129SerGln: 4.129 ± 1.303
2.064SerArg: 2.064 ± 0.523
4.542SerSer: 4.542 ± 2.685
2.89SerThr: 2.89 ± 1.241
3.303SerVal: 3.303 ± 1.637
0.0SerTrp: 0.0 ± 0.0
2.89SerTyr: 2.89 ± 1.542
0.0SerXaa: 0.0 ± 0.0
Thr
6.606ThrAla: 6.606 ± 1.334
1.652ThrCys: 1.652 ± 0.579
4.955ThrAsp: 4.955 ± 1.266
2.064ThrGlu: 2.064 ± 0.623
2.477ThrPhe: 2.477 ± 0.367
7.432ThrGly: 7.432 ± 1.887
1.652ThrHis: 1.652 ± 0.808
4.129ThrIle: 4.129 ± 0.835
2.064ThrLys: 2.064 ± 0.649
4.955ThrLeu: 4.955 ± 0.615
1.239ThrMet: 1.239 ± 0.505
3.303ThrAsn: 3.303 ± 1.36
5.78ThrPro: 5.78 ± 1.608
3.716ThrGln: 3.716 ± 1.509
0.826ThrArg: 0.826 ± 0.422
2.477ThrSer: 2.477 ± 0.933
4.129ThrThr: 4.129 ± 0.912
3.716ThrVal: 3.716 ± 0.753
1.652ThrTrp: 1.652 ± 0.887
2.064ThrTyr: 2.064 ± 0.886
0.0ThrXaa: 0.0 ± 0.0
Val
3.716ValAla: 3.716 ± 1.427
1.239ValCys: 1.239 ± 0.283
2.477ValAsp: 2.477 ± 0.732
2.477ValGlu: 2.477 ± 0.983
1.239ValPhe: 1.239 ± 0.505
1.652ValGly: 1.652 ± 0.77
2.89ValHis: 2.89 ± 1.677
2.064ValIle: 2.064 ± 0.563
3.716ValLys: 3.716 ± 2.122
5.367ValLeu: 5.367 ± 0.32
0.413ValMet: 0.413 ± 0.332
2.477ValAsn: 2.477 ± 0.284
4.955ValPro: 4.955 ± 1.714
3.303ValGln: 3.303 ± 0.938
2.89ValArg: 2.89 ± 0.627
3.716ValSer: 3.716 ± 1.537
5.78ValThr: 5.78 ± 0.485
1.239ValVal: 1.239 ± 0.586
0.413ValTrp: 0.413 ± 0.299
0.413ValTyr: 0.413 ± 0.299
0.0ValXaa: 0.0 ± 0.0
Trp
0.413TrpAla: 0.413 ± 0.332
0.0TrpCys: 0.0 ± 0.0
0.826TrpAsp: 0.826 ± 0.664
0.0TrpGlu: 0.0 ± 0.0
0.413TrpPhe: 0.413 ± 0.332
1.652TrpGly: 1.652 ± 1.035
0.826TrpHis: 0.826 ± 0.598
0.0TrpIle: 0.0 ± 0.0
2.064TrpLys: 2.064 ± 0.742
2.477TrpLeu: 2.477 ± 0.469
0.0TrpMet: 0.0 ± 0.0
1.239TrpAsn: 1.239 ± 0.593
2.477TrpPro: 2.477 ± 1.014
1.239TrpGln: 1.239 ± 0.551
2.477TrpArg: 2.477 ± 1.186
0.413TrpSer: 0.413 ± 0.378
1.239TrpThr: 1.239 ± 0.729
1.239TrpVal: 1.239 ± 0.679
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.652TyrAla: 1.652 ± 0.579
0.0TyrCys: 0.0 ± 0.0
1.239TyrAsp: 1.239 ± 0.283
0.826TyrGlu: 0.826 ± 0.332
0.826TyrPhe: 0.826 ± 0.598
1.239TyrGly: 1.239 ± 0.794
1.239TyrHis: 1.239 ± 0.729
1.652TyrIle: 1.652 ± 0.401
0.413TyrLys: 0.413 ± 0.332
2.89TyrLeu: 2.89 ± 0.931
0.826TyrMet: 0.826 ± 0.279
1.239TyrAsn: 1.239 ± 0.283
0.826TyrPro: 0.826 ± 0.598
2.064TyrGln: 2.064 ± 0.891
0.413TyrArg: 0.413 ± 0.332
2.477TyrSer: 2.477 ± 0.46
4.129TyrThr: 4.129 ± 0.669
1.652TyrVal: 1.652 ± 0.497
0.826TyrTrp: 0.826 ± 0.615
0.826TyrTyr: 0.826 ± 0.332
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski