Amino acid dipepetide frequency for Simian foamy virus Pongo pygmaeus pygmaeus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.874AlaAla: 3.874 ± 0.829
0.596AlaCys: 0.596 ± 0.228
2.682AlaAsp: 2.682 ± 1.316
4.172AlaGlu: 4.172 ± 1.196
1.788AlaPhe: 1.788 ± 0.409
2.384AlaGly: 2.384 ± 0.817
1.49AlaHis: 1.49 ± 0.459
3.278AlaIle: 3.278 ± 0.737
2.086AlaLys: 2.086 ± 0.853
5.364AlaLeu: 5.364 ± 1.185
0.596AlaMet: 0.596 ± 0.428
4.172AlaAsn: 4.172 ± 0.614
3.278AlaPro: 3.278 ± 2.258
2.98AlaGln: 2.98 ± 0.509
2.98AlaArg: 2.98 ± 0.892
3.576AlaSer: 3.576 ± 0.921
2.98AlaThr: 2.98 ± 1.032
4.768AlaVal: 4.768 ± 1.476
0.596AlaTrp: 0.596 ± 0.489
1.788AlaTyr: 1.788 ± 0.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.596CysAla: 0.596 ± 0.489
0.596CysCys: 0.596 ± 0.38
0.894CysAsp: 0.894 ± 0.527
0.298CysGlu: 0.298 ± 0.245
1.192CysPhe: 1.192 ± 0.979
1.192CysGly: 1.192 ± 0.76
0.0CysHis: 0.0 ± 0.0
1.788CysIle: 1.788 ± 0.668
0.596CysLys: 0.596 ± 0.228
1.49CysLeu: 1.49 ± 0.441
0.0CysMet: 0.0 ± 0.0
1.192CysAsn: 1.192 ± 0.82
0.894CysPro: 0.894 ± 0.361
0.596CysGln: 0.596 ± 0.343
0.894CysArg: 0.894 ± 0.527
0.894CysSer: 0.894 ± 0.527
0.0CysThr: 0.0 ± 0.0
0.596CysVal: 0.596 ± 0.38
0.596CysTrp: 0.596 ± 0.343
1.192CysTyr: 1.192 ± 0.76
0.0CysXaa: 0.0 ± 0.0
Asp
3.278AspAla: 3.278 ± 0.715
1.49AspCys: 1.49 ± 0.646
1.788AspAsp: 1.788 ± 0.791
1.788AspGlu: 1.788 ± 0.335
1.49AspPhe: 1.49 ± 0.895
1.49AspGly: 1.49 ± 0.281
1.192AspHis: 1.192 ± 0.651
2.682AspIle: 2.682 ± 1.151
2.384AspLys: 2.384 ± 0.971
3.576AspLeu: 3.576 ± 1.348
0.894AspMet: 0.894 ± 0.653
2.086AspAsn: 2.086 ± 0.253
2.98AspPro: 2.98 ± 1.069
2.086AspGln: 2.086 ± 0.684
2.086AspArg: 2.086 ± 0.531
3.278AspSer: 3.278 ± 0.696
0.894AspThr: 0.894 ± 0.368
3.576AspVal: 3.576 ± 0.514
1.788AspTrp: 1.788 ± 0.869
2.682AspTyr: 2.682 ± 0.68
0.0AspXaa: 0.0 ± 0.0
Glu
2.682GluAla: 2.682 ± 0.848
1.192GluCys: 1.192 ± 0.778
2.086GluAsp: 2.086 ± 0.662
6.853GluGlu: 6.853 ± 1.711
1.49GluPhe: 1.49 ± 0.496
4.768GluGly: 4.768 ± 0.599
1.192GluHis: 1.192 ± 0.419
3.576GluIle: 3.576 ± 0.966
3.874GluLys: 3.874 ± 1.186
5.066GluLeu: 5.066 ± 0.86
1.49GluMet: 1.49 ± 0.695
1.49GluAsn: 1.49 ± 0.888
2.384GluPro: 2.384 ± 1.146
2.384GluGln: 2.384 ± 0.814
5.959GluArg: 5.959 ± 0.857
4.172GluSer: 4.172 ± 1.02
2.384GluThr: 2.384 ± 0.728
4.172GluVal: 4.172 ± 0.639
0.0GluTrp: 0.0 ± 0.0
0.298GluTyr: 0.298 ± 0.214
0.0GluXaa: 0.0 ± 0.0
Phe
1.49PheAla: 1.49 ± 0.76
0.894PheCys: 0.894 ± 0.535
0.894PheAsp: 0.894 ± 0.453
0.596PheGlu: 0.596 ± 0.305
0.298PhePhe: 0.298 ± 0.214
2.086PheGly: 2.086 ± 0.761
0.894PheHis: 0.894 ± 0.584
1.788PheIle: 1.788 ± 0.278
1.788PheLys: 1.788 ± 0.684
2.384PheLeu: 2.384 ± 0.826
0.0PheMet: 0.0 ± 0.0
1.192PheAsn: 1.192 ± 0.283
1.192PhePro: 1.192 ± 0.667
2.086PheGln: 2.086 ± 0.692
0.596PheArg: 0.596 ± 0.305
1.192PheSer: 1.192 ± 0.348
1.49PheThr: 1.49 ± 0.601
2.384PheVal: 2.384 ± 0.688
0.894PheTrp: 0.894 ± 0.295
1.192PheTyr: 1.192 ± 0.558
0.0PheXaa: 0.0 ± 0.0
Gly
1.788GlyAla: 1.788 ± 0.819
0.596GlyCys: 0.596 ± 0.525
1.788GlyAsp: 1.788 ± 0.722
3.278GlyGlu: 3.278 ± 2.307
1.788GlyPhe: 1.788 ± 0.967
4.172GlyGly: 4.172 ± 3.402
1.788GlyHis: 1.788 ± 0.59
4.768GlyIle: 4.768 ± 0.985
2.98GlyLys: 2.98 ± 0.809
5.959GlyLeu: 5.959 ± 1.792
1.788GlyMet: 1.788 ± 0.554
4.172GlyAsn: 4.172 ± 0.758
2.682GlyPro: 2.682 ± 0.68
4.47GlyGln: 4.47 ± 2.029
4.47GlyArg: 4.47 ± 2.56
4.172GlySer: 4.172 ± 1.331
3.278GlyThr: 3.278 ± 1.121
3.278GlyVal: 3.278 ± 0.548
0.894GlyTrp: 0.894 ± 0.437
3.278GlyTyr: 3.278 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
1.192HisAla: 1.192 ± 0.716
0.596HisCys: 0.596 ± 0.228
1.192HisAsp: 1.192 ± 0.283
0.894HisGlu: 0.894 ± 0.295
1.192HisPhe: 1.192 ± 0.322
0.596HisGly: 0.596 ± 0.312
0.0HisHis: 0.0 ± 0.0
1.788HisIle: 1.788 ± 0.467
1.192HisLys: 1.192 ± 0.558
4.768HisLeu: 4.768 ± 0.972
0.298HisMet: 0.298 ± 0.245
0.596HisAsn: 0.596 ± 0.312
3.278HisPro: 3.278 ± 0.553
0.894HisGln: 0.894 ± 0.368
0.894HisArg: 0.894 ± 0.202
1.49HisSer: 1.49 ± 0.76
1.788HisThr: 1.788 ± 0.681
1.49HisVal: 1.49 ± 0.486
0.894HisTrp: 0.894 ± 0.502
0.596HisTyr: 0.596 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
2.98IleAla: 2.98 ± 0.904
1.192IleCys: 1.192 ± 0.741
2.384IleAsp: 2.384 ± 0.726
2.384IleGlu: 2.384 ± 1.176
1.192IlePhe: 1.192 ± 0.667
3.576IleGly: 3.576 ± 1.015
2.086IleHis: 2.086 ± 0.492
4.47IleIle: 4.47 ± 1.596
6.555IleLys: 6.555 ± 2.44
7.151IleLeu: 7.151 ± 1.588
1.192IleMet: 1.192 ± 0.348
2.98IleAsn: 2.98 ± 0.622
9.237IlePro: 9.237 ± 1.351
2.98IleGln: 2.98 ± 0.627
2.682IleArg: 2.682 ± 0.906
3.278IleSer: 3.278 ± 1.177
4.172IleThr: 4.172 ± 1.585
3.874IleVal: 3.874 ± 1.045
0.894IleTrp: 0.894 ± 0.73
1.49IleTyr: 1.49 ± 0.888
0.0IleXaa: 0.0 ± 0.0
Lys
4.172LysAla: 4.172 ± 1.439
1.192LysCys: 1.192 ± 0.632
2.682LysAsp: 2.682 ± 1.021
4.172LysGlu: 4.172 ± 1.376
0.894LysPhe: 0.894 ± 0.642
2.682LysGly: 2.682 ± 1.258
2.086LysHis: 2.086 ± 0.921
3.874LysIle: 3.874 ± 1.214
2.384LysLys: 2.384 ± 0.621
5.662LysLeu: 5.662 ± 1.522
0.596LysMet: 0.596 ± 0.525
3.278LysAsn: 3.278 ± 1.301
3.576LysPro: 3.576 ± 1.418
2.98LysGln: 2.98 ± 0.993
1.788LysArg: 1.788 ± 0.558
3.576LysSer: 3.576 ± 0.994
4.47LysThr: 4.47 ± 1.278
3.576LysVal: 3.576 ± 1.142
1.49LysTrp: 1.49 ± 0.395
2.98LysTyr: 2.98 ± 1.579
0.0LysXaa: 0.0 ± 0.0
Leu
6.555LeuAla: 6.555 ± 1.115
1.49LeuCys: 1.49 ± 1.34
4.47LeuAsp: 4.47 ± 0.538
5.959LeuGlu: 5.959 ± 1.133
2.384LeuPhe: 2.384 ± 0.914
4.768LeuGly: 4.768 ± 0.249
2.682LeuHis: 2.682 ± 0.428
6.257LeuIle: 6.257 ± 0.995
5.959LeuLys: 5.959 ± 1.964
11.025LeuLeu: 11.025 ± 2.348
1.49LeuMet: 1.49 ± 0.633
4.172LeuAsn: 4.172 ± 0.651
5.066LeuPro: 5.066 ± 0.608
7.151LeuGln: 7.151 ± 1.362
6.257LeuArg: 6.257 ± 2.248
5.662LeuSer: 5.662 ± 1.082
7.151LeuThr: 7.151 ± 1.468
4.768LeuVal: 4.768 ± 0.855
1.788LeuTrp: 1.788 ± 0.67
3.874LeuTyr: 3.874 ± 1.425
0.0LeuXaa: 0.0 ± 0.0
Met
1.192MetAla: 1.192 ± 0.471
0.298MetCys: 0.298 ± 0.359
0.298MetAsp: 0.298 ± 0.359
0.596MetGlu: 0.596 ± 0.489
0.0MetPhe: 0.0 ± 0.0
1.49MetGly: 1.49 ± 0.7
0.298MetHis: 0.298 ± 0.31
0.894MetIle: 0.894 ± 0.457
1.192MetLys: 1.192 ± 0.419
2.682MetLeu: 2.682 ± 0.591
0.894MetMet: 0.894 ± 0.437
0.894MetAsn: 0.894 ± 0.502
1.49MetPro: 1.49 ± 0.807
1.192MetGln: 1.192 ± 0.857
0.894MetArg: 0.894 ± 0.295
0.894MetSer: 0.894 ± 0.708
2.384MetThr: 2.384 ± 0.869
1.192MetVal: 1.192 ± 0.558
0.298MetTrp: 0.298 ± 0.31
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.98AsnAla: 2.98 ± 0.991
0.0AsnCys: 0.0 ± 0.0
2.682AsnAsp: 2.682 ± 0.964
3.576AsnGlu: 3.576 ± 1.115
2.086AsnPhe: 2.086 ± 0.372
3.874AsnGly: 3.874 ± 1.267
1.192AsnHis: 1.192 ± 0.281
4.768AsnIle: 4.768 ± 1.597
2.384AsnLys: 2.384 ± 0.742
4.768AsnLeu: 4.768 ± 1.205
1.49AsnMet: 1.49 ± 0.471
4.47AsnAsn: 4.47 ± 0.926
4.768AsnPro: 4.768 ± 1.383
3.278AsnGln: 3.278 ± 1.262
2.682AsnArg: 2.682 ± 0.823
3.576AsnSer: 3.576 ± 0.835
2.682AsnThr: 2.682 ± 0.566
1.192AsnVal: 1.192 ± 0.322
0.596AsnTrp: 0.596 ± 0.489
1.49AsnTyr: 1.49 ± 0.298
0.0AsnXaa: 0.0 ± 0.0
Pro
3.278ProAla: 3.278 ± 1.036
0.298ProCys: 0.298 ± 0.245
1.788ProAsp: 1.788 ± 0.768
4.768ProGlu: 4.768 ± 1.16
2.086ProPhe: 2.086 ± 0.405
2.98ProGly: 2.98 ± 1.693
1.788ProHis: 1.788 ± 0.684
5.066ProIle: 5.066 ± 1.26
3.576ProLys: 3.576 ± 0.944
7.449ProLeu: 7.449 ± 0.652
1.788ProMet: 1.788 ± 0.89
2.98ProAsn: 2.98 ± 0.887
6.257ProPro: 6.257 ± 1.145
3.874ProGln: 3.874 ± 1.404
5.662ProArg: 5.662 ± 2.35
7.449ProSer: 7.449 ± 1.267
4.172ProThr: 4.172 ± 0.614
4.768ProVal: 4.768 ± 1.641
1.49ProTrp: 1.49 ± 1.042
2.682ProTyr: 2.682 ± 0.524
0.0ProXaa: 0.0 ± 0.0
Gln
2.682GlnAla: 2.682 ± 0.707
1.788GlnCys: 1.788 ± 0.457
2.384GlnAsp: 2.384 ± 0.557
5.066GlnGlu: 5.066 ± 0.79
0.894GlnPhe: 0.894 ± 0.492
6.257GlnGly: 6.257 ± 1.539
2.086GlnHis: 2.086 ± 0.694
1.49GlnIle: 1.49 ± 0.742
1.192GlnLys: 1.192 ± 0.558
4.172GlnLeu: 4.172 ± 0.782
1.49GlnMet: 1.49 ± 0.489
4.47GlnAsn: 4.47 ± 1.25
3.874GlnPro: 3.874 ± 0.973
4.47GlnGln: 4.47 ± 0.576
1.49GlnArg: 1.49 ± 1.55
2.98GlnSer: 2.98 ± 2.015
1.49GlnThr: 1.49 ± 0.668
3.576GlnVal: 3.576 ± 0.798
0.894GlnTrp: 0.894 ± 0.422
1.788GlnTyr: 1.788 ± 0.401
0.0GlnXaa: 0.0 ± 0.0
Arg
3.874ArgAla: 3.874 ± 2.052
0.894ArgCys: 0.894 ± 0.527
1.788ArgAsp: 1.788 ± 1.128
2.682ArgGlu: 2.682 ± 0.428
0.894ArgPhe: 0.894 ± 0.437
4.47ArgGly: 4.47 ± 3.265
0.298ArgHis: 0.298 ± 0.31
2.98ArgIle: 2.98 ± 0.543
4.172ArgLys: 4.172 ± 1.14
3.874ArgLeu: 3.874 ± 1.044
0.894ArgMet: 0.894 ± 0.202
2.384ArgAsn: 2.384 ± 1.259
5.662ArgPro: 5.662 ± 2.384
0.894ArgGln: 0.894 ± 0.395
4.172ArgArg: 4.172 ± 1.66
2.682ArgSer: 2.682 ± 1.067
2.682ArgThr: 2.682 ± 0.591
2.384ArgVal: 2.384 ± 0.558
1.192ArgTrp: 1.192 ± 0.612
1.192ArgTyr: 1.192 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
4.47SerAla: 4.47 ± 1.792
0.894SerCys: 0.894 ± 0.734
4.47SerAsp: 4.47 ± 1.698
1.49SerGlu: 1.49 ± 0.685
1.788SerPhe: 1.788 ± 1.117
4.172SerGly: 4.172 ± 1.813
2.384SerHis: 2.384 ± 0.514
5.066SerIle: 5.066 ± 0.709
2.384SerLys: 2.384 ± 0.826
5.662SerLeu: 5.662 ± 1.115
1.49SerMet: 1.49 ± 0.464
4.172SerAsn: 4.172 ± 1.147
5.662SerPro: 5.662 ± 0.728
3.278SerGln: 3.278 ± 0.47
1.192SerArg: 1.192 ± 0.664
4.768SerSer: 4.768 ± 2.265
6.257SerThr: 6.257 ± 0.474
2.086SerVal: 2.086 ± 0.653
1.192SerTrp: 1.192 ± 0.651
1.788SerTyr: 1.788 ± 0.404
0.0SerXaa: 0.0 ± 0.0
Thr
3.874ThrAla: 3.874 ± 1.151
1.192ThrCys: 1.192 ± 0.486
2.384ThrAsp: 2.384 ± 0.759
2.682ThrGlu: 2.682 ± 0.848
1.788ThrPhe: 1.788 ± 0.658
4.768ThrGly: 4.768 ± 0.704
1.788ThrHis: 1.788 ± 0.467
3.278ThrIle: 3.278 ± 0.825
4.768ThrLys: 4.768 ± 1.363
5.364ThrLeu: 5.364 ± 1.584
0.596ThrMet: 0.596 ± 0.228
1.49ThrAsn: 1.49 ± 0.486
5.066ThrPro: 5.066 ± 1.606
2.086ThrGln: 2.086 ± 0.573
2.384ThrArg: 2.384 ± 0.646
5.959ThrSer: 5.959 ± 0.791
3.874ThrThr: 3.874 ± 1.279
2.98ThrVal: 2.98 ± 0.966
0.894ThrTrp: 0.894 ± 0.437
1.788ThrTyr: 1.788 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
2.086ValAla: 2.086 ± 0.492
0.298ValCys: 0.298 ± 0.359
3.278ValAsp: 3.278 ± 0.972
2.682ValGlu: 2.682 ± 0.867
1.192ValPhe: 1.192 ± 0.62
2.384ValGly: 2.384 ± 0.457
1.192ValHis: 1.192 ± 0.683
4.47ValIle: 4.47 ± 0.553
4.47ValLys: 4.47 ± 1.093
6.555ValLeu: 6.555 ± 1.012
0.596ValMet: 0.596 ± 0.41
4.768ValAsn: 4.768 ± 0.647
3.874ValPro: 3.874 ± 1.184
3.874ValGln: 3.874 ± 0.601
0.894ValArg: 0.894 ± 0.584
2.086ValSer: 2.086 ± 0.694
3.874ValThr: 3.874 ± 1.594
3.874ValVal: 3.874 ± 1.873
0.894ValTrp: 0.894 ± 0.422
5.066ValTyr: 5.066 ± 0.858
0.0ValXaa: 0.0 ± 0.0
Trp
0.894TrpAla: 0.894 ± 0.437
0.0TrpCys: 0.0 ± 0.0
1.788TrpAsp: 1.788 ± 0.532
1.788TrpGlu: 1.788 ± 0.629
0.0TrpPhe: 0.0 ± 0.0
0.596TrpGly: 0.596 ± 0.429
0.596TrpHis: 0.596 ± 0.343
1.788TrpIle: 1.788 ± 0.397
1.49TrpLys: 1.49 ± 0.591
2.384TrpLeu: 2.384 ± 0.659
0.596TrpMet: 0.596 ± 0.343
0.894TrpAsn: 0.894 ± 0.422
0.894TrpPro: 0.894 ± 0.422
1.192TrpGln: 1.192 ± 0.348
1.192TrpArg: 1.192 ± 0.436
0.596TrpSer: 0.596 ± 0.228
0.894TrpThr: 0.894 ± 0.368
0.0TrpVal: 0.0 ± 0.0
0.894TrpTrp: 0.894 ± 0.395
0.596TrpTyr: 0.596 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.49TyrAla: 1.49 ± 0.508
0.0TyrCys: 0.0 ± 0.0
1.788TyrAsp: 1.788 ± 0.684
1.788TyrGlu: 1.788 ± 0.722
0.894TyrPhe: 0.894 ± 0.527
2.682TyrGly: 2.682 ± 1.469
0.596TyrHis: 0.596 ± 0.343
2.682TyrIle: 2.682 ± 0.515
2.682TyrLys: 2.682 ± 0.986
3.576TyrLeu: 3.576 ± 1.352
0.596TyrMet: 0.596 ± 0.481
2.682TyrAsn: 2.682 ± 0.909
2.086TyrPro: 2.086 ± 0.618
1.788TyrGln: 1.788 ± 0.397
0.894TyrArg: 0.894 ± 0.437
2.682TyrSer: 2.682 ± 0.751
2.086TyrThr: 2.086 ± 1.07
3.874TyrVal: 3.874 ± 0.639
0.894TyrTrp: 0.894 ± 0.642
2.682TyrTyr: 2.682 ± 1.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3357 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski