Amino acid dipeptide frequency for Microviridae Fen7786_21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
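As an illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes sequences are given in one-letter codes (the table below uses three-letter codes) and that pairs are counted as overlapping windows within each protein; whether the database also counts pairs across protein boundaries is not stated here. The names `dipeptide_per_mille` and `toy_proteins` are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from Microviridae Fen7786_21.
toy_proteins = ["MAAL", "AALK"]
freqs = dipeptide_per_mille(toy_proteins)
```

With these toy sequences there are six dipeptides in total, so `freqs["AA"]` is two sixths of 1000.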

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
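The expected value can be checked with a short Python sketch. It assumes a uniform baseline of 1/400 (or 1/441 when Xaa is included) and a plain greater-than or less-than comparison; the exact rule the page uses to pick colours is not stated, so `classify` is only a hypothetical helper.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


baseline_20 = expected_per_mille(20)  # 400 possible dipeptides
baseline_21 = expected_per_mille(21)  # 441 possible dipeptides, Xaa included
```

For example, the AlaAla value of 14.463‰ from the table is classified as "over", and the AlaMet value of 0.689‰ as "under".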

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
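The unit conversion is simple arithmetic; a minimal Python sketch with hypothetical helper names follows.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(14.463)  # AlaAla value from the table
uniform_percent = per_mille_to_percent(2.5)       # uniform expectation for 400 dipeptides
```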

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.463 ± 4.294
AlaCys: 1.377 ± 0.711
AlaAsp: 4.821 ± 1.324
AlaGlu: 5.51 ± 1.117
AlaPhe: 2.066 ± 0.583
AlaGly: 4.821 ± 2.575
AlaHis: 2.066 ± 0.897
AlaIle: 4.132 ± 2.274
AlaLys: 3.444 ± 0.806
AlaLeu: 8.264 ± 1.637
AlaMet: 0.689 ± 0.486
AlaAsn: 6.198 ± 2.911
AlaPro: 6.198 ± 2.04
AlaGln: 8.264 ± 3.132
AlaArg: 4.821 ± 2.013
AlaSer: 4.821 ± 1.94
AlaThr: 4.821 ± 3.106
AlaVal: 3.444 ± 1.461
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.755 ± 0.871
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.689 ± 0.486
CysCys: 0.689 ± 0.824
CysAsp: 0.0 ± 0.0
CysGlu: 1.377 ± 0.973
CysPhe: 0.689 ± 0.715
CysGly: 1.377 ± 1.647
CysHis: 0.0 ± 0.0
CysIle: 0.689 ± 0.824
CysLys: 0.0 ± 0.0
CysLeu: 1.377 ± 1.647
CysMet: 0.0 ± 0.0
CysAsn: 0.689 ± 0.824
CysPro: 0.689 ± 0.824
CysGln: 0.0 ± 0.0
CysArg: 0.689 ± 0.715
CysSer: 0.689 ± 0.486
CysThr: 0.689 ± 0.486
CysVal: 0.689 ± 0.486
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.198 ± 1.824
AspCys: 0.0 ± 0.0
AspAsp: 2.066 ± 0.853
AspGlu: 2.066 ± 1.095
AspPhe: 3.444 ± 2.432
AspGly: 2.755 ± 1.133
AspHis: 0.0 ± 0.0
AspIle: 3.444 ± 2.479
AspLys: 1.377 ± 0.798
AspLeu: 6.198 ± 2.75
AspMet: 0.689 ± 0.715
AspAsn: 2.066 ± 0.643
AspPro: 2.755 ± 1.773
AspGln: 0.0 ± 0.0
AspArg: 3.444 ± 1.544
AspSer: 2.755 ± 1.341
AspThr: 4.132 ± 1.699
AspVal: 3.444 ± 1.729
AspTrp: 1.377 ± 0.711
AspTyr: 2.066 ± 0.976
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.132 ± 1.301
GluCys: 0.0 ± 0.0
GluAsp: 1.377 ± 0.566
GluGlu: 0.689 ± 0.486
GluPhe: 2.066 ± 0.897
GluGly: 1.377 ± 0.566
GluHis: 2.066 ± 0.853
GluIle: 2.755 ± 0.986
GluLys: 2.066 ± 0.853
GluLeu: 4.132 ± 2.171
GluMet: 0.689 ± 0.54
GluAsn: 1.377 ± 0.973
GluPro: 0.689 ± 0.486
GluGln: 2.755 ± 0.986
GluArg: 1.377 ± 0.711
GluSer: 1.377 ± 1.647
GluThr: 3.444 ± 0.486
GluVal: 2.755 ± 1.372
GluTrp: 1.377 ± 0.973
GluTyr: 2.755 ± 1.035
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.066 ± 0.907
PheCys: 0.0 ± 0.0
PheAsp: 0.689 ± 0.486
PheGlu: 0.689 ± 0.486
PhePhe: 2.755 ± 1.946
PheGly: 2.066 ± 0.897
PheHis: 2.066 ± 1.459
PheIle: 2.755 ± 1.421
PheLys: 1.377 ± 0.849
PheLeu: 1.377 ± 0.566
PheMet: 1.377 ± 1.118
PheAsn: 2.066 ± 1.086
PhePro: 3.444 ± 1.138
PheGln: 2.755 ± 1.236
PheArg: 2.755 ± 0.88
PheSer: 3.444 ± 1.681
PheThr: 2.755 ± 1.372
PheVal: 2.066 ± 0.643
PheTrp: 0.689 ± 0.486
PheTyr: 2.066 ± 1.46
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.821 ± 1.38
GlyCys: 0.689 ± 0.486
GlyAsp: 3.444 ± 1.267
GlyGlu: 2.755 ± 0.658
GlyPhe: 0.689 ± 0.824
GlyGly: 6.198 ± 1.602
GlyHis: 2.066 ± 0.907
GlyIle: 3.444 ± 1.432
GlyLys: 1.377 ± 1.385
GlyLeu: 4.821 ± 1.26
GlyMet: 0.689 ± 0.824
GlyAsn: 4.132 ± 1.988
GlyPro: 2.066 ± 1.459
GlyGln: 5.51 ± 1.981
GlyArg: 0.0 ± 0.0
GlySer: 5.51 ± 0.812
GlyThr: 6.198 ± 3.139
GlyVal: 1.377 ± 0.973
GlyTrp: 0.689 ± 0.824
GlyTyr: 3.444 ± 0.999
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.377 ± 0.973
HisCys: 0.0 ± 0.0
HisAsp: 1.377 ± 2.061
HisGlu: 0.0 ± 0.0
HisPhe: 1.377 ± 0.703
HisGly: 2.066 ± 1.459
HisHis: 0.0 ± 0.0
HisIle: 2.755 ± 1.421
HisLys: 0.689 ± 0.486
HisLeu: 2.066 ± 1.415
HisMet: 1.377 ± 0.802
HisAsn: 2.755 ± 0.986
HisPro: 0.0 ± 0.0
HisGln: 0.689 ± 0.824
HisArg: 1.377 ± 1.074
HisSer: 2.755 ± 1.667
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.689 ± 0.486
HisTyr: 1.377 ± 0.711
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.821 ± 2.221
IleCys: 1.377 ± 0.973
IleAsp: 2.755 ± 1.536
IleGlu: 2.066 ± 1.086
IlePhe: 2.066 ± 0.897
IleGly: 2.755 ± 1.199
IleHis: 2.755 ± 2.074
IleIle: 2.066 ± 0.853
IleLys: 3.444 ± 1.592
IleLeu: 4.821 ± 1.94
IleMet: 2.066 ± 1.319
IleAsn: 2.755 ± 0.957
IlePro: 3.444 ± 0.872
IleGln: 2.755 ± 0.815
IleArg: 2.066 ± 0.853
IleSer: 5.51 ± 3.215
IleThr: 3.444 ± 0.802
IleVal: 2.755 ± 0.63
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.444 ± 1.884
LysCys: 0.689 ± 0.824
LysAsp: 1.377 ± 1.118
LysGlu: 2.066 ± 1.555
LysPhe: 2.066 ± 0.976
LysGly: 3.444 ± 2.143
LysHis: 0.689 ± 0.54
LysIle: 2.755 ± 1.11
LysLys: 3.444 ± 2.712
LysLeu: 4.132 ± 1.772
LysMet: 0.689 ± 0.54
LysAsn: 2.066 ± 1.015
LysPro: 1.377 ± 0.703
LysGln: 2.755 ± 1.035
LysArg: 4.132 ± 2.536
LysSer: 2.066 ± 1.015
LysThr: 7.576 ± 2.384
LysVal: 0.689 ± 0.54
LysTrp: 0.0 ± 0.0
LysTyr: 0.689 ± 0.824
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.576 ± 3.913
LeuCys: 0.0 ± 0.0
LeuAsp: 4.821 ± 2.057
LeuGlu: 4.821 ± 1.339
LeuPhe: 4.132 ± 2.553
LeuGly: 5.51 ± 2.33
LeuHis: 0.0 ± 0.0
LeuIle: 3.444 ± 1.153
LeuLys: 5.51 ± 2.613
LeuLeu: 7.576 ± 3.508
LeuMet: 2.755 ± 1.421
LeuAsn: 7.576 ± 0.992
LeuPro: 6.887 ± 1.43
LeuGln: 8.264 ± 2.711
LeuArg: 2.066 ± 0.897
LeuSer: 10.331 ± 2.093
LeuThr: 7.576 ± 3.1
LeuVal: 4.821 ± 2.02
LeuTrp: 2.066 ± 0.853
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.444 ± 0.802
MetCys: 0.0 ± 0.0
MetAsp: 2.755 ± 1.341
MetGlu: 0.689 ± 0.715
MetPhe: 0.0 ± 0.0
MetGly: 2.755 ± 1.421
MetHis: 1.377 ± 0.711
MetIle: 0.0 ± 0.0
MetLys: 0.689 ± 0.715
MetLeu: 2.066 ± 1.273
MetMet: 1.377 ± 0.566
MetAsn: 0.0 ± 0.0
MetPro: 3.444 ± 1.235
MetGln: 0.689 ± 0.54
MetArg: 0.689 ± 0.54
MetSer: 2.066 ± 0.583
MetThr: 1.377 ± 1.385
MetVal: 0.689 ± 0.715
MetTrp: 0.689 ± 0.54
MetTyr: 1.377 ± 0.566
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.887 ± 3.483
AsnCys: 0.689 ± 0.824
AsnAsp: 2.755 ± 1.481
AsnGlu: 0.689 ± 0.54
AsnPhe: 2.066 ± 1.459
AsnGly: 1.377 ± 1.647
AsnHis: 1.377 ± 1.264
AsnIle: 2.066 ± 0.907
AsnLys: 3.444 ± 2.171
AsnLeu: 3.444 ± 2.018
AsnMet: 1.377 ± 0.798
AsnAsn: 5.51 ± 2.312
AsnPro: 3.444 ± 1.432
AsnGln: 5.51 ± 1.747
AsnArg: 2.755 ± 0.88
AsnSer: 3.444 ± 1.415
AsnThr: 3.444 ± 1.177
AsnVal: 6.198 ± 2.124
AsnTrp: 1.377 ± 0.973
AsnTyr: 0.689 ± 0.486
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.132 ± 0.741
ProCys: 0.689 ± 0.824
ProAsp: 4.821 ± 1.343
ProGlu: 2.755 ± 1.256
ProPhe: 2.066 ± 0.897
ProGly: 2.755 ± 1.946
ProHis: 1.377 ± 1.647
ProIle: 4.132 ± 2.918
ProLys: 2.066 ± 1.917
ProLeu: 4.132 ± 2.105
ProMet: 1.377 ± 0.545
ProAsn: 4.821 ± 1.526
ProPro: 1.377 ± 0.973
ProGln: 2.066 ± 0.907
ProArg: 1.377 ± 0.711
ProSer: 2.066 ± 0.907
ProThr: 4.132 ± 1.599
ProVal: 4.821 ± 1.034
ProTrp: 1.377 ± 0.973
ProTyr: 2.755 ± 1.372
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.132 ± 1.541
GlnCys: 0.689 ± 0.486
GlnAsp: 0.689 ± 0.54
GlnGlu: 2.066 ± 0.583
GlnPhe: 2.066 ± 1.161
GlnGly: 2.755 ± 1.341
GlnHis: 1.377 ± 0.946
GlnIle: 4.132 ± 1.286
GlnLys: 4.821 ± 1.156
GlnLeu: 7.576 ± 2.066
GlnMet: 2.755 ± 1.133
GlnAsn: 5.51 ± 1.711
GlnPro: 2.066 ± 0.907
GlnGln: 4.132 ± 2.548
GlnArg: 4.132 ± 1.846
GlnSer: 2.755 ± 1.497
GlnThr: 3.444 ± 2.125
GlnVal: 4.132 ± 1.815
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.51 ± 3.615
ArgCys: 0.0 ± 0.0
ArgAsp: 1.377 ± 0.711
ArgGlu: 0.689 ± 1.031
ArgPhe: 0.689 ± 0.486
ArgGly: 1.377 ± 0.703
ArgHis: 0.0 ± 0.0
ArgIle: 5.51 ± 3.043
ArgLys: 4.132 ± 1.609
ArgLeu: 6.198 ± 1.731
ArgMet: 0.689 ± 0.715
ArgAsn: 2.066 ± 0.897
ArgPro: 3.444 ± 1.544
ArgGln: 2.755 ± 0.658
ArgArg: 0.689 ± 0.824
ArgSer: 2.066 ± 1.627
ArgThr: 3.444 ± 1.138
ArgVal: 2.066 ± 1.775
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.755 ± 1.372
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.576 ± 2.796
SerCys: 1.377 ± 0.711
SerAsp: 6.198 ± 0.912
SerGlu: 2.755 ± 0.88
SerPhe: 3.444 ± 0.806
SerGly: 4.821 ± 2.238
SerHis: 2.066 ± 0.976
SerIle: 1.377 ± 0.973
SerLys: 1.377 ± 1.385
SerLeu: 7.576 ± 1.608
SerMet: 1.377 ± 0.73
SerAsn: 1.377 ± 0.566
SerPro: 2.755 ± 1.372
SerGln: 4.132 ± 2.321
SerArg: 4.132 ± 1.08
SerSer: 6.887 ± 1.386
SerThr: 6.198 ± 2.292
SerVal: 3.444 ± 1.544
SerTrp: 0.689 ± 0.824
SerTyr: 2.066 ± 0.583
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.51 ± 2.428
ThrCys: 1.377 ± 1.429
ThrAsp: 3.444 ± 1.177
ThrGlu: 3.444 ± 0.802
ThrPhe: 4.821 ± 2.062
ThrGly: 8.264 ± 2.266
ThrHis: 0.689 ± 0.54
ThrIle: 2.755 ± 0.63
ThrLys: 3.444 ± 1.461
ThrLeu: 8.264 ± 2.81
ThrMet: 1.377 ± 0.798
ThrAsn: 2.066 ± 0.643
ThrPro: 2.066 ± 0.583
ThrGln: 2.755 ± 0.957
ThrArg: 4.132 ± 2.553
ThrSer: 6.887 ± 2.864
ThrThr: 4.821 ± 2.082
ThrVal: 4.132 ± 1.934
ThrTrp: 0.689 ± 0.54
ThrTyr: 2.755 ± 1.667
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.755 ± 1.625
ValCys: 0.0 ± 0.0
ValAsp: 2.066 ± 0.994
ValGlu: 2.755 ± 0.658
ValPhe: 0.689 ± 0.824
ValGly: 2.066 ± 0.907
ValHis: 1.377 ± 1.429
ValIle: 5.51 ± 1.712
ValLys: 2.755 ± 1.31
ValLeu: 6.198 ± 0.638
ValMet: 2.755 ± 0.815
ValAsn: 1.377 ± 0.703
ValPro: 6.887 ± 2.274
ValGln: 0.689 ± 0.824
ValArg: 2.755 ± 1.112
ValSer: 4.132 ± 1.624
ValThr: 3.444 ± 1.63
ValVal: 2.066 ± 0.907
ValTrp: 0.0 ± 0.0
ValTyr: 2.755 ± 0.63
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.689 ± 0.824
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.689 ± 0.486
TrpPhe: 1.377 ± 0.973
TrpGly: 0.0 ± 0.0
TrpHis: 0.689 ± 0.486
TrpIle: 0.0 ± 0.0
TrpLys: 0.689 ± 0.715
TrpLeu: 1.377 ± 0.849
TrpMet: 0.0 ± 0.0
TrpAsn: 0.689 ± 0.54
TrpPro: 0.689 ± 0.486
TrpGln: 0.689 ± 0.486
TrpArg: 1.377 ± 0.711
TrpSer: 0.689 ± 0.486
TrpThr: 0.689 ± 0.486
TrpVal: 0.689 ± 0.824
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.377 ± 0.711
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.755 ± 1.372
TyrCys: 1.377 ± 1.647
TyrAsp: 3.444 ± 1.877
TyrGlu: 0.689 ± 0.486
TyrPhe: 0.689 ± 0.486
TyrGly: 1.377 ± 0.849
TyrHis: 0.689 ± 0.824
TyrIle: 0.689 ± 0.824
TyrLys: 0.0 ± 0.0
TyrLeu: 4.132 ± 1.001
TyrMet: 1.377 ± 0.711
TyrAsn: 3.444 ± 2.018
TyrPro: 1.377 ± 0.946
TyrGln: 2.066 ± 1.459
TyrArg: 0.689 ± 0.486
TyrSer: 1.377 ± 0.703
TyrThr: 2.066 ± 0.853
TyrVal: 2.755 ± 0.658
TyrTrp: 0.689 ± 0.824
TyrTyr: 2.066 ± 0.897
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1453 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
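Protein-level bootstrapping can be sketched in Python as follows. This is only an illustration of the general technique: it assumes proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is the population standard deviation across replicates. The page does not state these details, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(proteins):
    """Count overlapping dipeptides within each protein (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's per mille frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = pair_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    # Assumption: population standard deviation is used as the error estimate.
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from Microviridae Fen7786_21.
error_aa = bootstrap_error(["MAAL", "AALK", "MKKL"], "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.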

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski