Amino acid dipepetide frequency for Microviridae Fen4707_41

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.664AlaAla: 11.664 ± 2.187
0.0AlaCys: 0.0 ± 0.0
6.998AlaAsp: 6.998 ± 2.082
7.776AlaGlu: 7.776 ± 4.889
2.333AlaPhe: 2.333 ± 2.052
7.776AlaGly: 7.776 ± 2.656
1.555AlaHis: 1.555 ± 0.662
1.555AlaIle: 1.555 ± 0.728
4.666AlaLys: 4.666 ± 0.983
10.109AlaLeu: 10.109 ± 2.242
0.778AlaMet: 0.778 ± 0.516
3.11AlaAsn: 3.11 ± 0.672
3.888AlaPro: 3.888 ± 1.605
4.666AlaGln: 4.666 ± 2.655
6.221AlaArg: 6.221 ± 2.644
5.443AlaSer: 5.443 ± 1.937
0.778AlaThr: 0.778 ± 0.838
6.998AlaVal: 6.998 ± 2.03
0.778AlaTrp: 0.778 ± 0.901
3.888AlaTyr: 3.888 ± 1.248
0.0AlaXaa: 0.0 ± 0.0
Cys
0.778CysAla: 0.778 ± 0.516
0.0CysCys: 0.0 ± 0.0
0.778CysAsp: 0.778 ± 1.055
0.0CysGlu: 0.0 ± 0.0
0.778CysPhe: 0.778 ± 1.753
2.333CysGly: 2.333 ± 1.66
0.778CysHis: 0.778 ± 0.737
0.778CysIle: 0.778 ± 0.516
0.778CysLys: 0.778 ± 1.753
0.778CysLeu: 0.778 ± 0.737
0.0CysMet: 0.0 ± 0.0
0.778CysAsn: 0.778 ± 0.516
0.778CysPro: 0.778 ± 0.737
1.555CysGln: 1.555 ± 1.032
0.0CysArg: 0.0 ± 0.0
2.333CysSer: 2.333 ± 3.422
0.0CysThr: 0.0 ± 0.0
0.778CysVal: 0.778 ± 0.516
0.0CysTrp: 0.0 ± 0.0
0.778CysTyr: 0.778 ± 0.737
0.0CysXaa: 0.0 ± 0.0
Asp
2.333AspAla: 2.333 ± 1.694
0.778AspCys: 0.778 ± 0.737
3.888AspAsp: 3.888 ± 2.046
1.555AspGlu: 1.555 ± 1.802
2.333AspPhe: 2.333 ± 1.548
3.11AspGly: 3.11 ± 1.35
0.778AspHis: 0.778 ± 0.516
2.333AspIle: 2.333 ± 0.93
1.555AspLys: 1.555 ± 1.475
6.221AspLeu: 6.221 ± 2.947
2.333AspMet: 2.333 ± 1.913
2.333AspAsn: 2.333 ± 1.278
4.666AspPro: 4.666 ± 1.767
3.11AspGln: 3.11 ± 1.232
4.666AspArg: 4.666 ± 1.867
2.333AspSer: 2.333 ± 1.66
1.555AspThr: 1.555 ± 0.662
2.333AspVal: 2.333 ± 1.214
0.778AspTrp: 0.778 ± 0.737
4.666AspTyr: 4.666 ± 1.118
0.0AspXaa: 0.0 ± 0.0
Glu
6.998GluAla: 6.998 ± 4.423
0.0GluCys: 0.0 ± 0.0
2.333GluAsp: 2.333 ± 2.704
2.333GluGlu: 2.333 ± 1.865
1.555GluPhe: 1.555 ± 0.869
1.555GluGly: 1.555 ± 1.342
0.778GluHis: 0.778 ± 0.516
0.778GluIle: 0.778 ± 0.901
3.11GluLys: 3.11 ± 2.227
3.11GluLeu: 3.11 ± 1.369
0.0GluMet: 0.0 ± 0.0
2.333GluAsn: 2.333 ± 3.166
1.555GluPro: 1.555 ± 0.869
1.555GluGln: 1.555 ± 0.869
3.888GluArg: 3.888 ± 2.008
1.555GluSer: 1.555 ± 0.728
3.11GluThr: 3.11 ± 0.957
4.666GluVal: 4.666 ± 1.51
0.0GluTrp: 0.0 ± 0.0
1.555GluTyr: 1.555 ± 1.032
0.0GluXaa: 0.0 ± 0.0
Phe
5.443PheAla: 5.443 ± 1.786
1.555PheCys: 1.555 ± 1.032
0.778PheAsp: 0.778 ± 0.516
1.555PheGlu: 1.555 ± 0.662
4.666PhePhe: 4.666 ± 2.605
3.888PheGly: 3.888 ± 1.821
0.0PheHis: 0.0 ± 0.0
1.555PheIle: 1.555 ± 0.662
3.888PheLys: 3.888 ± 2.735
6.221PheLeu: 6.221 ± 3.205
0.778PheMet: 0.778 ± 0.838
4.666PheAsn: 4.666 ± 0.888
4.666PhePro: 4.666 ± 1.397
0.0PheGln: 0.0 ± 0.0
1.555PheArg: 1.555 ± 1.937
3.888PheSer: 3.888 ± 1.557
1.555PheThr: 1.555 ± 1.707
1.555PheVal: 1.555 ± 1.707
2.333PheTrp: 2.333 ± 0.93
0.778PheTyr: 0.778 ± 0.516
0.0PheXaa: 0.0 ± 0.0
Gly
4.666GlyAla: 4.666 ± 1.049
0.778GlyCys: 0.778 ± 0.516
3.888GlyAsp: 3.888 ± 1.931
1.555GlyGlu: 1.555 ± 1.032
3.888GlyPhe: 3.888 ± 1.049
6.221GlyGly: 6.221 ± 2.542
0.0GlyHis: 0.0 ± 0.0
0.778GlyIle: 0.778 ± 0.516
3.11GlyLys: 3.11 ± 2.227
6.221GlyLeu: 6.221 ± 3.969
0.778GlyMet: 0.778 ± 0.796
3.11GlyAsn: 3.11 ± 1.692
1.555GlyPro: 1.555 ± 1.032
2.333GlyGln: 2.333 ± 0.944
3.888GlyArg: 3.888 ± 1.931
6.221GlySer: 6.221 ± 2.671
7.776GlyThr: 7.776 ± 4.28
6.221GlyVal: 6.221 ± 1.761
0.778GlyTrp: 0.778 ± 0.901
2.333GlyTyr: 2.333 ± 1.303
0.0GlyXaa: 0.0 ± 0.0
His
0.778HisAla: 0.778 ± 0.516
0.0HisCys: 0.0 ± 0.0
0.778HisAsp: 0.778 ± 0.516
0.778HisGlu: 0.778 ± 0.901
0.778HisPhe: 0.778 ± 1.055
0.778HisGly: 0.778 ± 0.516
0.0HisHis: 0.0 ± 0.0
0.778HisIle: 0.778 ± 0.516
0.778HisLys: 0.778 ± 0.516
4.666HisLeu: 4.666 ± 1.242
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.555HisPro: 1.555 ± 1.707
1.555HisGln: 1.555 ± 1.004
0.778HisArg: 0.778 ± 0.737
0.778HisSer: 0.778 ± 0.737
0.0HisThr: 0.0 ± 0.0
0.778HisVal: 0.778 ± 0.737
0.0HisTrp: 0.0 ± 0.0
1.555HisTyr: 1.555 ± 1.004
0.0HisXaa: 0.0 ± 0.0
Ile
2.333IleAla: 2.333 ± 0.681
0.0IleCys: 0.0 ± 0.0
3.11IleAsp: 3.11 ± 2.085
0.778IleGlu: 0.778 ± 0.516
2.333IlePhe: 2.333 ± 0.93
3.888IleGly: 3.888 ± 1.945
0.778IleHis: 0.778 ± 0.737
0.778IleIle: 0.778 ± 1.055
0.778IleLys: 0.778 ± 0.737
3.11IleLeu: 3.11 ± 0.957
1.555IleMet: 1.555 ± 1.032
0.778IleAsn: 0.778 ± 1.055
3.888IlePro: 3.888 ± 2.372
2.333IleGln: 2.333 ± 0.974
2.333IleArg: 2.333 ± 2.385
2.333IleSer: 2.333 ± 0.93
2.333IleThr: 2.333 ± 1.214
2.333IleVal: 2.333 ± 0.93
0.778IleTrp: 0.778 ± 0.737
3.888IleTyr: 3.888 ± 1.605
0.0IleXaa: 0.0 ± 0.0
Lys
5.443LysAla: 5.443 ± 2.741
1.555LysCys: 1.555 ± 1.475
2.333LysAsp: 2.333 ± 1.694
3.11LysGlu: 3.11 ± 1.566
2.333LysPhe: 2.333 ± 0.901
1.555LysGly: 1.555 ± 1.202
0.778LysHis: 0.778 ± 0.737
3.888LysIle: 3.888 ± 2.694
3.11LysLys: 3.11 ± 2.278
3.11LysLeu: 3.11 ± 0.672
1.555LysMet: 1.555 ± 0.968
2.333LysAsn: 2.333 ± 2.197
4.666LysPro: 4.666 ± 2.927
3.11LysGln: 3.11 ± 1.835
2.333LysArg: 2.333 ± 1.549
3.888LysSer: 3.888 ± 2.046
0.778LysThr: 0.778 ± 0.901
1.555LysVal: 1.555 ± 0.728
0.778LysTrp: 0.778 ± 0.737
3.888LysTyr: 3.888 ± 1.895
0.0LysXaa: 0.0 ± 0.0
Leu
9.331LeuAla: 9.331 ± 1.884
0.778LeuCys: 0.778 ± 0.737
6.998LeuAsp: 6.998 ± 1.939
3.888LeuGlu: 3.888 ± 2.581
0.778LeuPhe: 0.778 ± 0.516
6.221LeuGly: 6.221 ± 1.378
1.555LeuHis: 1.555 ± 1.014
6.998LeuIle: 6.998 ± 1.16
5.443LeuLys: 5.443 ± 3.627
6.998LeuLeu: 6.998 ± 2.787
2.333LeuMet: 2.333 ± 0.93
6.998LeuAsn: 6.998 ± 1.892
1.555LeuPro: 1.555 ± 0.662
6.221LeuGln: 6.221 ± 1.812
10.886LeuArg: 10.886 ± 2.42
2.333LeuSer: 2.333 ± 1.257
7.776LeuThr: 7.776 ± 1.594
1.555LeuVal: 1.555 ± 1.707
0.778LeuTrp: 0.778 ± 0.516
1.555LeuTyr: 1.555 ± 1.475
0.0LeuXaa: 0.0 ± 0.0
Met
1.555MetAla: 1.555 ± 0.728
0.0MetCys: 0.0 ± 0.0
0.778MetAsp: 0.778 ± 0.516
0.778MetGlu: 0.778 ± 0.838
0.0MetPhe: 0.0 ± 0.0
2.333MetGly: 2.333 ± 0.944
0.778MetHis: 0.778 ± 0.516
0.778MetIle: 0.778 ± 1.055
1.555MetLys: 1.555 ± 2.174
0.778MetLeu: 0.778 ± 0.838
0.0MetMet: 0.0 ± 0.0
0.778MetAsn: 0.778 ± 0.516
0.778MetPro: 0.778 ± 0.516
0.778MetGln: 0.778 ± 0.516
2.333MetArg: 2.333 ± 1.952
2.333MetSer: 2.333 ± 0.93
1.555MetThr: 1.555 ± 0.728
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.778MetTyr: 0.778 ± 0.737
0.0MetXaa: 0.0 ± 0.0
Asn
10.109AsnAla: 10.109 ± 4.944
1.555AsnCys: 1.555 ± 1.843
1.555AsnAsp: 1.555 ± 1.014
3.11AsnGlu: 3.11 ± 3.15
1.555AsnPhe: 1.555 ± 0.662
3.888AsnGly: 3.888 ± 1.991
0.0AsnHis: 0.0 ± 0.0
3.888AsnIle: 3.888 ± 1.466
1.555AsnLys: 1.555 ± 0.662
3.888AsnLeu: 3.888 ± 1.458
1.555AsnMet: 1.555 ± 0.728
0.0AsnAsn: 0.0 ± 0.0
1.555AsnPro: 1.555 ± 1.004
4.666AsnGln: 4.666 ± 1.428
0.0AsnArg: 0.0 ± 0.0
2.333AsnSer: 2.333 ± 2.052
5.443AsnThr: 5.443 ± 3.611
3.888AsnVal: 3.888 ± 2.579
0.0AsnTrp: 0.0 ± 0.0
2.333AsnTyr: 2.333 ± 1.69
0.0AsnXaa: 0.0 ± 0.0
Pro
3.888ProAla: 3.888 ± 1.895
0.778ProCys: 0.778 ± 0.737
3.888ProAsp: 3.888 ± 0.993
5.443ProGlu: 5.443 ± 1.507
2.333ProPhe: 2.333 ± 1.303
3.11ProGly: 3.11 ± 1.692
1.555ProHis: 1.555 ± 1.843
2.333ProIle: 2.333 ± 1.548
0.778ProLys: 0.778 ± 0.838
0.778ProLeu: 0.778 ± 0.737
2.333ProMet: 2.333 ± 1.573
2.333ProAsn: 2.333 ± 0.944
3.11ProPro: 3.11 ± 1.975
2.333ProGln: 2.333 ± 1.11
4.666ProArg: 4.666 ± 2.31
3.11ProSer: 3.11 ± 1.497
3.888ProThr: 3.888 ± 1.246
4.666ProVal: 4.666 ± 1.236
0.0ProTrp: 0.0 ± 0.0
2.333ProTyr: 2.333 ± 1.257
0.0ProXaa: 0.0 ± 0.0
Gln
6.221GlnAla: 6.221 ± 1.752
3.11GlnCys: 3.11 ± 1.232
0.778GlnAsp: 0.778 ± 0.516
2.333GlnGlu: 2.333 ± 1.278
3.888GlnPhe: 3.888 ± 1.049
1.555GlnGly: 1.555 ± 1.676
0.778GlnHis: 0.778 ± 1.055
0.0GlnIle: 0.0 ± 0.0
1.555GlnLys: 1.555 ± 1.475
3.11GlnLeu: 3.11 ± 2.052
1.555GlnMet: 1.555 ± 1.676
4.666GlnAsn: 4.666 ± 1.888
0.778GlnPro: 0.778 ± 0.516
2.333GlnGln: 2.333 ± 1.483
3.11GlnArg: 3.11 ± 1.497
6.221GlnSer: 6.221 ± 4.041
3.11GlnThr: 3.11 ± 2.064
3.888GlnVal: 3.888 ± 1.29
0.778GlnTrp: 0.778 ± 0.516
1.555GlnTyr: 1.555 ± 1.676
0.0GlnXaa: 0.0 ± 0.0
Arg
5.443ArgAla: 5.443 ± 3.091
1.555ArgCys: 1.555 ± 1.707
3.11ArgAsp: 3.11 ± 1.399
2.333ArgGlu: 2.333 ± 1.257
3.888ArgPhe: 3.888 ± 0.658
3.11ArgGly: 3.11 ± 0.957
1.555ArgHis: 1.555 ± 1.202
0.778ArgIle: 0.778 ± 0.737
10.109ArgLys: 10.109 ± 4.837
6.221ArgLeu: 6.221 ± 2.206
0.778ArgMet: 0.778 ± 0.516
2.333ArgAsn: 2.333 ± 1.483
4.666ArgPro: 4.666 ± 2.297
6.221ArgGln: 6.221 ± 1.999
3.11ArgArg: 3.11 ± 2.007
2.333ArgSer: 2.333 ± 0.944
0.0ArgThr: 0.0 ± 0.0
3.11ArgVal: 3.11 ± 2.047
0.0ArgTrp: 0.0 ± 0.0
3.11ArgTyr: 3.11 ± 1.218
0.0ArgXaa: 0.0 ± 0.0
Ser
3.888SerAla: 3.888 ± 3.122
2.333SerCys: 2.333 ± 3.422
2.333SerAsp: 2.333 ± 1.303
1.555SerGlu: 1.555 ± 0.869
2.333SerPhe: 2.333 ± 2.192
10.109SerGly: 10.109 ± 3.701
2.333SerHis: 2.333 ± 1.11
3.11SerIle: 3.11 ± 1.35
2.333SerLys: 2.333 ± 0.681
9.331SerLeu: 9.331 ± 1.92
0.778SerMet: 0.778 ± 0.498
1.555SerAsn: 1.555 ± 1.032
2.333SerPro: 2.333 ± 0.93
0.778SerGln: 0.778 ± 0.838
3.888SerArg: 3.888 ± 3.318
3.11SerSer: 3.11 ± 3.68
4.666SerThr: 4.666 ± 3.095
2.333SerVal: 2.333 ± 1.548
0.778SerTrp: 0.778 ± 0.901
2.333SerTyr: 2.333 ± 1.66
0.0SerXaa: 0.0 ± 0.0
Thr
3.888ThrAla: 3.888 ± 1.458
0.0ThrCys: 0.0 ± 0.0
3.11ThrAsp: 3.11 ± 1.739
0.0ThrGlu: 0.0 ± 0.0
4.666ThrPhe: 4.666 ± 1.985
2.333ThrGly: 2.333 ± 1.548
0.778ThrHis: 0.778 ± 0.838
3.888ThrIle: 3.888 ± 1.895
1.555ThrLys: 1.555 ± 1.342
6.998ThrLeu: 6.998 ± 2.895
0.778ThrMet: 0.778 ± 0.516
4.666ThrAsn: 4.666 ± 1.65
3.888ThrPro: 3.888 ± 1.049
2.333ThrGln: 2.333 ± 1.548
0.0ThrArg: 0.0 ± 0.0
4.666ThrSer: 4.666 ± 3.095
4.666ThrThr: 4.666 ± 3.095
5.443ThrVal: 5.443 ± 2.909
0.778ThrTrp: 0.778 ± 0.516
2.333ThrTyr: 2.333 ± 1.548
0.0ThrXaa: 0.0 ± 0.0
Val
3.11ValAla: 3.11 ± 1.73
0.0ValCys: 0.0 ± 0.0
3.888ValAsp: 3.888 ± 1.049
2.333ValGlu: 2.333 ± 0.681
5.443ValPhe: 5.443 ± 2.566
0.778ValGly: 0.778 ± 0.737
0.0ValHis: 0.0 ± 0.0
3.888ValIle: 3.888 ± 1.991
3.888ValLys: 3.888 ± 2.196
4.666ValLeu: 4.666 ± 2.422
0.0ValMet: 0.0 ± 0.0
3.888ValAsn: 3.888 ± 1.248
5.443ValPro: 5.443 ± 1.342
1.555ValGln: 1.555 ± 1.676
3.888ValArg: 3.888 ± 1.719
4.666ValSer: 4.666 ± 3.095
3.888ValThr: 3.888 ± 1.529
3.888ValVal: 3.888 ± 1.529
1.555ValTrp: 1.555 ± 1.032
4.666ValTyr: 4.666 ± 1.65
0.0ValXaa: 0.0 ± 0.0
Trp
0.778TrpAla: 0.778 ± 0.737
0.0TrpCys: 0.0 ± 0.0
0.778TrpAsp: 0.778 ± 0.737
0.778TrpGlu: 0.778 ± 0.516
0.778TrpPhe: 0.778 ± 0.737
0.0TrpGly: 0.0 ± 0.0
0.778TrpHis: 0.778 ± 0.516
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.555TrpAsn: 1.555 ± 1.032
0.778TrpPro: 0.778 ± 0.516
0.778TrpGln: 0.778 ± 0.737
2.333TrpArg: 2.333 ± 0.93
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
2.333TrpVal: 2.333 ± 2.704
0.0TrpTrp: 0.0 ± 0.0
0.778TrpTyr: 0.778 ± 0.516
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 1.11
0.0TyrCys: 0.0 ± 0.0
1.555TyrAsp: 1.555 ± 1.032
0.778TyrGlu: 0.778 ± 1.055
4.666TyrPhe: 4.666 ± 1.684
1.555TyrGly: 1.555 ± 1.475
1.555TyrHis: 1.555 ± 1.475
1.555TyrIle: 1.555 ± 0.728
1.555TyrLys: 1.555 ± 1.032
4.666TyrLeu: 4.666 ± 1.049
0.0TyrMet: 0.0 ± 0.0
4.666TyrAsn: 4.666 ± 1.362
2.333TyrPro: 2.333 ± 1.11
3.11TyrGln: 3.11 ± 0.672
3.888TyrArg: 3.888 ± 1.049
2.333TyrSer: 2.333 ± 1.696
3.888TyrThr: 3.888 ± 0.658
3.11TyrVal: 3.11 ± 2.011
1.555TyrTrp: 1.555 ± 0.662
1.555TyrTyr: 1.555 ± 0.662
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1287 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski