Amino acid dipeptide frequency for Tortoise microvirus 90

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
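As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues in each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the toy sequences, and the choice to count pairs within each protein only are assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; "X" maps to Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping adjacent pairs, counted within each protein only.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }

# Toy input, not the real proteome: 6 dipeptides in total, 2 of them AlaAla.
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
print(round(freqs["AlaAla"], 1))
```

The returned dictionary always has 441 entries, so dipeptides that never occur show up with a frequency of 0.0, as in the table.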

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
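The arithmetic behind the expected value can be sketched as follows. The `classify` helper and its labels are hypothetical; it assumes the uniform expectation of 1/400 is the threshold separating the red and blue cells, which the page suggests but does not state exactly.

```python
N_STANDARD = 20

# Uniform expectation: each of the 20 x 20 dipeptides is equally likely.
n_dipeptides = N_STANDARD ** 2              # 400
n_with_xaa = (N_STANDARD + 1) ** 2          # 441, counting Xaa as a 21st residue
expected_permille = 1000.0 / n_dipeptides   # 2.5 per mille, i.e. 0.25 %

def classify(freq_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"      # would be marked in red
    if freq_permille < expected:
        return "under"     # would be marked in blue
    return "expected"

print(classify(8.104))  # AlaAla value from the table
print(classify(0.81))   # AlaCys value from the table
```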

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
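For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are made up for this example.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

leu_arg = 10.535  # LeuArg value from the table, in per mille
print(permille_to_fraction(leu_arg))
print(permille_to_percent(leu_arg))
```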

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.104 ± 3.429
AlaCys: 0.81 ± 0.859
AlaAsp: 5.673 ± 2.167
AlaGlu: 8.914 ± 3.089
AlaPhe: 0.81 ± 0.674
AlaGly: 4.862 ± 2.468
AlaHis: 1.621 ± 1.049
AlaIle: 1.621 ± 0.68
AlaLys: 3.241 ± 2.295
AlaLeu: 9.724 ± 2.122
AlaMet: 0.81 ± 0.859
AlaAsn: 4.052 ± 1.071
AlaPro: 4.052 ± 1.333
AlaGln: 4.862 ± 2.552
AlaArg: 3.241 ± 1.36
AlaSer: 4.862 ± 2.786
AlaThr: 1.621 ± 0.86
AlaVal: 4.862 ± 0.656
AlaTrp: 1.621 ± 0.68
AlaTyr: 3.241 ± 1.36
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.81 ± 0.859
CysAsp: 0.81 ± 0.859
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.81 ± 0.859
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.81 ± 0.859
CysLeu: 0.0 ± 0.0
CysMet: 0.81 ± 0.859
CysAsn: 0.0 ± 0.0
CysPro: 0.81 ± 0.859
CysGln: 0.81 ± 0.525
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.431 ± 0.858
AspCys: 0.0 ± 0.0
AspAsp: 2.431 ± 0.458
AspGlu: 0.81 ± 0.525
AspPhe: 4.862 ± 3.998
AspGly: 2.431 ± 0.858
AspHis: 0.81 ± 0.859
AspIle: 2.431 ± 0.458
AspLys: 2.431 ± 1.107
AspLeu: 8.914 ± 2.75
AspMet: 0.81 ± 0.938
AspAsn: 4.862 ± 3.193
AspPro: 3.241 ± 1.442
AspGln: 3.241 ± 1.721
AspArg: 2.431 ± 1.601
AspSer: 1.621 ± 0.667
AspThr: 3.241 ± 1.335
AspVal: 4.052 ± 0.862
AspTrp: 0.0 ± 0.0
AspTyr: 4.862 ± 2.432
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.862 ± 1.36
GluCys: 0.81 ± 0.859
GluAsp: 0.81 ± 0.525
GluGlu: 3.241 ± 1.205
GluPhe: 1.621 ± 0.68
GluGly: 3.241 ± 1.25
GluHis: 2.431 ± 0.994
GluIle: 4.052 ± 1.912
GluLys: 8.104 ± 1.587
GluLeu: 4.862 ± 2.688
GluMet: 0.0 ± 0.0
GluAsn: 1.621 ± 0.877
GluPro: 0.81 ± 0.525
GluGln: 4.862 ± 0.955
GluArg: 4.862 ± 1.619
GluSer: 4.052 ± 0.756
GluThr: 1.621 ± 0.667
GluVal: 1.621 ± 0.86
GluTrp: 0.81 ± 0.525
GluTyr: 4.862 ± 0.911
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.483 ± 2.708
PheCys: 0.0 ± 0.0
PheAsp: 2.431 ± 0.858
PheGlu: 2.431 ± 1.458
PhePhe: 1.621 ± 1.049
PheGly: 2.431 ± 1.574
PheHis: 2.431 ± 0.858
PheIle: 3.241 ± 1.25
PheLys: 3.241 ± 1.873
PheLeu: 0.0 ± 0.0
PheMet: 1.621 ± 1.719
PheAsn: 0.81 ± 0.525
PhePro: 0.81 ± 0.859
PheGln: 1.621 ± 0.877
PheArg: 3.241 ± 2.295
PheSer: 3.241 ± 0.924
PheThr: 1.621 ± 1.049
PheVal: 1.621 ± 0.68
PheTrp: 0.81 ± 0.525
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.673 ± 1.608
GlyCys: 0.0 ± 0.0
GlyAsp: 5.673 ± 2.079
GlyGlu: 4.862 ± 1.54
GlyPhe: 2.431 ± 0.458
GlyGly: 5.673 ± 1.989
GlyHis: 0.81 ± 0.525
GlyIle: 4.052 ± 1.333
GlyLys: 2.431 ± 1.458
GlyLeu: 5.673 ± 3.197
GlyMet: 0.0 ± 0.0
GlyAsn: 3.241 ± 2.099
GlyPro: 4.052 ± 1.93
GlyGln: 1.621 ± 1.049
GlyArg: 4.052 ± 1.558
GlySer: 4.052 ± 1.558
GlyThr: 2.431 ± 0.858
GlyVal: 3.241 ± 0.924
GlyTrp: 0.81 ± 0.525
GlyTyr: 2.431 ± 1.574
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.81 ± 0.859
HisCys: 0.81 ± 0.525
HisAsp: 2.431 ± 0.994
HisGlu: 0.0 ± 0.0
HisPhe: 4.052 ± 1.714
HisGly: 1.621 ± 1.049
HisHis: 0.81 ± 0.525
HisIle: 2.431 ± 0.858
HisLys: 1.621 ± 0.68
HisLeu: 0.81 ± 0.525
HisMet: 1.621 ± 0.667
HisAsn: 0.81 ± 0.859
HisPro: 2.431 ± 0.959
HisGln: 0.81 ± 0.859
HisArg: 1.621 ± 0.68
HisSer: 0.81 ± 0.859
HisThr: 0.0 ± 0.0
HisVal: 1.621 ± 0.68
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.81 ± 0.525
IleCys: 0.0 ± 0.0
IleAsp: 4.862 ± 2.002
IleGlu: 3.241 ± 1.442
IlePhe: 3.241 ± 0.924
IleGly: 3.241 ± 0.448
IleHis: 2.431 ± 0.858
IleIle: 3.241 ± 1.25
IleLys: 4.862 ± 2.432
IleLeu: 1.621 ± 0.667
IleMet: 0.81 ± 0.754
IleAsn: 0.81 ± 0.525
IlePro: 7.293 ± 2.21
IleGln: 2.431 ± 1.216
IleArg: 0.0 ± 0.0
IleSer: 7.293 ± 2.493
IleThr: 2.431 ± 1.234
IleVal: 0.81 ± 0.525
IleTrp: 0.0 ± 0.0
IleTyr: 3.241 ± 0.448
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.914 ± 0.684
LysCys: 0.81 ± 0.859
LysAsp: 2.431 ± 0.994
LysGlu: 2.431 ± 0.907
LysPhe: 3.241 ± 1.25
LysGly: 3.241 ± 1.205
LysHis: 0.81 ± 0.898
LysIle: 1.621 ± 0.68
LysLys: 1.621 ± 1.239
LysLeu: 7.293 ± 2.885
LysMet: 2.431 ± 0.542
LysAsn: 2.431 ± 0.907
LysPro: 0.0 ± 0.0
LysGln: 3.241 ± 1.346
LysArg: 4.052 ± 2.107
LysSer: 5.673 ± 0.879
LysThr: 4.862 ± 1.995
LysVal: 2.431 ± 0.858
LysTrp: 0.0 ± 0.0
LysTyr: 1.621 ± 0.68
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.862 ± 3.083
LeuCys: 0.81 ± 0.859
LeuAsp: 5.673 ± 2.104
LeuGlu: 4.862 ± 1.399
LeuPhe: 0.81 ± 0.674
LeuGly: 7.293 ± 0.234
LeuHis: 1.621 ± 0.68
LeuIle: 5.673 ± 2.516
LeuLys: 6.483 ± 3.444
LeuLeu: 4.052 ± 0.756
LeuMet: 1.621 ± 0.877
LeuAsn: 4.862 ± 1.987
LeuPro: 5.673 ± 0.655
LeuGln: 4.862 ± 0.955
LeuArg: 10.535 ± 1.354
LeuSer: 6.483 ± 2.188
LeuThr: 4.052 ± 2.107
LeuVal: 2.431 ± 0.907
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.621 ± 0.68
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 3.241 ± 1.335
MetHis: 1.621 ± 0.68
MetIle: 2.431 ± 0.858
MetLys: 2.431 ± 0.959
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.81 ± 0.674
MetPro: 4.052 ± 0.756
MetGln: 0.81 ± 0.674
MetArg: 2.431 ± 0.458
MetSer: 3.241 ± 1.807
MetThr: 0.81 ± 0.859
MetVal: 2.431 ± 0.858
MetTrp: 0.0 ± 0.0
MetTyr: 0.81 ± 0.674
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.81 ± 0.525
AsnCys: 0.0 ± 0.0
AsnAsp: 2.431 ± 0.858
AsnGlu: 4.862 ± 2.213
AsnPhe: 2.431 ± 0.458
AsnGly: 0.81 ± 0.525
AsnHis: 0.0 ± 0.0
AsnIle: 1.621 ± 0.667
AsnLys: 3.241 ± 1.335
AsnLeu: 4.862 ± 1.399
AsnMet: 2.431 ± 1.574
AsnAsn: 1.621 ± 0.877
AsnPro: 1.621 ± 0.877
AsnGln: 2.431 ± 0.907
AsnArg: 4.862 ± 1.987
AsnSer: 4.052 ± 1.635
AsnThr: 4.052 ± 0.756
AsnVal: 4.862 ± 2.468
AsnTrp: 0.81 ± 0.859
AsnTyr: 2.431 ± 1.601
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.862 ± 2.35
ProCys: 0.81 ± 0.859
ProAsp: 3.241 ± 1.721
ProGlu: 2.431 ± 1.458
ProPhe: 2.431 ± 1.574
ProGly: 2.431 ± 1.574
ProHis: 0.81 ± 0.859
ProIle: 2.431 ± 1.871
ProLys: 2.431 ± 1.458
ProLeu: 2.431 ± 0.458
ProMet: 4.052 ± 0.479
ProAsn: 3.241 ± 1.442
ProPro: 4.052 ± 2.512
ProGln: 3.241 ± 0.93
ProArg: 2.431 ± 0.858
ProSer: 3.241 ± 1.442
ProThr: 5.673 ± 1.864
ProVal: 4.862 ± 1.717
ProTrp: 0.0 ± 0.0
ProTyr: 1.621 ± 0.68
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.052 ± 2.136
GlnCys: 0.0 ± 0.0
GlnAsp: 0.81 ± 0.859
GlnGlu: 6.483 ± 1.281
GlnPhe: 0.81 ± 0.859
GlnGly: 3.241 ± 1.504
GlnHis: 0.81 ± 0.525
GlnIle: 4.052 ± 1.609
GlnLys: 2.431 ± 1.678
GlnLeu: 4.052 ± 1.558
GlnMet: 3.241 ± 0.93
GlnAsn: 5.673 ± 1.735
GlnPro: 1.621 ± 0.86
GlnGln: 3.241 ± 0.924
GlnArg: 2.431 ± 1.307
GlnSer: 4.052 ± 2.922
GlnThr: 4.052 ± 1.333
GlnVal: 0.81 ± 0.898
GlnTrp: 0.81 ± 0.674
GlnTyr: 1.621 ± 0.667
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.862 ± 0.917
ArgCys: 0.0 ± 0.0
ArgAsp: 4.862 ± 0.911
ArgGlu: 4.862 ± 1.399
ArgPhe: 0.81 ± 0.525
ArgGly: 1.621 ± 0.68
ArgHis: 0.81 ± 0.525
ArgIle: 4.862 ± 1.36
ArgLys: 4.862 ± 1.749
ArgLeu: 6.483 ± 1.86
ArgMet: 0.81 ± 0.674
ArgAsn: 3.241 ± 1.205
ArgPro: 1.621 ± 0.68
ArgGln: 4.052 ± 2.922
ArgArg: 4.862 ± 1.68
ArgSer: 2.431 ± 1.601
ArgThr: 0.81 ± 0.859
ArgVal: 2.431 ± 0.994
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.052 ± 2.107
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.673 ± 0.879
SerCys: 0.0 ± 0.0
SerAsp: 5.673 ± 1.305
SerGlu: 3.241 ± 0.93
SerPhe: 3.241 ± 1.25
SerGly: 9.724 ± 2.469
SerHis: 0.81 ± 0.525
SerIle: 3.241 ± 1.205
SerLys: 4.052 ± 3.144
SerLeu: 6.483 ± 2.287
SerMet: 0.0 ± 0.0
SerAsn: 3.241 ± 1.205
SerPro: 2.431 ± 2.021
SerGln: 5.673 ± 3.668
SerArg: 1.621 ± 0.667
SerSer: 9.724 ± 3.533
SerThr: 3.241 ± 1.721
SerVal: 7.293 ± 2.9
SerTrp: 0.0 ± 0.0
SerTyr: 4.052 ± 0.756
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.431 ± 0.994
ThrCys: 0.0 ± 0.0
ThrAsp: 3.241 ± 1.605
ThrGlu: 3.241 ± 2.099
ThrPhe: 4.052 ± 0.479
ThrGly: 3.241 ± 1.25
ThrHis: 0.81 ± 0.525
ThrIle: 2.431 ± 0.994
ThrLys: 1.621 ± 1.719
ThrLeu: 7.293 ± 2.77
ThrMet: 0.81 ± 0.743
ThrAsn: 2.431 ± 1.234
ThrPro: 4.052 ± 1.714
ThrGln: 0.81 ± 0.674
ThrArg: 3.241 ± 1.526
ThrSer: 4.052 ± 0.756
ThrThr: 4.052 ± 1.071
ThrVal: 1.621 ± 1.082
ThrTrp: 1.621 ± 0.68
ThrTyr: 0.81 ± 0.859
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.483 ± 2.167
ValCys: 0.0 ± 0.0
ValAsp: 1.621 ± 1.049
ValGlu: 0.81 ± 0.525
ValPhe: 2.431 ± 1.574
ValGly: 2.431 ± 0.858
ValHis: 1.621 ± 0.68
ValIle: 1.621 ± 1.049
ValLys: 2.431 ± 0.458
ValLeu: 3.241 ± 1.346
ValMet: 2.431 ± 0.994
ValAsn: 2.431 ± 1.107
ValPro: 6.483 ± 1.846
ValGln: 1.621 ± 1.049
ValArg: 0.81 ± 0.674
ValSer: 6.483 ± 1.93
ValThr: 3.241 ± 1.605
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 2.431 ± 0.994
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.621 ± 1.049
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 1.621 ± 0.667
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.81 ± 0.525
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.621 ± 0.68
TrpGln: 0.81 ± 0.859
TrpArg: 0.0 ± 0.0
TrpSer: 0.81 ± 0.525
TrpThr: 0.81 ± 0.859
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.862 ± 0.911
TyrCys: 0.0 ± 0.0
TyrAsp: 1.621 ± 0.667
TyrGlu: 0.81 ± 0.674
TyrPhe: 0.81 ± 0.525
TyrGly: 1.621 ± 0.877
TyrHis: 2.431 ± 2.578
TyrIle: 1.621 ± 1.049
TyrLys: 0.81 ± 0.525
TyrLeu: 4.052 ± 1.714
TyrMet: 1.621 ± 0.877
TyrAsn: 4.052 ± 0.756
TyrPro: 0.0 ± 0.0
TyrGln: 3.241 ± 2.099
TyrArg: 1.621 ± 0.68
TyrSer: 3.241 ± 1.205
TyrThr: 3.241 ± 1.25
TyrVal: 1.621 ± 0.877
TyrTrp: 0.81 ± 0.525
TyrTyr: 5.673 ± 1.581
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1235 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
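A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the replicates is reported. The function names, the toy sequences, and the use of the standard deviation as the error measure are assumptions for illustration; the page does not specify which statistic is reported after the ± sign.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.pstdev(replicates)

toy = ["MKAALAA", "MSSGLR", "MAAKK", "MLRSS"]  # hypothetical sequences
mean = pair_permille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AlaAla: {mean:.3f} ± {err:.3f}")
```

With only a handful of proteins, as here, each resample can change the composition considerably, which is consistent with the relatively large errors seen in the table.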

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski