Amino acid dipeptide frequency for Tick-associated genomovirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
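The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in the table's column order (one-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Residues outside `alphabet` are mapped to 'X' (Xaa). Pairs are counted
    within each protein only (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    symbols = alphabet + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(symbols, repeat=2)}

# Hypothetical toy sequences, not the real proteome.
toy = ["MAAGK", "GGAX"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
uniform = 1000.0 / 400
print(len(freqs), round(freqs["GG"], 3), uniform)
```

The returned dictionary always has 441 entries (21 x 21), so dipeptides that never occur appear with a frequency of 0.0, as in the table.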

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
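As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table on this page into fractions and percentages and compares them with the uniform expectation of 0.25% (2.5‰). The helper name `per_mille_to_fraction` is hypothetical, and the comparison with a uniform baseline is only the simple expectation stated above, not necessarily the exact rule used for the red/blue colouring.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

# Three values copied from the table on this page.
table = {"GlyGly": 16.683, "ValAsp": 12.758, "AlaAla": 0.981}

for name, pm in table.items():
    frac = per_mille_to_fraction(pm)
    ratio = pm / UNIFORM_PER_MILLE
    label = "over" if ratio > 1 else "under"
    print(f"{name}: {pm} per mille = {frac:.6f} = {frac * 100:.4f} % "
          f"({ratio:.2f}x uniform, {label}represented)")
```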

For more information see sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.981 ± 0.961 | 0.0 ± 0.0 | 1.963 ± 1.921 | 1.963 ± 0.86 | 3.925 ± 1.721 | 4.907 ± 0.978 | 1.963 ± 0.86 | 5.888 ± 0.278 | 3.925 ± 0.627 | 2.944 ± 1.295 | 0.981 ± 0.961 | 9.814 ± 2.773 | 1.963 ± 1.921 | 6.869 ± 1.629 | 7.851 ± 0.701 | 7.851 ± 2.139 | 3.925 ± 3.843 | 8.832 ± 2.488 | 0.0 ± 0.0 | 2.944 ± 1.769 | 0.0 ± 0.0
Cys | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 0.86 | 2.944 ± 1.769 | 1.963 ± 0.86 | 1.963 ± 0.86 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 5.888 ± 0.278 | 0.0 ± 0.0 | 6.869 ± 1.629 | 4.907 ± 0.772 | 1.963 ± 0.86 | 5.888 ± 0.278 | 3.925 ± 1.721 | 4.907 ± 0.772 | 1.963 ± 1.921 | 2.944 ± 0.139 | 1.963 ± 0.86 | 0.0 ± 0.0 | 1.963 ± 0.86 | 1.963 ± 0.86 | 0.981 ± 0.961 | 0.0 ± 0.0 | 3.925 ± 1.07 | 7.851 ± 3.442 | 4.907 ± 0.772 | 3.925 ± 1.07 | 0.0 ± 0.0
Glu | 4.907 ± 0.978 | 3.925 ± 1.721 | 0.981 ± 0.961 | 1.963 ± 0.86 | 5.888 ± 2.581 | 1.963 ± 0.86 | 0.0 ± 0.0 | 2.944 ± 0.139 | 3.925 ± 1.07 | 0.0 ± 0.0 | 4.907 ± 0.61 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.925 ± 1.07 | 1.963 ± 0.86 | 5.888 ± 2.581 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.0 ± 0.0
Phe | 2.944 ± 0.139 | 3.925 ± 1.721 | 3.925 ± 1.721 | 0.0 ± 0.0 | 1.963 ± 0.86 | 5.888 ± 2.581 | 3.925 ± 1.721 | 1.963 ± 0.86 | 2.944 ± 0.139 | 1.963 ± 0.86 | 0.981 ± 0.961 | 0.981 ± 0.961 | 0.0 ± 0.0 | 1.963 ± 1.499 | 4.907 ± 0.772 | 2.944 ± 0.139 | 0.981 ± 0.961 | 1.963 ± 0.86 | 1.963 ± 0.86 | 2.944 ± 0.139 | 0.0 ± 0.0
Gly | 5.888 ± 0.278 | 0.0 ± 0.0 | 6.869 ± 1.629 | 6.869 ± 1.629 | 0.0 ± 0.0 | 16.683 ± 4.36 | 0.0 ± 0.0 | 3.925 ± 1.721 | 4.907 ± 2.028 | 6.869 ± 1.629 | 3.925 ± 3.843 | 2.944 ± 2.882 | 6.869 ± 1.793 | 1.963 ± 0.86 | 9.814 ± 2.773 | 2.944 ± 0.139 | 6.869 ± 2.889 | 5.888 ± 0.278 | 1.963 ± 0.86 | 2.944 ± 0.139 | 0.0 ± 0.0
His | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.981 ± 0.961 | 0.0 ± 0.0 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 7.851 ± 3.442 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.907 ± 0.772 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0 | 3.925 ± 1.721 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 5.888 ± 0.278 | 0.981 ± 0.961 | 1.963 ± 0.86 | 3.925 ± 1.07 | 1.963 ± 0.86 | 0.981 ± 0.961 | 1.963 ± 0.86 | 0.981 ± 0.961 | 3.925 ± 1.721 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 1.921 | 1.963 ± 1.921 | 3.925 ± 1.07 | 2.944 ± 1.769 | 1.963 ± 1.921 | 4.907 ± 2.028 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0
Lys | 0.981 ± 0.749 | 0.981 ± 0.749 | 4.907 ± 0.772 | 0.0 ± 0.0 | 2.944 ± 0.139 | 8.832 ± 0.418 | 0.981 ± 0.961 | 0.981 ± 0.961 | 2.944 ± 2.882 | 0.981 ± 0.961 | 0.981 ± 1.652 | 3.925 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.944 ± 2.882 | 3.925 ± 1.721 | 2.944 ± 0.139 | 0.0 ± 0.0 | 1.963 ± 0.86 | 8.832 ± 0.418 | 0.0 ± 0.0
Leu | 6.869 ± 1.629 | 1.963 ± 0.86 | 5.888 ± 2.581 | 2.944 ± 0.139 | 4.907 ± 0.772 | 5.888 ± 1.16 | 1.963 ± 0.86 | 2.944 ± 0.139 | 0.981 ± 0.961 | 2.944 ± 1.295 | 0.0 ± 0.0 | 4.907 ± 0.978 | 2.944 ± 0.139 | 0.981 ± 0.749 | 2.944 ± 1.43 | 7.851 ± 0.701 | 3.925 ± 1.07 | 2.944 ± 0.139 | 0.0 ± 0.0 | 2.944 ± 0.139 | 0.0 ± 0.0
Met | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0 | 1.963 ± 1.921 | 0.0 ± 0.0 | 4.907 ± 0.978 | 0.0 ± 0.0 | 0.981 ± 0.961 | 1.963 ± 1.921 | 0.0 ± 0.0 | 2.944 ± 0.139 | 0.981 ± 0.961 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.925 ± 0.627 | 1.963 ± 0.86 | 1.963 ± 0.86 | 0.981 ± 0.961 | 1.963 ± 0.86 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.981 ± 0.961 | 1.963 ± 1.921 | 2.944 ± 2.882 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.981 ± 0.961 | 6.869 ± 0.514 | 2.944 ± 1.295 | 2.944 ± 0.139 | 4.907 ± 0.772 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0
Pro | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.981 ± 0.961 | 1.963 ± 0.86 | 0.981 ± 0.749 | 1.963 ± 0.86 | 2.944 ± 0.139 | 3.925 ± 1.07 | 4.907 ± 0.772 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.888 ± 0.278 | 3.925 ± 1.721 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.0 ± 0.0
Gln | 0.981 ± 0.961 | 1.963 ± 0.86 | 1.963 ± 0.86 | 1.963 ± 0.86 | 1.963 ± 0.86 | 2.944 ± 1.295 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.981 ± 0.749 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.0 ± 0.0 | 4.907 ± 0.772 | 0.981 ± 0.961 | 0.981 ± 0.961 | 3.925 ± 0.627 | 0.981 ± 0.961 | 0.0 ± 0.0
Arg | 1.963 ± 0.96 | 0.0 ± 0.0 | 4.907 ± 0.772 | 6.869 ± 2.889 | 1.963 ± 1.921 | 10.795 ± 0.652 | 1.963 ± 0.86 | 1.963 ± 1.921 | 2.944 ± 0.139 | 0.981 ± 0.961 | 0.0 ± 0.0 | 1.963 ± 0.96 | 5.888 ± 1.629 | 0.0 ± 0.0 | 8.832 ± 3.601 | 10.795 ± 4.576 | 5.888 ± 2.988 | 4.907 ± 2.028 | 0.981 ± 0.961 | 2.944 ± 0.139 | 0.0 ± 0.0
Ser | 4.907 ± 2.028 | 0.0 ± 0.0 | 4.907 ± 0.772 | 0.981 ± 0.961 | 1.963 ± 0.86 | 9.814 ± 1.545 | 0.0 ± 0.0 | 1.963 ± 0.96 | 8.832 ± 1.026 | 5.888 ± 2.581 | 0.981 ± 0.749 | 3.925 ± 1.07 | 1.963 ± 0.86 | 0.981 ± 0.961 | 6.869 ± 1.629 | 4.907 ± 2.028 | 2.944 ± 0.139 | 3.925 ± 2.68 | 1.963 ± 0.86 | 5.888 ± 0.278 | 0.0 ± 0.0
Thr | 2.944 ± 2.882 | 0.981 ± 0.961 | 0.981 ± 0.961 | 0.981 ± 0.961 | 1.963 ± 0.86 | 4.907 ± 0.772 | 0.0 ± 0.0 | 1.963 ± 1.921 | 1.963 ± 1.921 | 7.851 ± 1.014 | 0.981 ± 0.961 | 0.981 ± 0.961 | 3.925 ± 1.721 | 1.963 ± 0.86 | 3.925 ± 1.07 | 3.925 ± 1.07 | 5.888 ± 2.988 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.925 ± 1.07 | 0.0 ± 0.0
Val | 5.888 ± 0.278 | 0.0 ± 0.0 | 12.758 ± 1.468 | 1.963 ± 0.86 | 7.851 ± 3.442 | 1.963 ± 1.921 | 1.963 ± 0.86 | 0.981 ± 0.961 | 2.944 ± 0.139 | 6.869 ± 0.514 | 0.0 ± 0.0 | 1.963 ± 0.86 | 1.963 ± 0.86 | 0.0 ± 0.0 | 2.944 ± 1.665 | 5.888 ± 1.16 | 0.981 ± 0.961 | 5.888 ± 2.581 | 0.981 ± 0.961 | 2.944 ± 0.139 | 0.0 ± 0.0
Trp | 4.907 ± 2.067 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 1.921 | 0.0 ± 0.0 | 1.963 ± 0.86 | 5.888 ± 2.581 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 1.921 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 11.776 ± 2.4 | 0.0 ± 0.0 | 3.925 ± 1.07 | 1.963 ± 0.86 | 3.925 ± 1.07 | 8.832 ± 1.026 | 0.981 ± 0.961 | 1.963 ± 1.921 | 0.0 ± 0.0 | 2.944 ± 0.139 | 0.0 ± 0.0 | 0.981 ± 0.961 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.963 ± 1.921 | 1.963 ± 1.921 | 0.0 ± 0.0 | 1.963 ± 0.86 | 0.981 ± 0.961 | 0.981 ± 0.961 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (1020 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
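One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of the 100 estimates. The sketch below follows that reading; it is an assumption about the procedure, not a description of the actual Proteome-pI code. The function names, the fixed seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Hypothetical toy sequences, not the real proteome.
toy = ["MAAGK", "GGAX", "AAAA"]
print(pair_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as here, each resample contains few distinct combinations, so the estimated error can be large relative to the frequency itself.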

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski