Amino acid dipeptide frequency for Tortoise microvirus 91

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
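The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (Xaa / "X" is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    pairs = ("".join(p) for p in product(AMINO_ACIDS, repeat=2))
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2

# Made-up toy sequences, not the Tortoise microvirus 91 proteome.
toy = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], EXPECTED_PERMILLE)
```

In the toy input there are five dipeptides in total (MA, AA, AK, AA, AG), so AA accounts for two of five. With real proteomes the values are much smaller, which is why the table reports them in per mille.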

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
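As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla at 5.824‰ and CysCys at 0.0‰) into fractions and compares them with the uniform expectation. The helper names are hypothetical, and using exactly 2.5‰ (the 400-dipeptide expectation) as the over/under threshold is an assumption; with Xaa included the expectation would be 1000/441, about 2.27‰.

```python
UNIFORM_PERMILLE = 2.5  # 1/400 expressed in per mille (assumed threshold)


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Classify a dipeptide relative to the expected per mille value."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


ala_ala = 5.824  # AlaAla, taken from the table
cys_cys = 0.0    # CysCys, taken from the table

print(permille_to_fraction(ala_ala), representation(ala_ala))
print(permille_to_fraction(cys_cys), representation(cys_cys))
```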

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.824 ± 1.449 | 0.832 ± 0.74 | 4.992 ± 2.061 | 3.328 ± 0.861 | 3.328 ± 1.777 | 6.656 ± 2.125 | 0.832 ± 0.552 | 2.496 ± 2.352 | 4.16 ± 2.352 | 7.488 ± 1.715 | 1.664 ± 0.605 | 7.488 ± 3.285 | 4.16 ± 0.974 | 4.992 ± 3.846 | 5.824 ± 2.359 | 1.664 ± 0.968 | 3.328 ± 1.303 | 4.16 ± 2.76 | 2.496 ± 1.334 | 4.992 ± 2.052 | 0.0 ± 0.0
Cys | 0.832 ± 0.74 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.832 ± 0.552 | 0.832 ± 0.74 | 0.832 ± 0.74 | 1.664 ± 0.704 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.832 ± 0.552 | 0.832 ± 0.74 | 0.832 ± 0.74 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.832 ± 0.74 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 3.328 ± 1.228 | 0.832 ± 0.74 | 4.992 ± 3.272 | 3.328 ± 2.125 | 2.496 ± 1.029 | 3.328 ± 1.303 | 0.0 ± 0.0 | 0.832 ± 0.552 | 3.328 ± 1.182 | 4.992 ± 1.752 | 0.832 ± 0.784 | 0.832 ± 0.784 | 6.656 ± 3.423 | 1.664 ± 1.168 | 4.992 ± 3.466 | 2.496 ± 0.576 | 2.496 ± 1.128 | 4.992 ± 2.255 | 1.664 ± 0.704 | 3.328 ± 2.208 | 0.0 ± 0.0
Glu | 4.16 ± 1.979 | 0.832 ± 0.552 | 2.496 ± 1.399 | 4.992 ± 1.684 | 0.0 ± 0.0 | 4.16 ± 0.65 | 2.496 ± 1.026 | 4.16 ± 2.057 | 0.0 ± 0.0 | 3.328 ± 1.048 | 1.664 ± 1.1 | 3.328 ± 1.21 | 0.832 ± 0.552 | 9.151 ± 3.719 | 4.16 ± 1.922 | 2.496 ± 2.352 | 3.328 ± 1.048 | 2.496 ± 1.399 | 0.832 ± 0.552 | 4.16 ± 0.974 | 0.0 ± 0.0
Phe | 6.656 ± 2.71 | 0.0 ± 0.0 | 3.328 ± 2.044 | 0.832 ± 0.552 | 1.664 ± 0.704 | 2.496 ± 1.656 | 0.832 ± 0.552 | 0.832 ± 0.552 | 2.496 ± 1.292 | 2.496 ± 1.287 | 0.832 ± 0.74 | 4.16 ± 1.467 | 0.832 ± 0.552 | 0.832 ± 0.552 | 2.496 ± 1.026 | 0.832 ± 0.74 | 4.16 ± 1.999 | 3.328 ± 1.18 | 0.0 ± 0.0 | 0.832 ± 1.176 | 0.0 ± 0.0
Gly | 4.992 ± 1.705 | 0.832 ± 0.74 | 4.992 ± 2.052 | 3.328 ± 1.228 | 1.664 ± 0.704 | 4.16 ± 2.976 | 0.832 ± 0.552 | 6.656 ± 2.605 | 3.328 ± 1.407 | 4.992 ± 1.411 | 1.664 ± 1.104 | 4.16 ± 0.921 | 0.832 ± 1.176 | 3.328 ± 1.21 | 2.496 ± 0.576 | 3.328 ± 1.048 | 3.328 ± 1.21 | 1.664 ± 0.704 | 0.0 ± 0.0 | 1.664 ± 1.104 | 0.0 ± 0.0
His | 0.832 ± 1.176 | 0.0 ± 0.0 | 2.496 ± 1.026 | 0.0 ± 0.0 | 0.832 ± 0.552 | 2.496 ± 1.026 | 0.0 ± 0.0 | 0.832 ± 0.74 | 1.664 ± 1.48 | 0.832 ± 0.552 | 0.0 ± 0.0 | 1.664 ± 0.968 | 0.832 ± 0.784 | 0.0 ± 0.0 | 2.496 ± 1.399 | 1.664 ± 0.704 | 0.0 ± 0.0 | 2.496 ± 1.334 | 0.0 ± 0.0 | 1.664 ± 0.704 | 0.0 ± 0.0
Ile | 4.992 ± 2.075 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.824 ± 2.437 | 2.496 ± 1.656 | 2.496 ± 1.656 | 2.496 ± 1.029 | 1.664 ± 0.605 | 1.664 ± 0.968 | 4.992 ± 0.754 | 3.328 ± 1.663 | 1.664 ± 0.605 | 6.656 ± 2.653 | 2.496 ± 1.029 | 2.496 ± 0.576 | 4.16 ± 0.974 | 4.16 ± 2.001 | 1.664 ± 1.283 | 0.0 ± 0.0 | 1.664 ± 0.704 | 0.0 ± 0.0
Lys | 4.16 ± 1.495 | 0.832 ± 0.74 | 2.496 ± 2.347 | 2.496 ± 0.853 | 2.496 ± 1.656 | 4.992 ± 2.255 | 0.832 ± 0.74 | 3.328 ± 2.04 | 4.992 ± 3.139 | 1.664 ± 1.104 | 4.16 ± 1.108 | 4.16 ± 1.495 | 4.16 ± 0.65 | 4.16 ± 1.2 | 3.328 ± 2.336 | 1.664 ± 0.605 | 4.992 ± 2.291 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.496 ± 0.576 | 0.0 ± 0.0
Leu | 7.488 ± 2.896 | 0.0 ± 0.0 | 5.824 ± 3.133 | 3.328 ± 1.182 | 2.496 ± 2.219 | 4.16 ± 0.921 | 0.0 ± 0.0 | 3.328 ± 0.579 | 4.992 ± 1.61 | 1.664 ± 0.605 | 1.664 ± 0.704 | 5.824 ± 1.449 | 4.992 ± 2.135 | 4.16 ± 1.212 | 6.656 ± 1.062 | 7.488 ± 1.784 | 4.16 ± 1.671 | 2.496 ± 1.287 | 0.832 ± 0.552 | 0.832 ± 0.552 | 0.0 ± 0.0
Met | 4.16 ± 0.921 | 0.0 ± 0.0 | 4.16 ± 1.372 | 2.496 ± 1.029 | 0.832 ± 0.552 | 3.328 ± 1.303 | 1.664 ± 0.704 | 1.664 ± 1.104 | 0.832 ± 1.176 | 0.0 ± 0.0 | 0.832 ± 0.784 | 3.328 ± 1.21 | 4.992 ± 1.474 | 2.496 ± 1.534 | 0.832 ± 1.176 | 2.496 ± 1.334 | 0.832 ± 0.74 | 0.832 ± 0.552 | 0.0 ± 0.0 | 1.664 ± 0.968 | 0.0 ± 0.0
Asn | 1.664 ± 0.605 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.992 ± 1.815 | 2.496 ± 2.352 | 4.992 ± 1.138 | 1.664 ± 0.968 | 2.496 ± 1.656 | 2.496 ± 1.656 | 4.992 ± 2.742 | 2.496 ± 0.853 | 1.664 ± 1.568 | 5.824 ± 1.907 | 1.664 ± 0.968 | 4.992 ± 2.687 | 4.992 ± 0.754 | 4.16 ± 2.816 | 3.328 ± 1.182 | 2.496 ± 1.026 | 1.664 ± 0.704 | 0.0 ± 0.0
Pro | 4.992 ± 1.836 | 0.832 ± 0.74 | 4.16 ± 4.597 | 0.832 ± 0.552 | 4.16 ± 2.517 | 3.328 ± 1.49 | 0.832 ± 0.74 | 3.328 ± 1.182 | 5.824 ± 2.526 | 3.328 ± 0.861 | 4.992 ± 2.052 | 4.16 ± 1.019 | 4.16 ± 2.517 | 3.328 ± 1.407 | 2.496 ± 1.026 | 7.488 ± 3.032 | 1.664 ± 1.104 | 1.664 ± 1.104 | 0.832 ± 0.552 | 1.664 ± 0.605 | 0.0 ± 0.0
Gln | 4.16 ± 1.853 | 0.0 ± 0.0 | 4.16 ± 1.2 | 2.496 ± 0.576 | 2.496 ± 1.599 | 2.496 ± 0.853 | 0.832 ± 0.74 | 3.328 ± 2.411 | 3.328 ± 2.336 | 6.656 ± 3.125 | 2.496 ± 0.576 | 2.496 ± 1.287 | 0.832 ± 0.552 | 4.992 ± 2.687 | 7.488 ± 2.887 | 2.496 ± 1.029 | 3.328 ± 1.936 | 1.664 ± 1.568 | 0.0 ± 0.0 | 1.664 ± 0.704 | 0.0 ± 0.0
Arg | 8.319 ± 2.252 | 0.832 ± 0.552 | 4.16 ± 2.09 | 7.488 ± 4.959 | 2.496 ± 0.853 | 0.0 ± 0.0 | 0.832 ± 1.176 | 2.496 ± 0.576 | 4.16 ± 1.467 | 5.824 ± 1.449 | 2.496 ± 1.037 | 1.664 ± 0.605 | 3.328 ± 0.579 | 3.328 ± 1.709 | 3.328 ± 1.303 | 1.664 ± 0.605 | 3.328 ± 2.044 | 2.496 ± 0.576 | 0.832 ± 0.74 | 3.328 ± 1.407 | 0.0 ± 0.0
Ser | 6.656 ± 1.802 | 0.0 ± 0.0 | 4.16 ± 1.999 | 4.16 ± 1.467 | 3.328 ± 1.18 | 1.664 ± 0.605 | 1.664 ± 1.104 | 4.16 ± 2.517 | 2.496 ± 1.534 | 6.656 ± 1.384 | 2.496 ± 0.853 | 2.496 ± 0.853 | 0.0 ± 0.0 | 4.16 ± 3.123 | 3.328 ± 2.044 | 1.664 ± 0.605 | 3.328 ± 1.228 | 2.496 ± 0.853 | 0.0 ± 0.0 | 0.832 ± 0.784 | 0.0 ± 0.0
Thr | 4.992 ± 2.575 | 0.832 ± 0.74 | 1.664 ± 1.104 | 2.496 ± 1.534 | 1.664 ± 1.104 | 2.496 ± 1.656 | 0.0 ± 0.0 | 8.319 ± 2.019 | 3.328 ± 1.18 | 6.656 ± 1.739 | 0.832 ± 0.74 | 3.328 ± 2.044 | 5.824 ± 1.933 | 1.664 ± 0.605 | 1.664 ± 1.104 | 5.824 ± 2.498 | 4.992 ± 0.953 | 2.496 ± 1.287 | 0.0 ± 0.0 | 0.832 ± 0.74 | 0.0 ± 0.0
Val | 1.664 ± 1.104 | 0.0 ± 0.0 | 0.832 ± 0.74 | 3.328 ± 1.777 | 0.0 ± 0.0 | 1.664 ± 1.277 | 0.832 ± 0.74 | 1.664 ± 0.968 | 4.992 ± 0.918 | 2.496 ± 1.026 | 2.496 ± 1.287 | 3.328 ± 0.861 | 5.824 ± 2.498 | 1.664 ± 0.605 | 1.664 ± 0.605 | 0.832 ± 0.784 | 4.992 ± 1.474 | 0.832 ± 0.552 | 0.0 ± 0.0 | 3.328 ± 1.18 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.832 ± 0.552 | 1.664 ± 1.104 | 0.832 ± 0.552 | 0.832 ± 0.74 | 1.664 ± 0.704 | 0.832 ± 0.552 | 0.832 ± 0.74 | 0.832 ± 0.552 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.664 ± 0.704 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.664 ± 0.704 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.832 ± 0.552 | 1.664 ± 0.704 | 1.664 ± 0.704 | 0.832 ± 0.784 | 3.328 ± 1.49 | 1.664 ± 0.968 | 0.832 ± 0.74 | 2.496 ± 0.576 | 2.496 ± 0.576 | 2.496 ± 2.219 | 1.664 ± 1.104 | 2.496 ± 0.853 | 0.832 ± 1.176 | 2.496 ± 1.656 | 2.496 ± 0.853 | 0.832 ± 0.74 | 2.496 ± 1.026 | 3.328 ± 1.407 | 1.664 ± 1.104 | 4.16 ± 1.671 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 4 proteins (1203 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
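Protein-level bootstrapping can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The exact procedure used for the table is not described here, so the function names, the fixed seed, the use of the population standard deviation as the error measure, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the per mille frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Made-up toy proteome of four short sequences, not the real proteins.
toy = ["MAAK", "AAG", "KKAA", "GAMA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only four proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the relatively large errors in the table.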

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski