Amino acid dipeptide frequency for Tortoise microvirus 40

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
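As an illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes sequences are given as one-letter-code strings and that pairs are counted with overlapping windows that never span two proteins; the exact counting convention used by the database is not stated on this page, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption, not a documented rule of the database).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Tortoise microvirus 40 proteins.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

Here `freqs["KK"]` is the share of all counted dipeptides that are Lys-Lys, expressed in per mille.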

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 standard amino acids were equally frequent, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
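The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the threshold rule (a plain comparison with 1/400) is an assumption, since the page does not say exactly how the red and blue marking is decided, and the function names are hypothetical. The three example values are taken from the table.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation.

    Assumption: a simple threshold; the database may use another rule.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# Values copied from the table (per mille).
examples = {"AlaAla": 6.325, "AlaCys": 0.633, "SerSer": 16.445}
labels = {pair: classify(v) for pair, v in examples.items()}
```

Under this assumed rule, AlaAla and SerSer count as overrepresented and AlaCys as underrepresented.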

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.325AlaAla: 6.325 ± 3.825
0.633AlaCys: 0.633 ± 0.495
3.795AlaAsp: 3.795 ± 1.719
5.693AlaGlu: 5.693 ± 2.934
3.795AlaPhe: 3.795 ± 1.63
3.795AlaGly: 3.795 ± 2.614
0.633AlaHis: 0.633 ± 0.725
2.53AlaIle: 2.53 ± 0.898
4.428AlaLys: 4.428 ± 0.809
4.428AlaLeu: 4.428 ± 1.526
0.633AlaMet: 0.633 ± 0.495
3.795AlaAsn: 3.795 ± 0.72
1.898AlaPro: 1.898 ± 1.307
8.223AlaGln: 8.223 ± 6.465
2.53AlaArg: 2.53 ± 0.492
4.428AlaSer: 4.428 ± 2.178
4.428AlaThr: 4.428 ± 2.3
2.53AlaVal: 2.53 ± 1.029
2.53AlaTrp: 2.53 ± 0.598
4.428AlaTyr: 4.428 ± 0.597
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.633CysAsp: 0.633 ± 0.495
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.633CysGly: 0.633 ± 0.495
0.633CysHis: 0.633 ± 0.45
0.633CysIle: 0.633 ± 0.45
0.633CysLys: 0.633 ± 0.495
1.265CysLeu: 1.265 ± 0.502
0.0CysMet: 0.0 ± 0.0
1.265CysAsn: 1.265 ± 0.502
0.633CysPro: 0.633 ± 0.495
1.898CysGln: 1.898 ± 1.486
0.0CysArg: 0.0 ± 0.0
3.163CysSer: 3.163 ± 1.435
3.163CysThr: 3.163 ± 1.829
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.633CysTyr: 0.633 ± 0.495
0.0CysXaa: 0.0 ± 0.0
Asp
3.795AspAla: 3.795 ± 1.941
1.265AspCys: 1.265 ± 0.991
1.898AspAsp: 1.898 ± 1.614
3.795AspGlu: 3.795 ± 2.315
1.898AspPhe: 1.898 ± 0.815
1.898AspGly: 1.898 ± 1.351
0.0AspHis: 0.0 ± 0.0
4.428AspIle: 4.428 ± 1.126
2.53AspLys: 2.53 ± 1.982
10.12AspLeu: 10.12 ± 1.848
2.53AspMet: 2.53 ± 0.492
3.163AspAsn: 3.163 ± 1.461
1.898AspPro: 1.898 ± 0.999
4.428AspGln: 4.428 ± 1.497
3.163AspArg: 3.163 ± 1.435
5.693AspSer: 5.693 ± 1.99
2.53AspThr: 2.53 ± 1.004
3.795AspVal: 3.795 ± 1.506
0.0AspTrp: 0.0 ± 0.0
5.06AspTyr: 5.06 ± 2.179
0.0AspXaa: 0.0 ± 0.0
Glu
3.795GluAla: 3.795 ± 2.515
0.0GluCys: 0.0 ± 0.0
2.53GluAsp: 2.53 ± 1.35
0.633GluGlu: 0.633 ± 0.786
5.693GluPhe: 5.693 ± 1.72
0.633GluGly: 0.633 ± 0.725
0.633GluHis: 0.633 ± 0.725
3.163GluIle: 3.163 ± 0.702
4.428GluLys: 4.428 ± 0.809
2.53GluLeu: 2.53 ± 1.477
1.898GluMet: 1.898 ± 0.651
2.53GluAsn: 2.53 ± 0.566
0.633GluPro: 0.633 ± 0.786
0.633GluGln: 0.633 ± 0.45
2.53GluArg: 2.53 ± 1.298
4.428GluSer: 4.428 ± 2.223
3.163GluThr: 3.163 ± 0.702
2.53GluVal: 2.53 ± 0.898
0.0GluTrp: 0.0 ± 0.0
2.53GluTyr: 2.53 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
3.795PheAla: 3.795 ± 0.748
0.633PheCys: 0.633 ± 0.495
5.693PheAsp: 5.693 ± 1.99
1.898PheGlu: 1.898 ± 1.22
3.795PhePhe: 3.795 ± 1.474
5.06PheGly: 5.06 ± 0.733
1.265PheHis: 1.265 ± 0.502
1.898PheIle: 1.898 ± 1.22
4.428PheLys: 4.428 ± 0.791
3.795PheLeu: 3.795 ± 1.366
1.265PheMet: 1.265 ± 0.655
4.428PheAsn: 4.428 ± 1.441
2.53PhePro: 2.53 ± 1.004
1.898PheGln: 1.898 ± 0.89
1.265PheArg: 1.265 ± 0.502
4.428PheSer: 4.428 ± 1.126
3.795PheThr: 3.795 ± 2.082
3.795PheVal: 3.795 ± 1.366
0.0PheTrp: 0.0 ± 0.0
5.693PheTyr: 5.693 ± 1.703
0.0PheXaa: 0.0 ± 0.0
Gly
3.795GlyAla: 3.795 ± 2.464
0.0GlyCys: 0.0 ± 0.0
5.693GlyAsp: 5.693 ± 1.257
1.898GlyGlu: 1.898 ± 0.89
3.795GlyPhe: 3.795 ± 1.63
1.265GlyGly: 1.265 ± 0.655
1.265GlyHis: 1.265 ± 0.991
1.265GlyIle: 1.265 ± 0.502
1.898GlyLys: 1.898 ± 0.36
4.428GlyLeu: 4.428 ± 2.095
1.265GlyMet: 1.265 ± 0.655
3.163GlyAsn: 3.163 ± 0.872
0.633GlyPro: 0.633 ± 0.495
2.53GlyGln: 2.53 ± 0.566
2.53GlyArg: 2.53 ± 0.492
5.06GlySer: 5.06 ± 2.621
1.898GlyThr: 1.898 ± 1.307
5.06GlyVal: 5.06 ± 1.946
0.0GlyTrp: 0.0 ± 0.0
1.265GlyTyr: 1.265 ± 0.9
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.9
1.265HisCys: 1.265 ± 0.502
0.0HisAsp: 0.0 ± 0.0
0.633HisGlu: 0.633 ± 0.725
2.53HisPhe: 2.53 ± 1.981
0.633HisGly: 0.633 ± 0.45
0.0HisHis: 0.0 ± 0.0
1.265HisIle: 1.265 ± 0.649
0.633HisLys: 0.633 ± 0.495
0.633HisLeu: 0.633 ± 0.495
0.0HisMet: 0.0 ± 0.0
0.633HisAsn: 0.633 ± 0.495
0.633HisPro: 0.633 ± 0.495
0.633HisGln: 0.633 ± 0.45
0.633HisArg: 0.633 ± 0.495
0.633HisSer: 0.633 ± 0.495
1.898HisThr: 1.898 ± 0.899
0.633HisVal: 0.633 ± 0.495
0.633HisTrp: 0.633 ± 0.45
0.633HisTyr: 0.633 ± 0.495
0.0HisXaa: 0.0 ± 0.0
Ile
6.325IleAla: 6.325 ± 3.65
0.633IleCys: 0.633 ± 0.786
4.428IleAsp: 4.428 ± 0.922
2.53IleGlu: 2.53 ± 0.598
1.898IlePhe: 1.898 ± 0.815
2.53IleGly: 2.53 ± 1.029
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
1.265IleLys: 1.265 ± 0.991
1.898IleLeu: 1.898 ± 0.89
1.265IleMet: 1.265 ± 0.9
1.898IleAsn: 1.898 ± 0.859
1.898IlePro: 1.898 ± 0.89
1.898IleGln: 1.898 ± 1.575
0.633IleArg: 0.633 ± 0.786
6.325IleSer: 6.325 ± 1.575
1.898IleThr: 1.898 ± 0.815
1.265IleVal: 1.265 ± 0.899
0.0IleTrp: 0.0 ± 0.0
4.428IleTyr: 4.428 ± 0.791
0.0IleXaa: 0.0 ± 0.0
Lys
3.795LysAla: 3.795 ± 1.966
0.0LysCys: 0.0 ± 0.0
2.53LysAsp: 2.53 ± 1.205
5.693LysGlu: 5.693 ± 1.206
1.898LysPhe: 1.898 ± 0.774
3.163LysGly: 3.163 ± 0.949
3.163LysHis: 3.163 ± 1.829
3.163LysIle: 3.163 ± 1.001
2.53LysLys: 2.53 ± 1.298
4.428LysLeu: 4.428 ± 1.841
3.163LysMet: 3.163 ± 3.438
2.53LysAsn: 2.53 ± 1.004
0.0LysPro: 0.0 ± 0.0
5.06LysGln: 5.06 ± 3.18
3.795LysArg: 3.795 ± 0.442
3.163LysSer: 3.163 ± 1.616
4.428LysThr: 4.428 ± 1.36
4.428LysVal: 4.428 ± 2.022
0.633LysTrp: 0.633 ± 0.45
1.898LysTyr: 1.898 ± 1.285
0.0LysXaa: 0.0 ± 0.0
Leu
6.325LeuAla: 6.325 ± 0.652
1.265LeuCys: 1.265 ± 0.502
6.958LeuAsp: 6.958 ± 0.8
2.53LeuGlu: 2.53 ± 0.566
7.59LeuPhe: 7.59 ± 2.526
5.693LeuGly: 5.693 ± 0.222
0.633LeuHis: 0.633 ± 0.495
5.693LeuIle: 5.693 ± 2.934
3.163LeuLys: 3.163 ± 1.461
6.958LeuLeu: 6.958 ± 2.36
2.53LeuMet: 2.53 ± 2.393
6.958LeuAsn: 6.958 ± 2.326
3.163LeuPro: 3.163 ± 1.604
5.06LeuGln: 5.06 ± 1.328
4.428LeuArg: 4.428 ± 0.55
8.223LeuSer: 8.223 ± 1.216
2.53LeuThr: 2.53 ± 0.598
3.163LeuVal: 3.163 ± 0.487
0.633LeuTrp: 0.633 ± 0.45
1.898LeuTyr: 1.898 ± 0.899
0.0LeuXaa: 0.0 ± 0.0
Met
3.163MetAla: 3.163 ± 2.021
1.265MetCys: 1.265 ± 0.9
2.53MetAsp: 2.53 ± 2.01
0.0MetGlu: 0.0 ± 0.0
2.53MetPhe: 2.53 ± 0.893
0.0MetGly: 0.0 ± 0.0
0.633MetHis: 0.633 ± 0.495
1.265MetIle: 1.265 ± 0.899
0.633MetLys: 0.633 ± 0.495
1.265MetLeu: 1.265 ± 1.012
0.633MetMet: 0.633 ± 0.725
1.898MetAsn: 1.898 ± 1.307
1.265MetPro: 1.265 ± 0.655
1.265MetGln: 1.265 ± 0.655
1.265MetArg: 1.265 ± 0.655
2.53MetSer: 2.53 ± 1.494
0.0MetThr: 0.0 ± 0.0
2.53MetVal: 2.53 ± 0.566
0.0MetTrp: 0.0 ± 0.0
1.265MetTyr: 1.265 ± 0.655
0.0MetXaa: 0.0 ± 0.0
Asn
2.53AsnAla: 2.53 ± 1.311
0.633AsnCys: 0.633 ± 0.495
3.795AsnAsp: 3.795 ± 0.85
1.898AsnGlu: 1.898 ± 0.774
3.795AsnPhe: 3.795 ± 2.024
5.06AsnGly: 5.06 ± 1.543
0.0AsnHis: 0.0 ± 0.0
2.53AsnIle: 2.53 ± 1.205
6.325AsnLys: 6.325 ± 1.868
4.428AsnLeu: 4.428 ± 1.31
1.265AsnMet: 1.265 ± 0.655
5.693AsnAsn: 5.693 ± 2.739
5.06AsnPro: 5.06 ± 0.985
1.898AsnGln: 1.898 ± 0.844
2.53AsnArg: 2.53 ± 0.566
6.325AsnSer: 6.325 ± 1.048
3.163AsnThr: 3.163 ± 0.872
3.795AsnVal: 3.795 ± 0.748
0.633AsnTrp: 0.633 ± 0.45
2.53AsnTyr: 2.53 ± 1.217
0.0AsnXaa: 0.0 ± 0.0
Pro
0.633ProAla: 0.633 ± 0.725
0.633ProCys: 0.633 ± 0.495
0.633ProAsp: 0.633 ± 0.725
2.53ProGlu: 2.53 ± 2.262
1.898ProPhe: 1.898 ± 1.486
1.265ProGly: 1.265 ± 0.9
1.265ProHis: 1.265 ± 0.502
0.633ProIle: 0.633 ± 0.45
2.53ProLys: 2.53 ± 1.094
3.163ProLeu: 3.163 ± 2.251
0.633ProMet: 0.633 ± 0.649
1.265ProAsn: 1.265 ± 0.9
0.633ProPro: 0.633 ± 0.725
1.898ProGln: 1.898 ± 0.815
1.898ProArg: 1.898 ± 0.89
3.795ProSer: 3.795 ± 0.748
1.265ProThr: 1.265 ± 0.655
4.428ProVal: 4.428 ± 1.767
0.0ProTrp: 0.0 ± 0.0
5.693ProTyr: 5.693 ± 0.222
0.0ProXaa: 0.0 ± 0.0
Gln
2.53GlnAla: 2.53 ± 0.566
1.898GlnCys: 1.898 ± 1.486
0.633GlnAsp: 0.633 ± 0.495
1.898GlnGlu: 1.898 ± 1.22
3.795GlnPhe: 3.795 ± 1.366
3.795GlnGly: 3.795 ± 3.413
0.633GlnHis: 0.633 ± 0.45
2.53GlnIle: 2.53 ± 1.029
5.693GlnLys: 5.693 ± 2.222
5.693GlnLeu: 5.693 ± 1.464
1.898GlnMet: 1.898 ± 1.307
1.898GlnAsn: 1.898 ± 2.176
3.795GlnPro: 3.795 ± 2.996
3.795GlnGln: 3.795 ± 1.584
3.163GlnArg: 3.163 ± 1.398
4.428GlnSer: 4.428 ± 1.497
0.633GlnThr: 0.633 ± 0.725
3.795GlnVal: 3.795 ± 0.748
1.898GlnTrp: 1.898 ± 0.36
1.265GlnTyr: 1.265 ± 0.9
0.0GlnXaa: 0.0 ± 0.0
Arg
1.898ArgAla: 1.898 ± 0.899
0.633ArgCys: 0.633 ± 0.495
1.898ArgAsp: 1.898 ± 0.815
3.163ArgGlu: 3.163 ± 0.702
1.898ArgPhe: 1.898 ± 0.999
0.633ArgGly: 0.633 ± 0.45
0.0ArgHis: 0.0 ± 0.0
2.53ArgIle: 2.53 ± 1.298
1.898ArgLys: 1.898 ± 0.899
6.325ArgLeu: 6.325 ± 2.364
1.265ArgMet: 1.265 ± 0.502
2.53ArgAsn: 2.53 ± 1.801
1.898ArgPro: 1.898 ± 1.486
0.633ArgGln: 0.633 ± 0.45
1.265ArgArg: 1.265 ± 0.899
2.53ArgSer: 2.53 ± 1.041
2.53ArgThr: 2.53 ± 1.368
1.265ArgVal: 1.265 ± 0.655
0.633ArgTrp: 0.633 ± 0.495
4.428ArgTyr: 4.428 ± 1.844
0.0ArgXaa: 0.0 ± 0.0
Ser
5.693SerAla: 5.693 ± 3.14
1.265SerCys: 1.265 ± 0.991
5.693SerAsp: 5.693 ± 2.972
5.693SerGlu: 5.693 ± 1.206
5.06SerPhe: 5.06 ± 1.877
3.163SerGly: 3.163 ± 1.604
0.0SerHis: 0.0 ± 0.0
4.428SerIle: 4.428 ± 1.684
6.325SerLys: 6.325 ± 2.75
10.12SerLeu: 10.12 ± 1.43
1.265SerMet: 1.265 ± 0.649
3.795SerAsn: 3.795 ± 1.298
3.163SerPro: 3.163 ± 1.732
5.06SerGln: 5.06 ± 0.903
3.795SerArg: 3.795 ± 1.549
16.445SerSer: 16.445 ± 4.719
3.795SerThr: 3.795 ± 1.298
5.693SerVal: 5.693 ± 1.038
1.265SerTrp: 1.265 ± 0.655
5.693SerTyr: 5.693 ± 2.438
0.0SerXaa: 0.0 ± 0.0
Thr
7.59ThrAla: 7.59 ± 4.028
0.633ThrCys: 0.633 ± 0.45
6.325ThrAsp: 6.325 ± 2.554
0.633ThrGlu: 0.633 ± 0.495
3.163ThrPhe: 3.163 ± 0.487
2.53ThrGly: 2.53 ± 1.217
1.265ThrHis: 1.265 ± 0.649
0.633ThrIle: 0.633 ± 0.495
1.265ThrLys: 1.265 ± 0.655
5.06ThrLeu: 5.06 ± 1.835
0.633ThrMet: 0.633 ± 0.45
1.265ThrAsn: 1.265 ± 0.649
3.163ThrPro: 3.163 ± 1.679
2.53ThrGln: 2.53 ± 1.205
0.633ThrArg: 0.633 ± 0.45
3.795ThrSer: 3.795 ± 1.366
2.53ThrThr: 2.53 ± 0.893
1.898ThrVal: 1.898 ± 0.859
0.0ThrTrp: 0.0 ± 0.0
1.898ThrTyr: 1.898 ± 0.89
0.0ThrXaa: 0.0 ± 0.0
Val
2.53ValAla: 2.53 ± 0.492
1.898ValCys: 1.898 ± 1.486
5.06ValAsp: 5.06 ± 1.785
1.898ValGlu: 1.898 ± 0.36
0.633ValPhe: 0.633 ± 0.495
3.795ValGly: 3.795 ± 1.598
2.53ValHis: 2.53 ± 1.981
2.53ValIle: 2.53 ± 1.341
5.693ValLys: 5.693 ± 1.588
2.53ValLeu: 2.53 ± 0.492
0.0ValMet: 0.0 ± 0.0
8.855ValAsn: 8.855 ± 1.746
3.163ValPro: 3.163 ± 1.277
3.163ValGln: 3.163 ± 2.021
1.898ValArg: 1.898 ± 0.89
6.958ValSer: 6.958 ± 2.641
0.0ValThr: 0.0 ± 0.0
3.795ValVal: 3.795 ± 1.63
0.633ValTrp: 0.633 ± 0.495
1.265ValTyr: 1.265 ± 0.899
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.633TrpAsp: 0.633 ± 0.725
1.898TrpGlu: 1.898 ± 0.815
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.633TrpLys: 0.633 ± 0.45
0.0TrpLeu: 0.0 ± 0.0
0.633TrpMet: 0.633 ± 0.725
1.265TrpAsn: 1.265 ± 0.9
0.0TrpPro: 0.0 ± 0.0
1.265TrpGln: 1.265 ± 0.655
0.633TrpArg: 0.633 ± 0.495
0.633TrpSer: 0.633 ± 0.786
0.633TrpThr: 0.633 ± 0.495
1.265TrpVal: 1.265 ± 0.502
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.693TyrAla: 5.693 ± 1.402
0.633TyrCys: 0.633 ± 0.45
2.53TyrAsp: 2.53 ± 1.217
0.633TyrGlu: 0.633 ± 0.495
5.693TyrPhe: 5.693 ± 1.72
3.163TyrGly: 3.163 ± 1.277
1.265TyrHis: 1.265 ± 0.502
1.898TyrIle: 1.898 ± 0.89
2.53TyrLys: 2.53 ± 0.492
6.325TyrLeu: 6.325 ± 1.176
2.53TyrMet: 2.53 ± 0.855
5.06TyrAsn: 5.06 ± 2.216
0.633TyrPro: 0.633 ± 0.495
1.898TyrGln: 1.898 ± 0.859
1.265TyrArg: 1.265 ± 0.502
4.428TyrSer: 4.428 ± 2.535
3.163TyrThr: 3.163 ± 1.358
3.163TyrVal: 3.163 ± 2.085
0.0TyrTrp: 0.0 ± 0.0
3.163TyrTyr: 3.163 ± 1.829
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1582 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
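A protein-level bootstrap of this kind might look like the sketch below. It is a hedged illustration, not the database's actual procedure: it assumes whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the error is the sample standard deviation over the replicates. The function names, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics

def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    if total == 0:
        return 0.0
    hits = sum(1 for s in sequences
               for i in range(len(s) - 1) if s[i:i + 2] == pair)
    return 1000.0 * hits / total

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the real proteome.
proteins = ["MKKLAKK", "MSSSLAA", "MKSLKK", "MAAKSS"]
err = bootstrap_error(proteins, "KK")
```

With only 4 proteins, each resample can differ strongly from the original set, which is consistent with the relatively large errors reported in the table.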

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski