Amino acid dipeptide frequency for Jingmen tombus-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability in proteins, each of the 400 would therefore be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
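The counting described above can be sketched in Python. This is only an illustration under stated assumptions: the function names `dipeptide_per_mille` and `classify`, the one-letter input sequences, and the choice to count overlapping pairs within each protein separately are hypothetical and are not taken from the database, which does not state its exact counting convention on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
EXPECTED = 1000.0 / 400               # uniform expectation: 2.5 per mille

def dipeptide_per_mille(sequences):
    """Return the frequency of each of the 400 standard dipeptides in per mille.

    Assumption: overlapping pairs are counted within each sequence, never
    across sequence boundaries. Pairs containing other letters still count
    toward the total but are not reported.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}

def classify(value, expected=EXPECTED):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Toy input, not the real proteome.
freqs = dipeptide_per_mille(["MKRRLA", "ARRS"])
print(classify(freqs["RR"]), classify(freqs["AA"]))
```

With real data, the `over` and `under` labels would correspond to the red and blue cells, assuming the colouring uses the 0.25% threshold quoted above.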

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
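As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and into a percentage. The helper names are hypothetical; the value 4.762 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 4.762  # AlaAla frequency in per mille, taken from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```

The same conversion shows that the uniform expectation of 0.25% corresponds to 2.5‰, which is the scale used in the table.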

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.762 ± 1.662
AlaCys: 0.952 ± 0.626
AlaAsp: 1.905 ± 1.472
AlaGlu: 2.857 ± 0.94
AlaPhe: 1.905 ± 0.813
AlaGly: 5.714 ± 2.366
AlaHis: 2.857 ± 1.878
AlaIle: 7.619 ± 1.905
AlaLys: 0.952 ± 0.626
AlaLeu: 4.762 ± 1.613
AlaMet: 1.905 ± 0.813
AlaAsn: 4.762 ± 2.57
AlaPro: 4.762 ± 1.613
AlaGln: 2.857 ± 0.521
AlaArg: 6.667 ± 3.273
AlaSer: 6.667 ± 1.894
AlaThr: 6.667 ± 1.389
AlaVal: 3.81 ± 1.493
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.857 ± 1.154
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.905 ± 1.252
CysCys: 0.952 ± 0.626
CysAsp: 0.952 ± 0.626
CysGlu: 0.952 ± 1.002
CysPhe: 0.0 ± 0.0
CysGly: 0.952 ± 0.626
CysHis: 0.952 ± 1.002
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.952 ± 0.626
CysPro: 0.952 ± 1.002
CysGln: 0.0 ± 0.0
CysArg: 0.952 ± 0.736
CysSer: 2.857 ± 1.05
CysThr: 0.952 ± 0.736
CysVal: 0.0 ± 0.0
CysTrp: 0.952 ± 0.626
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.905 ± 0.566
AspCys: 0.0 ± 0.0
AspAsp: 6.667 ± 2.071
AspGlu: 1.905 ± 1.252
AspPhe: 5.714 ± 1.042
AspGly: 1.905 ± 0.813
AspHis: 4.762 ± 1.771
AspIle: 4.762 ± 1.419
AspLys: 0.0 ± 0.0
AspLeu: 1.905 ± 0.566
AspMet: 2.857 ± 0.94
AspAsn: 2.857 ± 0.521
AspPro: 1.905 ± 0.813
AspGln: 0.952 ± 1.002
AspArg: 1.905 ± 0.813
AspSer: 1.905 ± 0.813
AspThr: 1.905 ± 0.813
AspVal: 0.952 ± 0.626
AspTrp: 2.857 ± 1.878
AspTyr: 4.762 ± 3.13
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.905 ± 0.566
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 2.857 ± 1.878
GluPhe: 1.905 ± 0.566
GluGly: 4.762 ± 0.844
GluHis: 0.0 ± 0.0
GluIle: 4.762 ± 2.083
GluLys: 1.905 ± 1.252
GluLeu: 4.762 ± 2.083
GluMet: 2.857 ± 1.079
GluAsn: 0.952 ± 1.002
GluPro: 2.857 ± 0.94
GluGln: 0.0 ± 0.0
GluArg: 2.857 ± 1.878
GluSer: 5.714 ± 2.015
GluThr: 3.81 ± 0.317
GluVal: 2.857 ± 1.979
GluTrp: 0.0 ± 0.0
GluTyr: 2.857 ± 0.94
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.952 ± 0.736
PheCys: 0.952 ± 0.626
PheAsp: 3.81 ± 1.627
PheGlu: 2.857 ± 1.878
PhePhe: 0.952 ± 0.626
PheGly: 0.952 ± 0.736
PheHis: 1.905 ± 0.813
PheIle: 3.81 ± 0.982
PheLys: 2.857 ± 1.878
PheLeu: 0.952 ± 1.002
PheMet: 0.952 ± 1.12
PheAsn: 2.857 ± 0.521
PhePro: 0.952 ± 0.736
PheGln: 3.81 ± 2.935
PheArg: 1.905 ± 0.566
PheSer: 4.762 ± 2.897
PheThr: 5.714 ± 2.308
PheVal: 5.714 ± 0.424
PheTrp: 0.952 ± 0.736
PheTyr: 0.952 ± 0.626
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.905 ± 1.472
GlyCys: 0.952 ± 1.002
GlyAsp: 4.762 ± 1.419
GlyGlu: 3.81 ± 1.493
GlyPhe: 4.762 ± 0.424
GlyGly: 1.905 ± 1.472
GlyHis: 0.0 ± 0.0
GlyIle: 0.952 ± 1.002
GlyLys: 0.952 ± 1.002
GlyLeu: 4.762 ± 1.151
GlyMet: 1.905 ± 0.973
GlyAsn: 0.952 ± 0.736
GlyPro: 6.667 ± 0.854
GlyGln: 0.952 ± 0.626
GlyArg: 2.857 ± 1.591
GlySer: 3.81 ± 1.493
GlyThr: 5.714 ± 3.598
GlyVal: 6.667 ± 2.198
GlyTrp: 0.952 ± 0.736
GlyTyr: 0.952 ± 0.626
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.81 ± 1.526
HisCys: 0.952 ± 0.626
HisAsp: 0.952 ± 0.626
HisGlu: 0.952 ± 1.002
HisPhe: 0.952 ± 0.626
HisGly: 0.0 ± 0.0
HisHis: 0.952 ± 0.626
HisIle: 1.905 ± 1.252
HisLys: 1.905 ± 0.813
HisLeu: 1.905 ± 0.813
HisMet: 0.952 ± 0.626
HisAsn: 0.952 ± 0.736
HisPro: 2.857 ± 1.05
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.905 ± 0.566
HisThr: 0.0 ± 0.0
HisVal: 0.952 ± 0.626
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.714 ± 2.5
IleCys: 0.952 ± 0.626
IleAsp: 0.952 ± 0.626
IleGlu: 1.905 ± 1.252
IlePhe: 1.905 ± 1.252
IleGly: 2.857 ± 0.521
IleHis: 0.952 ± 0.626
IleIle: 5.714 ± 2.101
IleLys: 3.81 ± 2.504
IleLeu: 4.762 ± 1.662
IleMet: 1.905 ± 2.003
IleAsn: 2.857 ± 1.05
IlePro: 6.667 ± 0.854
IleGln: 1.905 ± 0.566
IleArg: 5.714 ± 1.097
IleSer: 3.81 ± 0.317
IleThr: 0.952 ± 1.002
IleVal: 2.857 ± 0.94
IleTrp: 0.0 ± 0.0
IleTyr: 0.952 ± 0.626
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.905 ± 0.566
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 0.952 ± 0.626
LysPhe: 0.952 ± 1.002
LysGly: 0.0 ± 0.0
LysHis: 0.952 ± 0.626
LysIle: 1.905 ± 0.813
LysLys: 2.857 ± 1.05
LysLeu: 4.762 ± 2.083
LysMet: 0.952 ± 0.626
LysAsn: 1.905 ± 1.252
LysPro: 0.0 ± 0.0
LysGln: 0.0 ± 0.0
LysArg: 1.905 ± 1.252
LysSer: 2.857 ± 0.521
LysThr: 2.857 ± 1.878
LysVal: 2.857 ± 1.878
LysTrp: 1.905 ± 0.566
LysTyr: 3.81 ± 1.526
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.571 ± 1.111
LeuCys: 1.905 ± 0.813
LeuAsp: 1.905 ± 1.252
LeuGlu: 4.762 ± 1.662
LeuPhe: 0.0 ± 0.0
LeuGly: 4.762 ± 1.151
LeuHis: 0.952 ± 0.736
LeuIle: 4.762 ± 2.488
LeuLys: 1.905 ± 1.252
LeuLeu: 3.81 ± 1.627
LeuMet: 3.81 ± 0.982
LeuAsn: 5.714 ± 0.424
LeuPro: 2.857 ± 0.521
LeuGln: 5.714 ± 1.452
LeuArg: 5.714 ± 1.042
LeuSer: 4.762 ± 0.424
LeuThr: 5.714 ± 1.042
LeuVal: 6.667 ± 0.593
LeuTrp: 0.952 ± 0.626
LeuTyr: 0.952 ± 1.002
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.905 ± 0.566
MetCys: 0.952 ± 0.626
MetAsp: 1.905 ± 0.566
MetGlu: 0.952 ± 0.626
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.952 ± 0.626
MetLys: 1.905 ± 0.813
MetLeu: 2.857 ± 0.521
MetMet: 0.0 ± 0.0
MetAsn: 2.857 ± 0.94
MetPro: 1.905 ± 0.566
MetGln: 0.0 ± 0.0
MetArg: 3.81 ± 1.132
MetSer: 2.857 ± 1.05
MetThr: 0.0 ± 0.0
MetVal: 1.905 ± 2.003
MetTrp: 0.952 ± 0.626
MetTyr: 2.857 ± 1.05
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.857 ± 1.05
AsnCys: 0.0 ± 0.0
AsnAsp: 1.905 ± 1.107
AsnGlu: 2.857 ± 1.878
AsnPhe: 3.81 ± 1.132
AsnGly: 5.714 ± 1.698
AsnHis: 0.952 ± 0.626
AsnIle: 0.0 ± 0.0
AsnLys: 1.905 ± 1.252
AsnLeu: 3.81 ± 0.982
AsnMet: 0.0 ± 0.0
AsnAsn: 7.619 ± 3.702
AsnPro: 1.905 ± 1.472
AsnGln: 1.905 ± 0.566
AsnArg: 6.667 ± 1.389
AsnSer: 0.0 ± 0.0
AsnThr: 1.905 ± 1.472
AsnVal: 1.905 ± 0.813
AsnTrp: 0.952 ± 0.626
AsnTyr: 4.762 ± 2.488
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.571 ± 0.139
ProCys: 0.0 ± 0.0
ProAsp: 1.905 ± 2.003
ProGlu: 2.857 ± 1.878
ProPhe: 3.81 ± 2.218
ProGly: 0.952 ± 0.736
ProHis: 1.905 ± 1.252
ProIle: 3.81 ± 1.493
ProLys: 1.905 ± 1.107
ProLeu: 4.762 ± 2.57
ProMet: 0.0 ± 0.0
ProAsn: 1.905 ± 0.813
ProPro: 2.857 ± 1.591
ProGln: 1.905 ± 0.566
ProArg: 0.952 ± 0.626
ProSer: 7.619 ± 3.197
ProThr: 4.762 ± 2.641
ProVal: 1.905 ± 0.566
ProTrp: 0.0 ± 0.0
ProTyr: 2.857 ± 1.05
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.952 ± 0.736
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 3.81 ± 1.851
GlnGly: 1.905 ± 1.472
GlnHis: 0.0 ± 0.0
GlnIle: 2.857 ± 0.94
GlnLys: 0.0 ± 0.0
GlnLeu: 3.81 ± 1.627
GlnMet: 2.857 ± 1.154
GlnAsn: 0.0 ± 0.0
GlnPro: 2.857 ± 0.94
GlnGln: 1.905 ± 0.813
GlnArg: 5.714 ± 1.097
GlnSer: 3.81 ± 0.317
GlnThr: 2.857 ± 3.005
GlnVal: 1.905 ± 0.566
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.619 ± 0.948
ArgCys: 2.857 ± 0.94
ArgAsp: 4.762 ± 2.083
ArgGlu: 1.905 ± 1.472
ArgPhe: 1.905 ± 1.252
ArgGly: 3.81 ± 0.317
ArgHis: 1.905 ± 0.813
ArgIle: 2.857 ± 1.878
ArgLys: 1.905 ± 0.566
ArgLeu: 8.571 ± 2.591
ArgMet: 0.952 ± 0.626
ArgAsn: 2.857 ± 1.591
ArgPro: 2.857 ± 1.154
ArgGln: 2.857 ± 0.94
ArgArg: 11.429 ± 3.512
ArgSer: 4.762 ± 0.844
ArgThr: 5.714 ± 2.5
ArgVal: 2.857 ± 1.154
ArgTrp: 0.952 ± 0.626
ArgTyr: 4.762 ± 2.083
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.714 ± 2.015
SerCys: 0.952 ± 0.736
SerAsp: 2.857 ± 1.05
SerGlu: 3.81 ± 0.982
SerPhe: 5.714 ± 1.452
SerGly: 7.619 ± 0.763
SerHis: 0.952 ± 0.626
SerIle: 3.81 ± 1.132
SerLys: 0.952 ± 0.626
SerLeu: 7.619 ± 0.763
SerMet: 1.905 ± 0.813
SerAsn: 6.667 ± 1.389
SerPro: 1.905 ± 1.472
SerGln: 1.905 ± 1.252
SerArg: 6.667 ± 0.593
SerSer: 10.476 ± 3.279
SerThr: 4.762 ± 2.641
SerVal: 5.714 ± 2.308
SerTrp: 0.0 ± 0.0
SerTyr: 0.952 ± 0.626
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.714 ± 0.424
ThrCys: 1.905 ± 2.003
ThrAsp: 4.762 ± 0.844
ThrGlu: 1.905 ± 2.003
ThrPhe: 5.714 ± 2.5
ThrGly: 4.762 ± 2.641
ThrHis: 0.0 ± 0.0
ThrIle: 2.857 ± 1.591
ThrLys: 0.952 ± 0.736
ThrLeu: 2.857 ± 0.521
ThrMet: 0.952 ± 0.626
ThrAsn: 2.857 ± 0.94
ThrPro: 5.714 ± 2.5
ThrGln: 3.81 ± 2.218
ThrArg: 5.714 ± 1.159
ThrSer: 5.714 ± 1.042
ThrThr: 7.619 ± 4.429
ThrVal: 3.81 ± 2.943
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.857 ± 2.208
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.762 ± 3.047
ValCys: 0.0 ± 0.0
ValAsp: 4.762 ± 1.151
ValGlu: 5.714 ± 0.424
ValPhe: 4.762 ± 1.655
ValGly: 5.714 ± 3.296
ValHis: 0.0 ± 0.0
ValIle: 0.952 ± 0.626
ValLys: 4.762 ± 2.083
ValLeu: 4.762 ± 2.088
ValMet: 0.0 ± 0.0
ValAsn: 0.952 ± 0.626
ValPro: 2.857 ± 1.154
ValGln: 3.81 ± 0.982
ValArg: 3.81 ± 1.493
ValSer: 2.857 ± 2.208
ValThr: 2.857 ± 1.591
ValVal: 4.762 ± 3.679
ValTrp: 2.857 ± 0.521
ValTyr: 0.952 ± 0.626
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.905 ± 1.252
TrpGlu: 0.952 ± 0.626
TrpPhe: 0.952 ± 0.626
TrpGly: 0.0 ± 0.0
TrpHis: 0.952 ± 0.736
TrpIle: 0.952 ± 0.736
TrpLys: 0.952 ± 0.626
TrpLeu: 1.905 ± 0.566
TrpMet: 0.0 ± 0.0
TrpAsn: 0.952 ± 0.626
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.952 ± 0.626
TrpSer: 0.0 ± 0.0
TrpThr: 2.857 ± 0.521
TrpVal: 0.952 ± 0.626
TrpTrp: 0.952 ± 0.626
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.81 ± 0.982
TyrCys: 0.0 ± 0.0
TyrAsp: 5.714 ± 2.671
TyrGlu: 2.857 ± 1.05
TyrPhe: 0.0 ± 0.0
TyrGly: 1.905 ± 0.813
TyrHis: 0.952 ± 0.626
TyrIle: 1.905 ± 0.813
TyrLys: 0.952 ± 0.626
TyrLeu: 2.857 ± 1.05
TyrMet: 2.857 ± 1.878
TyrAsn: 0.0 ± 0.0
TyrPro: 1.905 ± 0.813
TyrGln: 0.952 ± 0.736
TyrArg: 1.905 ± 0.566
TyrSer: 3.81 ± 2.504
TyrThr: 2.857 ± 0.94
TyrVal: 2.857 ± 1.591
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.905 ± 1.252
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1051 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
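A protein-level bootstrap of this kind can be sketched as follows. This is a generic illustration, not the database's code: the function names, the fixed seed, the toy sequences, and the use of the population standard deviation as the error measure are all assumptions, since the exact procedure is not described on this page.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Whole proteins are drawn with replacement, so each replicate has the
    same number of proteins as the original set.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: spread reported as the population standard deviation.
    return statistics.pstdev(estimates)

# Toy input, not the real proteome.
proteins = ["MKRRLA", "ARRS", "MSSTRRA"]
print(bootstrap_error(proteins, "RR"))
```

With only 3 proteins, as here, the replicates are drawn from very few distinct combinations, which may explain why some errors in the table are of the same order as the frequencies themselves.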

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski