Amino acid dipeptide frequency for Wenzhou tombus-like virus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
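A minimal sketch of how such frequencies could be computed is shown below. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for illustration. It uses one-letter codes, with `X` standing in for Xaa.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLSS", "MSSGB"]
freqs = dipeptide_per_mille(toy)

# 2.5 per mille if all 400 standard dipeptides were equally likely.
uniform = 1000.0 / 400
overrepresented = sorted(dp for dp, f in freqs.items() if f > uniform)
print(overrepresented)
```

With only a handful of residues every observed dipeptide exceeds the uniform expectation, which is why the comparison is only meaningful for reasonably large proteomes.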

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
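As a small worked example, the sketch below converts a few per mille values from the table to fractions and compares them with the uniform expectation of 1/400 (2.5‰). The helper names `per_mille_to_fraction` and `classify` are hypothetical, and a plain comparison against 2.5‰ is an assumption; the exact rule used for the red and blue marking is not stated on this page.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
table = {"SerSer": 11.411, "AlaAla": 4.805, "AlaCys": 1.201, "CysHis": 0.0}
for name, value in table.items():
    print(name, per_mille_to_fraction(value), classify(value))
```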

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.805 ± 1.754
AlaCys: 1.201 ± 0.537
AlaAsp: 1.802 ± 0.871
AlaGlu: 3.003 ± 1.733
AlaPhe: 1.802 ± 0.348
AlaGly: 3.604 ± 0.456
AlaHis: 0.601 ± 0.542
AlaIle: 3.604 ± 2.552
AlaLys: 2.402 ± 0.878
AlaLeu: 5.405 ± 1.893
AlaMet: 1.802 ± 1.058
AlaAsn: 4.204 ± 1.423
AlaPro: 4.805 ± 1.748
AlaGln: 1.802 ± 0.88
AlaArg: 4.805 ± 0.518
AlaSer: 2.402 ± 0.633
AlaThr: 4.204 ± 1.423
AlaVal: 4.805 ± 0.976
AlaTrp: 2.402 ± 0.878
AlaTyr: 1.802 ± 0.348
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.601 ± 0.477
CysCys: 0.601 ± 0.542
CysAsp: 0.601 ± 0.542
CysGlu: 0.601 ± 0.542
CysPhe: 1.802 ± 0.967
CysGly: 4.204 ± 1.771
CysHis: 0.0 ± 0.0
CysIle: 1.201 ± 1.083
CysLys: 1.201 ± 0.537
CysLeu: 0.601 ± 0.477
CysMet: 0.601 ± 0.542
CysAsn: 0.0 ± 0.0
CysPro: 1.802 ± 0.604
CysGln: 1.201 ± 0.537
CysArg: 1.802 ± 0.348
CysSer: 1.802 ± 1.625
CysThr: 1.201 ± 0.52
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.601 ± 0.542
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.405 ± 1.696
AspCys: 1.802 ± 1.058
AspAsp: 3.604 ± 0.526
AspGlu: 1.802 ± 0.967
AspPhe: 3.003 ± 0.474
AspGly: 2.402 ± 1.292
AspHis: 0.601 ± 0.542
AspIle: 2.402 ± 1.107
AspLys: 1.802 ± 0.548
AspLeu: 5.405 ± 1.8
AspMet: 1.201 ± 0.944
AspAsn: 1.802 ± 0.999
AspPro: 3.003 ± 1.161
AspGln: 2.402 ± 0.872
AspArg: 0.601 ± 0.542
AspSer: 3.003 ± 0.939
AspThr: 1.201 ± 0.944
AspVal: 1.802 ± 0.871
AspTrp: 0.0 ± 0.0
AspTyr: 1.802 ± 0.576
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.402 ± 0.626
GluCys: 0.601 ± 0.542
GluAsp: 4.204 ± 1.422
GluGlu: 6.006 ± 3.416
GluPhe: 3.604 ± 1.788
GluGly: 1.802 ± 1.625
GluHis: 0.601 ± 0.542
GluIle: 0.601 ± 0.472
GluLys: 4.805 ± 1.0
GluLeu: 3.604 ± 1.095
GluMet: 0.0 ± 0.0
GluAsn: 2.402 ± 0.871
GluPro: 1.802 ± 0.604
GluGln: 2.402 ± 1.552
GluArg: 4.204 ± 1.451
GluSer: 8.408 ± 2.187
GluThr: 2.402 ± 1.282
GluVal: 4.204 ± 0.165
GluTrp: 0.601 ± 0.472
GluTyr: 3.003 ± 0.549
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.402 ± 1.074
PheCys: 0.601 ± 0.542
PheAsp: 3.003 ± 1.061
PheGlu: 3.003 ± 1.381
PhePhe: 0.601 ± 0.542
PheGly: 1.201 ± 0.644
PheHis: 0.601 ± 0.542
PheIle: 1.802 ± 0.989
PheLys: 3.604 ± 0.481
PheLeu: 1.201 ± 0.651
PheMet: 0.0 ± 0.0
PheAsn: 1.201 ± 0.651
PhePro: 1.802 ± 1.057
PheGln: 3.003 ± 0.474
PheArg: 2.402 ± 0.748
PheSer: 3.003 ± 0.549
PheThr: 2.402 ± 1.107
PheVal: 4.204 ± 0.165
PheTrp: 0.601 ± 0.542
PheTyr: 2.402 ± 0.748
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.204 ± 1.666
GlyCys: 0.0 ± 0.0
GlyAsp: 5.405 ± 1.387
GlyGlu: 4.204 ± 1.422
GlyPhe: 2.402 ± 0.633
GlyGly: 4.805 ± 1.961
GlyHis: 0.0 ± 0.0
GlyIle: 5.405 ± 1.842
GlyLys: 7.207 ± 2.483
GlyLeu: 6.607 ± 0.966
GlyMet: 2.402 ± 0.619
GlyAsn: 3.003 ± 0.45
GlyPro: 1.802 ± 0.604
GlyGln: 1.802 ± 0.576
GlyArg: 3.604 ± 0.925
GlySer: 8.408 ± 2.817
GlyThr: 4.204 ± 2.113
GlyVal: 1.201 ± 0.52
GlyTrp: 2.402 ± 0.633
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.601 ± 0.472
HisAsp: 1.201 ± 0.651
HisGlu: 0.601 ± 0.542
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.601 ± 0.542
HisIle: 3.003 ± 0.793
HisLys: 0.601 ± 0.542
HisLeu: 1.201 ± 0.554
HisMet: 0.601 ± 0.472
HisAsn: 2.402 ± 1.074
HisPro: 2.402 ± 0.878
HisGln: 0.601 ± 0.477
HisArg: 1.802 ± 0.989
HisSer: 2.402 ± 0.198
HisThr: 0.601 ± 0.472
HisVal: 1.201 ± 1.083
HisTrp: 0.601 ± 0.472
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.802 ± 0.989
IleCys: 0.0 ± 0.0
IleAsp: 4.204 ± 1.39
IleGlu: 2.402 ± 1.473
IlePhe: 3.003 ± 0.739
IleGly: 3.003 ± 1.094
IleHis: 1.802 ± 0.576
IleIle: 4.204 ± 1.322
IleLys: 5.405 ± 0.638
IleLeu: 4.204 ± 1.987
IleMet: 1.201 ± 0.554
IleAsn: 2.402 ± 0.633
IlePro: 3.003 ± 0.793
IleGln: 2.402 ± 0.748
IleArg: 1.201 ± 0.554
IleSer: 3.604 ± 1.281
IleThr: 3.604 ± 1.864
IleVal: 3.604 ± 1.279
IleTrp: 1.201 ± 0.944
IleTyr: 0.601 ± 0.477
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.003 ± 1.468
LysCys: 0.601 ± 0.477
LysAsp: 0.0 ± 0.0
LysGlu: 3.003 ± 1.353
LysPhe: 4.204 ± 1.663
LysGly: 6.006 ± 1.674
LysHis: 1.802 ± 0.576
LysIle: 4.805 ± 2.455
LysLys: 9.61 ± 5.573
LysLeu: 1.802 ± 0.548
LysMet: 0.601 ± 0.472
LysAsn: 0.601 ± 0.477
LysPro: 4.805 ± 2.577
LysGln: 2.402 ± 0.626
LysArg: 4.204 ± 1.422
LysSer: 2.402 ± 1.909
LysThr: 7.808 ± 3.535
LysVal: 4.204 ± 2.13
LysTrp: 1.201 ± 0.644
LysTyr: 2.402 ± 1.107
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.805 ± 1.487
LeuCys: 0.601 ± 0.542
LeuAsp: 0.601 ± 0.542
LeuGlu: 6.607 ± 0.916
LeuPhe: 1.201 ± 1.085
LeuGly: 8.408 ± 3.828
LeuHis: 1.802 ± 0.989
LeuIle: 3.604 ± 0.526
LeuLys: 1.802 ± 0.996
LeuLeu: 5.405 ± 1.012
LeuMet: 1.201 ± 0.651
LeuAsn: 4.204 ± 1.184
LeuPro: 5.405 ± 1.294
LeuGln: 1.802 ± 0.548
LeuArg: 7.808 ± 1.229
LeuSer: 6.006 ± 1.473
LeuThr: 3.604 ± 0.456
LeuVal: 5.405 ± 1.278
LeuTrp: 1.201 ± 0.954
LeuTyr: 1.201 ± 0.554
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.003 ± 1.522
MetCys: 1.201 ± 0.537
MetAsp: 0.0 ± 0.0
MetGlu: 1.802 ± 1.058
MetPhe: 0.601 ± 0.542
MetGly: 0.601 ± 0.472
MetHis: 0.0 ± 0.0
MetIle: 1.201 ± 1.083
MetLys: 2.402 ± 1.107
MetLeu: 1.201 ± 0.642
MetMet: 0.0 ± 0.0
MetAsn: 1.201 ± 0.554
MetPro: 1.802 ± 0.348
MetGln: 1.802 ± 0.604
MetArg: 2.402 ± 0.748
MetSer: 1.201 ± 0.651
MetThr: 0.601 ± 0.542
MetVal: 1.802 ± 0.989
MetTrp: 0.601 ± 0.542
MetTyr: 1.201 ± 1.083
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.402 ± 0.872
AsnCys: 1.201 ± 0.651
AsnAsp: 1.802 ± 0.875
AsnGlu: 1.802 ± 0.871
AsnPhe: 1.802 ± 0.348
AsnGly: 5.405 ± 3.588
AsnHis: 0.601 ± 0.477
AsnIle: 0.601 ± 0.542
AsnLys: 4.204 ± 0.921
AsnLeu: 4.204 ± 1.423
AsnMet: 0.601 ± 0.542
AsnAsn: 4.204 ± 0.804
AsnPro: 3.604 ± 0.481
AsnGln: 1.802 ± 1.091
AsnArg: 1.201 ± 0.537
AsnSer: 5.405 ± 1.2
AsnThr: 2.402 ± 0.633
AsnVal: 2.402 ± 1.04
AsnTrp: 0.601 ± 0.542
AsnTyr: 1.802 ± 1.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.003 ± 1.738
ProCys: 2.402 ± 0.633
ProAsp: 3.003 ± 0.793
ProGlu: 3.003 ± 0.45
ProPhe: 2.402 ± 0.198
ProGly: 6.006 ± 2.227
ProHis: 1.802 ± 0.871
ProIle: 3.003 ± 0.45
ProLys: 3.604 ± 1.759
ProLeu: 5.405 ± 1.212
ProMet: 2.402 ± 1.284
ProAsn: 4.204 ± 0.835
ProPro: 6.006 ± 2.711
ProGln: 3.604 ± 1.018
ProArg: 4.204 ± 0.717
ProSer: 3.003 ± 0.988
ProThr: 3.003 ± 0.45
ProVal: 4.204 ± 1.428
ProTrp: 1.802 ± 0.576
ProTyr: 1.802 ± 1.416
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.601 ± 0.477
GlnCys: 1.201 ± 0.537
GlnAsp: 3.003 ± 1.753
GlnGlu: 2.402 ± 1.422
GlnPhe: 2.402 ± 1.301
GlnGly: 1.802 ± 0.871
GlnHis: 0.601 ± 0.542
GlnIle: 1.802 ± 0.348
GlnLys: 3.604 ± 1.642
GlnLeu: 3.003 ± 0.45
GlnMet: 1.802 ± 0.777
GlnAsn: 1.201 ± 1.083
GlnPro: 2.402 ± 1.282
GlnGln: 2.402 ± 0.633
GlnArg: 3.003 ± 0.739
GlnSer: 2.402 ± 0.972
GlnThr: 1.802 ± 0.88
GlnVal: 3.604 ± 0.456
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.601 ± 0.542
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.405 ± 1.989
ArgCys: 1.802 ± 1.057
ArgAsp: 3.604 ± 1.524
ArgGlu: 3.604 ± 1.581
ArgPhe: 2.402 ± 1.107
ArgGly: 3.003 ± 0.549
ArgHis: 1.802 ± 0.348
ArgIle: 1.802 ± 1.101
ArgLys: 2.402 ± 0.834
ArgLeu: 4.805 ± 0.857
ArgMet: 4.805 ± 2.48
ArgAsn: 3.003 ± 0.474
ArgPro: 4.805 ± 1.062
ArgGln: 3.604 ± 2.24
ArgArg: 5.405 ± 0.936
ArgSer: 3.604 ± 0.798
ArgThr: 4.805 ± 1.208
ArgVal: 2.402 ± 0.834
ArgTrp: 0.601 ± 0.477
ArgTyr: 4.204 ± 1.322
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.405 ± 1.278
SerCys: 1.802 ± 0.548
SerAsp: 4.204 ± 0.812
SerGlu: 4.204 ± 1.812
SerPhe: 1.802 ± 0.989
SerGly: 5.405 ± 0.507
SerHis: 2.402 ± 0.871
SerIle: 5.405 ± 0.936
SerLys: 3.003 ± 1.27
SerLeu: 4.805 ± 0.518
SerMet: 1.802 ± 1.057
SerAsn: 2.402 ± 1.609
SerPro: 6.006 ± 1.283
SerGln: 3.003 ± 1.102
SerArg: 4.805 ± 0.947
SerSer: 11.411 ± 4.626
SerThr: 3.003 ± 2.073
SerVal: 5.405 ± 1.69
SerTrp: 2.402 ± 1.473
SerTyr: 2.402 ± 1.292
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.006 ± 0.895
ThrCys: 1.802 ± 0.859
ThrAsp: 0.601 ± 0.542
ThrGlu: 1.802 ± 0.576
ThrPhe: 2.402 ± 1.495
ThrGly: 3.604 ± 1.561
ThrHis: 1.201 ± 0.651
ThrIle: 2.402 ± 0.872
ThrLys: 3.604 ± 1.718
ThrLeu: 4.805 ± 0.92
ThrMet: 1.201 ± 0.954
ThrAsn: 0.601 ± 0.477
ThrPro: 4.805 ± 0.958
ThrGln: 1.802 ± 0.348
ThrArg: 3.604 ± 0.697
ThrSer: 4.204 ± 1.142
ThrThr: 3.003 ± 0.45
ThrVal: 5.405 ± 0.543
ThrTrp: 1.802 ± 0.999
ThrTyr: 0.601 ± 0.542
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.604 ± 0.697
ValCys: 1.201 ± 0.954
ValAsp: 1.802 ± 0.548
ValGlu: 4.805 ± 0.92
ValPhe: 1.201 ± 0.644
ValGly: 4.805 ± 0.933
ValHis: 1.802 ± 0.967
ValIle: 3.003 ± 0.772
ValLys: 3.604 ± 2.863
ValLeu: 5.405 ± 2.03
ValMet: 0.601 ± 0.542
ValAsn: 3.604 ± 0.907
ValPro: 3.604 ± 1.178
ValGln: 1.802 ± 0.88
ValArg: 6.607 ± 2.019
ValSer: 4.805 ± 2.607
ValThr: 3.604 ± 0.697
ValVal: 7.808 ± 2.383
ValTrp: 1.201 ± 0.52
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.201 ± 0.554
TrpCys: 0.601 ± 0.477
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.601 ± 0.472
TrpPhe: 1.802 ± 0.548
TrpGly: 1.802 ± 0.348
TrpHis: 0.601 ± 0.472
TrpIle: 2.402 ± 1.023
TrpLys: 0.601 ± 0.477
TrpLeu: 1.201 ± 0.537
TrpMet: 0.601 ± 0.542
TrpAsn: 1.802 ± 0.875
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.802 ± 1.416
TrpSer: 2.402 ± 0.802
TrpThr: 0.601 ± 0.542
TrpVal: 1.201 ± 0.642
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.601 ± 0.472
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.201 ± 0.554
TyrCys: 0.601 ± 0.542
TyrAsp: 2.402 ± 0.834
TyrGlu: 2.402 ± 0.748
TyrPhe: 0.601 ± 0.472
TyrGly: 1.802 ± 0.989
TyrHis: 1.201 ± 0.537
TyrIle: 0.601 ± 0.472
TyrLys: 0.0 ± 0.0
TyrLeu: 2.402 ± 1.421
TyrMet: 0.601 ± 0.542
TyrAsn: 3.604 ± 1.073
TyrPro: 4.204 ± 1.45
TyrGln: 0.0 ± 0.0
TyrArg: 2.402 ± 1.495
TyrSer: 1.201 ± 1.083
TyrThr: 1.201 ± 0.537
TyrVal: 0.0 ± 0.0
TyrTrp: 0.601 ± 0.472
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1666 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
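The following is a rough sketch of what protein-level bootstrapping can look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies in per mille, counting pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(per_mille(sample).get(dipeptide, 0.0))
    # Assumption: the reported error is a standard deviation over replicates.
    return statistics.pstdev(values)


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLSS", "MSSGA", "AAAA"]
print(bootstrap_error(toy, "SS"))
```

With only a few proteins, as in this proteome, each replicate can differ substantially from the others, which is consistent with the relatively large errors in the table.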

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski