Amino acid dipeptide frequency for Xenotropic MuLV-related virus (isolate VP62) (XMRV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
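A minimal sketch of how such a frequency could be computed is given here. It assumes overlapping windows of two residues counted within each protein, with any non-standard letter mapped to X (Xaa); the function name and the example sequences are hypothetical, and the exact counting and normalisation used by the database may differ.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences for illustration only (not taken from the XMRV proteome).
freqs = dipeptide_per_mille(["MALLA", "AUG"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The frequencies returned always sum to 1000‰, which is a quick sanity check when comparing against a published table.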

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones in blue, in the table.
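The arithmetic behind the expected value can be sketched as follows. This assumes a uniform baseline in which every dipeptide is equally likely; the page does not state the exact threshold used for the red and blue marking, so the `classify` helper is only an illustration and both function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation (assumed baseline)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

print(expected_per_mille(20))            # 2.5 (per mille, i.e. 0.25%)
print(round(expected_per_mille(21), 3))  # 2.268 when Xaa is included
print(classify(16.478))                  # LeuLeu value from the table: over
print(classify(0.343))                   # CysCys value from the table: under
```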

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± estimated error, in ‰.

| Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.209 ± 0.8 | 0.687 ± 0.278 | 2.746 ± 0.269 | 4.119 ± 1.273 | 4.119 ± 0.786 | 5.149 ± 0.448 | 1.03 ± 0.626 | 0.687 ± 0.278 | 3.09 ± 0.471 | 10.642 ± 0.967 | 0.687 ± 0.417 | 0.687 ± 0.759 | 4.806 ± 0.405 | 3.433 ± 1.159 | 3.776 ± 1.336 | 3.433 ± 0.848 | 5.149 ± 0.602 | 3.776 ± 0.813 | 1.373 ± 0.834 | 4.119 ± 0.461 | 0.0 ± 0.0
Cys | 1.716 ± 0.056 | 0.343 ± 0.38 | 0.343 ± 0.38 | 0.343 ± 0.38 | 0.687 ± 0.759 | 0.687 ± 0.759 | 0.0 ± 0.0 | 0.687 ± 0.759 | 0.687 ± 0.36 | 1.716 ± 0.554 | 0.0 ± 0.0 | 0.687 ± 0.759 | 2.403 ± 0.41 | 1.716 ± 0.579 | 0.687 ± 0.278 | 1.373 ± 0.71 | 0.343 ± 0.209 | 0.343 ± 0.38 | 0.343 ± 0.38 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.06 ± 0.626 | 2.403 ± 0.734 | 1.373 ± 0.211 | 1.716 ± 0.579 | 1.373 ± 0.453 | 3.433 ± 0.709 | 0.343 ± 0.38 | 1.716 ± 0.056 | 1.716 ± 0.516 | 7.896 ± 0.18 | 0.0 ± 0.373 | 0.687 ± 0.278 | 6.179 ± 1.563 | 3.776 ± 1.336 | 3.776 ± 0.245 | 3.433 ± 0.49 | 1.373 ± 0.453 | 1.716 ± 0.663 | 1.373 ± 0.424 | 1.716 ± 0.056 | 0.343 ± 0.209
Glu | 6.179 ± 1.885 | 0.687 ± 0.759 | 4.806 ± 1.644 | 5.836 ± 2.217 | 0.687 ± 0.278 | 4.119 ± 1.12 | 0.0 ± 0.0 | 2.746 ± 1.151 | 4.463 ± 1.087 | 1.716 ± 0.632 | 2.06 ± 0.391 | 0.687 ± 0.36 | 2.06 ± 0.626 | 1.716 ± 0.516 | 6.866 ± 3.021 | 2.746 ± 0.895 | 5.149 ± 1.572 | 4.119 ± 0.168 | 1.03 ± 0.334 | 0.343 ± 0.209 | 0.0 ± 0.0
Phe | 1.373 ± 0.834 | 2.06 ± 0.668 | 1.03 ± 0.632 | 2.746 ± 1.114 | 0.343 ± 0.209 | 1.03 ± 0.313 | 0.687 ± 0.417 | 0.687 ± 0.36 | 0.343 ± 0.209 | 2.746 ± 0.633 | 0.0 ± 0.0 | 3.09 ± 0.882 | 2.746 ± 0.389 | 0.0 ± 0.0 | 0.687 ± 0.417 | 2.403 ± 0.626 | 1.03 ± 0.632 | 1.716 ± 0.056 | 0.0 ± 0.0 | 0.687 ± 0.759 | 0.0 ± 0.0
Gly | 6.179 ± 1.946 | 1.03 ± 1.139 | 4.119 ± 1.277 | 1.716 ± 0.663 | 1.03 ± 0.626 | 5.493 ± 1.361 | 2.403 ± 0.41 | 3.09 ± 0.426 | 3.09 ± 0.426 | 5.149 ± 1.748 | 0.687 ± 0.417 | 2.06 ± 0.22 | 9.612 ± 1.35 | 8.239 ± 0.336 | 4.806 ± 1.389 | 2.403 ± 0.82 | 6.522 ± 0.98 | 2.403 ± 0.717 | 1.716 ± 0.056 | 2.06 ± 0.825 | 0.0 ± 0.0
His | 0.687 ± 0.417 | 0.687 ± 0.417 | 0.687 ± 0.417 | 0.343 ± 0.209 | 0.687 ± 0.417 | 1.716 ± 0.579 | 0.0 ± 0.0 | 0.343 ± 0.209 | 0.343 ± 0.38 | 1.716 ± 0.579 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.06 ± 0.76 | 2.746 ± 0.582 | 1.716 ± 0.579 | 1.03 ± 0.313 | 1.373 ± 1.518 | 1.03 ± 0.626 | 1.716 ± 0.663 | 0.687 ± 0.417 | 0.0 ± 0.0
Ile | 1.716 ± 0.056 | 0.343 ± 0.209 | 1.373 ± 0.211 | 2.06 ± 0.668 | 1.03 ± 0.334 | 2.746 ± 0.895 | 1.716 ± 0.632 | 1.03 ± 0.313 | 2.403 ± 0.953 | 3.09 ± 0.939 | 1.03 ± 1.139 | 0.343 ± 0.38 | 1.373 ± 0.834 | 1.373 ± 0.453 | 1.03 ± 0.334 | 2.403 ± 0.717 | 2.403 ± 0.626 | 1.03 ± 0.416 | 1.373 ± 0.211 | 0.343 ± 0.209 | 0.0 ± 0.0
Lys | 3.776 ± 1.76 | 0.0 ± 0.0 | 2.746 ± 0.422 | 4.463 ± 0.445 | 0.0 ± 0.0 | 3.09 ± 0.471 | 0.0 ± 0.0 | 2.06 ± 0.338 | 3.433 ± 0.616 | 6.522 ± 0.391 | 0.687 ± 0.417 | 2.06 ± 0.76 | 6.179 ± 1.096 | 2.746 ± 0.633 | 4.463 ± 0.534 | 3.433 ± 1.042 | 3.09 ± 1.476 | 3.09 ± 0.841 | 0.343 ± 0.38 | 1.03 ± 0.313 | 0.0 ± 0.0
Leu | 7.209 ± 0.857 | 1.716 ± 1.898 | 3.433 ± 1.049 | 5.836 ± 1.08 | 3.433 ± 1.392 | 8.239 ± 0.882 | 2.06 ± 1.251 | 5.149 ± 1.005 | 6.522 ± 0.849 | 16.478 ± 1.764 | 0.343 ± 0.38 | 3.433 ± 1.8 | 6.522 ± 1.077 | 5.493 ± 0.823 | 3.776 ± 0.245 | 4.119 ± 0.714 | 14.761 ± 1.139 | 6.522 ± 2.292 | 0.687 ± 0.278 | 4.463 ± 0.436 | 0.0 ± 0.0
Met | 1.373 ± 0.453 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.343 ± 0.38 | 0.0 ± 0.0 | 4.119 ± 0.168 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.687 ± 0.417 | 1.03 ± 0.313 | 0.0 ± 0.0 | 0.343 ± 0.209 | 0.687 ± 0.417 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.06 ± 0.668 | 1.03 ± 0.632 | 1.373 ± 0.211 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 2.403 ± 0.41 | 0.343 ± 0.38 | 1.03 ± 0.334 | 2.403 ± 1.013 | 0.343 ± 0.209 | 0.343 ± 0.209 | 1.03 ± 0.416 | 0.343 ± 0.209 | 1.373 ± 0.424 | 3.776 ± 0.847 | 0.0 ± 0.0 | 2.06 ± 1.081 | 3.09 ± 0.501 | 1.373 ± 0.211 | 2.06 ± 0.835 | 1.03 ± 0.626 | 1.373 ± 0.557 | 2.403 ± 0.626 | 1.373 ± 0.211 | 0.343 ± 0.38 | 0.0 ± 0.0
Pro | 4.119 ± 0.168 | 1.716 ± 0.9 | 6.522 ± 1.077 | 3.433 ± 0.111 | 2.06 ± 0.668 | 6.866 ± 0.456 | 3.09 ± 0.841 | 0.687 ± 0.278 | 4.463 ± 1.4 | 10.299 ± 1.635 | 1.716 ± 0.536 | 2.403 ± 0.626 | 12.702 ± 2.386 | 4.463 ± 1.388 | 4.806 ± 0.675 | 6.866 ± 2.066 | 5.493 ± 1.719 | 5.149 ± 1.017 | 2.06 ± 0.338 | 4.463 ± 0.895 | 0.0 ± 0.0
Gln | 5.149 ± 1.017 | 0.687 ± 0.36 | 2.403 ± 0.734 | 2.746 ± 0.633 | 1.373 ± 1.005 | 4.806 ± 0.446 | 1.716 ± 0.056 | 1.03 ± 0.313 | 2.403 ± 0.75 | 5.493 ± 1.226 | 0.0 ± 0.0 | 1.716 ± 0.056 | 5.493 ± 0.823 | 2.06 ± 0.338 | 5.493 ± 0.779 | 2.746 ± 0.849 | 2.746 ± 0.389 | 5.149 ± 0.656 | 0.343 ± 0.209 | 2.06 ± 0.22 | 0.0 ± 0.0
Arg | 3.433 ± 0.615 | 0.343 ± 0.38 | 3.776 ± 0.245 | 9.269 ± 1.723 | 0.343 ± 0.209 | 5.149 ± 0.772 | 1.03 ± 0.334 | 2.403 ± 0.41 | 2.403 ± 0.41 | 7.209 ± 0.497 | 2.06 ± 0.76 | 1.03 ± 0.334 | 4.806 ± 1.057 | 3.433 ± 0.709 | 8.926 ± 3.092 | 2.746 ± 0.895 | 1.373 ± 0.557 | 3.09 ± 0.471 | 2.06 ± 0.76 | 1.373 ± 0.424 | 0.0 ± 0.0
Ser | 6.522 ± 1.473 | 0.0 ± 0.0 | 2.06 ± 0.22 | 2.746 ± 0.422 | 2.06 ± 0.668 | 4.119 ± 0.764 | 0.343 ± 0.209 | 2.06 ± 0.338 | 4.119 ± 0.786 | 4.119 ± 1.761 | 1.373 ± 0.71 | 1.373 ± 0.211 | 7.552 ± 1.57 | 4.119 ± 0.168 | 2.746 ± 0.87 | 4.463 ± 1.087 | 3.433 ± 0.848 | 4.119 ± 1.217 | 0.687 ± 0.759 | 0.343 ± 0.38 | 0.0 ± 0.0
Thr | 4.463 ± 0.645 | 0.343 ± 0.209 | 3.433 ± 1.109 | 3.776 ± 0.79 | 3.09 ± 0.77 | 5.493 ± 2.592 | 2.746 ± 0.633 | 1.373 ± 0.557 | 3.433 ± 1.264 | 7.209 ± 0.318 | 1.03 ± 0.632 | 2.403 ± 0.41 | 7.552 ± 0.49 | 3.433 ± 0.49 | 2.403 ± 0.75 | 6.522 ± 0.865 | 6.866 ± 1.491 | 3.433 ± 0.111 | 3.09 ± 0.228 | 1.03 ± 0.632 | 0.0 ± 0.0
Val | 2.403 ± 0.75 | 0.343 ± 0.38 | 2.746 ± 1.151 | 2.403 ± 0.82 | 1.716 ± 0.056 | 3.09 ± 0.957 | 0.687 ± 0.417 | 2.746 ± 0.389 | 4.806 ± 1.057 | 9.955 ± 1.314 | 0.687 ± 0.417 | 1.716 ± 0.579 | 3.09 ± 0.501 | 3.776 ± 0.78 | 3.433 ± 0.111 | 4.806 ± 0.907 | 5.149 ± 1.005 | 2.746 ± 0.582 | 0.687 ± 0.417 | 0.687 ± 0.278 | 0.0 ± 0.0
Trp | 1.716 ± 0.056 | 0.0 ± 0.0 | 2.403 ± 1.101 | 1.03 ± 0.334 | 0.687 ± 0.759 | 2.06 ± 0.881 | 0.0 ± 0.0 | 0.687 ± 0.417 | 2.746 ± 0.269 | 1.373 ± 0.453 | 0.0 ± 0.0 | 0.343 ± 0.209 | 2.746 ± 0.87 | 0.687 ± 0.36 | 0.687 ± 0.417 | 0.0 ± 0.0 | 0.687 ± 0.417 | 3.09 ± 0.994 | 0.0 ± 0.0 | 0.687 ± 0.417 | 0.0 ± 0.0
Tyr | 1.03 ± 0.313 | 1.373 ± 0.71 | 1.716 ± 0.68 | 0.687 ± 0.278 | 0.343 ± 0.38 | 1.716 ± 0.056 | 0.687 ± 0.759 | 0.687 ± 0.36 | 0.687 ± 0.278 | 2.06 ± 0.626 | 0.343 ± 0.209 | 1.373 ± 0.424 | 1.716 ± 0.579 | 1.03 ± 0.626 | 4.119 ± 0.978 | 0.343 ± 0.38 | 3.776 ± 1.336 | 1.373 ± 0.557 | 1.373 ± 0.211 | 1.03 ± 0.632 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.343 ± 0.209 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (2914 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
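One way such a protein-level bootstrap could work is sketched here. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the dipeptide frequency across replicates; the estimator actually used by the database is not stated on this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(proteins):
    """Count overlapping residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation (per mille) of one dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        counts = pair_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy sequences for illustration only (not taken from the XMRV proteome).
toy = ["MALLA", "AAGG", "LLLL"]
print(bootstrap_error(toy, "LL"))
```

With only a handful of proteins, as here, each replicate can differ substantially from the next, so the resulting error estimates should be read as rough.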

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski