Amino acid dipeptide frequency for Red clover necrotic mosaic virus (RCNMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
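The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function name `dipeptide_per_mille`, the use of overlapping windows counted within each protein, and the toy sequences are all hypothetical, and the exact counting convention used by Proteome-pI is not stated on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping windows of length 2 are counted within each
    protein, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Made-up toy sequences, NOT the RCNMV proteins.
toy = ["MKLSAAV", "MALSK"]
freqs = dipeptide_per_mille(toy)

# Expectation if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
```

Comparing each value in `freqs` with `uniform_expectation` gives a simple over/under-representation call of the kind the colour coding expresses.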

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.841 ± 2.985
AlaCys: 1.206 ± 0.76
AlaAsp: 2.413 ± 1.17
AlaGlu: 1.206 ± 0.608
AlaPhe: 4.825 ± 1.749
AlaGly: 3.619 ± 0.274
AlaHis: 1.809 ± 0.858
AlaIle: 6.031 ± 1.594
AlaLys: 3.016 ± 1.73
AlaLeu: 4.825 ± 0.977
AlaMet: 3.016 ± 0.816
AlaAsn: 1.206 ± 0.585
AlaPro: 3.619 ± 1.902
AlaGln: 1.206 ± 0.76
AlaArg: 4.825 ± 0.909
AlaSer: 4.825 ± 0.507
AlaThr: 2.413 ± 1.14
AlaVal: 7.841 ± 2.081
AlaTrp: 0.0 ± 0.0
AlaTyr: 5.428 ± 1.644
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.603 ± 0.38
CysGlu: 1.809 ± 0.657
CysPhe: 1.206 ± 0.76
CysGly: 1.206 ± 0.678
CysHis: 0.603 ± 0.662
CysIle: 0.603 ± 0.38
CysLys: 0.603 ± 0.736
CysLeu: 1.809 ± 0.718
CysMet: 0.0 ± 0.0
CysAsn: 1.809 ± 0.858
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 3.016 ± 0.623
CysSer: 3.619 ± 1.056
CysThr: 0.603 ± 0.662
CysVal: 1.206 ± 0.76
CysTrp: 1.206 ± 0.678
CysTyr: 0.603 ± 0.38
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.634 ± 2.472
AspCys: 4.222 ± 1.349
AspAsp: 3.619 ± 1.715
AspGlu: 3.619 ± 2.035
AspPhe: 3.016 ± 1.187
AspGly: 3.016 ± 1.44
AspHis: 1.206 ± 1.325
AspIle: 2.413 ± 0.668
AspLys: 1.809 ± 0.718
AspLeu: 1.809 ± 0.59
AspMet: 1.809 ± 0.718
AspAsn: 1.809 ± 1.274
AspPro: 1.809 ± 0.59
AspGln: 2.413 ± 1.14
AspArg: 1.206 ± 0.585
AspSer: 4.825 ± 1.317
AspThr: 2.413 ± 0.76
AspVal: 4.222 ± 0.609
AspTrp: 1.206 ± 0.608
AspTyr: 1.206 ± 0.608
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.016 ± 1.032
GluCys: 0.603 ± 0.38
GluAsp: 3.619 ± 1.715
GluGlu: 3.619 ± 1.715
GluPhe: 3.016 ± 1.187
GluGly: 3.016 ± 0.262
GluHis: 1.206 ± 0.678
GluIle: 1.809 ± 0.657
GluLys: 6.634 ± 2.728
GluLeu: 3.619 ± 1.197
GluMet: 0.603 ± 0.38
GluAsn: 1.809 ± 0.718
GluPro: 2.413 ± 1.858
GluGln: 3.016 ± 1.467
GluArg: 2.413 ± 0.76
GluSer: 1.809 ± 0.718
GluThr: 4.825 ± 0.92
GluVal: 3.016 ± 0.262
GluTrp: 0.603 ± 0.38
GluTyr: 1.206 ± 0.76
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.603 ± 0.736
PheCys: 2.413 ± 1.14
PheAsp: 6.031 ± 2.026
PheGlu: 1.809 ± 0.657
PhePhe: 2.413 ± 1.357
PheGly: 4.222 ± 1.333
PheHis: 0.0 ± 0.0
PheIle: 4.222 ± 1.333
PheLys: 4.222 ± 1.349
PheLeu: 2.413 ± 0.9
PheMet: 0.0 ± 0.612
PheAsn: 1.809 ± 0.858
PhePro: 1.809 ± 0.657
PheGln: 0.603 ± 0.662
PheArg: 4.222 ± 0.609
PheSer: 2.413 ± 0.473
PheThr: 1.206 ± 0.873
PheVal: 1.809 ± 2.208
PheTrp: 0.0 ± 0.0
PheTyr: 0.603 ± 0.38
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.603 ± 0.662
GlyCys: 0.603 ± 0.38
GlyAsp: 5.428 ± 0.705
GlyGlu: 2.413 ± 1.14
GlyPhe: 3.619 ± 1.306
GlyGly: 3.016 ± 0.97
GlyHis: 0.603 ± 0.662
GlyIle: 6.031 ± 0.371
GlyLys: 4.222 ± 0.843
GlyLeu: 5.428 ± 1.352
GlyMet: 1.809 ± 0.652
GlyAsn: 3.619 ± 1.18
GlyPro: 2.413 ± 1.188
GlyGln: 3.016 ± 0.623
GlyArg: 3.619 ± 1.18
GlySer: 6.031 ± 3.172
GlyThr: 2.413 ± 1.745
GlyVal: 5.428 ± 1.148
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.603 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.206 ± 0.608
HisCys: 0.603 ± 0.38
HisAsp: 0.0 ± 0.0
HisGlu: 0.603 ± 0.662
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 1.809 ± 0.718
HisIle: 1.206 ± 0.678
HisLys: 2.413 ± 1.52
HisLeu: 0.603 ± 0.662
HisMet: 1.809 ± 0.858
HisAsn: 0.603 ± 0.38
HisPro: 0.0 ± 0.0
HisGln: 1.206 ± 0.608
HisArg: 1.809 ± 1.213
HisSer: 0.603 ± 0.736
HisThr: 3.016 ± 1.187
HisVal: 1.809 ± 1.987
HisTrp: 0.0 ± 0.0
HisTyr: 1.809 ± 0.858
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.413 ± 0.473
IleCys: 3.016 ± 1.187
IleAsp: 3.619 ± 1.213
IleGlu: 1.809 ± 0.59
IlePhe: 2.413 ± 0.593
IleGly: 3.016 ± 1.377
IleHis: 1.206 ± 0.76
IleIle: 1.809 ± 0.59
IleLys: 5.428 ± 0.866
IleLeu: 4.222 ± 1.131
IleMet: 0.0 ± 0.0
IleAsn: 4.825 ± 1.72
IlePro: 6.031 ± 2.652
IleGln: 1.809 ± 1.14
IleArg: 3.619 ± 1.306
IleSer: 1.206 ± 0.585
IleThr: 2.413 ± 1.11
IleVal: 2.413 ± 1.17
IleTrp: 0.603 ± 0.736
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.031 ± 0.981
LysCys: 1.206 ± 0.873
LysAsp: 3.016 ± 1.467
LysGlu: 1.206 ± 0.76
LysPhe: 2.413 ± 1.14
LysGly: 4.222 ± 1.476
LysHis: 1.206 ± 0.678
LysIle: 3.619 ± 1.306
LysLys: 4.222 ± 0.553
LysLeu: 9.047 ± 2.213
LysMet: 0.603 ± 0.38
LysAsn: 1.206 ± 0.608
LysPro: 3.016 ± 0.623
LysGln: 2.413 ± 1.188
LysArg: 4.825 ± 1.927
LysSer: 6.634 ± 0.915
LysThr: 5.428 ± 2.556
LysVal: 4.222 ± 1.58
LysTrp: 2.413 ± 1.14
LysTyr: 0.603 ± 0.38
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.031 ± 0.981
LeuCys: 1.206 ± 0.608
LeuAsp: 4.222 ± 1.404
LeuGlu: 8.444 ± 2.931
LeuPhe: 3.619 ± 1.765
LeuGly: 6.634 ± 1.358
LeuHis: 1.809 ± 0.657
LeuIle: 2.413 ± 1.858
LeuLys: 4.222 ± 1.259
LeuLeu: 3.619 ± 0.873
LeuMet: 5.428 ± 0.824
LeuAsn: 5.428 ± 1.889
LeuPro: 2.413 ± 1.048
LeuGln: 1.206 ± 0.678
LeuArg: 1.809 ± 0.657
LeuSer: 10.856 ± 3.427
LeuThr: 2.413 ± 2.944
LeuVal: 6.634 ± 2.859
LeuTrp: 0.603 ± 0.38
LeuTyr: 0.603 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.206 ± 0.608
MetCys: 0.603 ± 0.38
MetAsp: 0.0 ± 0.0
MetGlu: 2.413 ± 0.76
MetPhe: 0.0 ± 0.0
MetGly: 2.413 ± 1.357
MetHis: 0.603 ± 0.38
MetIle: 1.809 ± 0.59
MetLys: 1.809 ± 0.858
MetLeu: 0.603 ± 0.38
MetMet: 1.809 ± 0.718
MetAsn: 1.206 ± 0.585
MetPro: 1.809 ± 0.718
MetGln: 1.206 ± 0.678
MetArg: 0.603 ± 0.736
MetSer: 3.619 ± 1.902
MetThr: 1.809 ± 0.858
MetVal: 2.413 ± 1.14
MetTrp: 1.206 ± 0.678
MetTyr: 0.603 ± 0.736
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.413 ± 1.14
AsnCys: 0.0 ± 0.0
AsnAsp: 1.206 ± 0.608
AsnGlu: 3.016 ± 1.499
AsnPhe: 1.206 ± 0.678
AsnGly: 1.809 ± 0.657
AsnHis: 0.0 ± 0.0
AsnIle: 1.206 ± 0.873
AsnLys: 1.809 ± 0.858
AsnLeu: 5.428 ± 2.004
AsnMet: 0.603 ± 0.38
AsnAsn: 1.206 ± 0.585
AsnPro: 7.238 ± 1.107
AsnGln: 2.413 ± 1.745
AsnArg: 5.428 ± 1.128
AsnSer: 3.619 ± 1.902
AsnThr: 3.619 ± 1.656
AsnVal: 2.413 ± 1.858
AsnTrp: 0.603 ± 0.662
AsnTyr: 1.809 ± 1.472
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.619 ± 2.0
ProCys: 0.603 ± 0.662
ProAsp: 2.413 ± 1.286
ProGlu: 2.413 ± 0.76
ProPhe: 1.809 ± 1.363
ProGly: 4.222 ± 1.429
ProHis: 1.206 ± 0.678
ProIle: 3.016 ± 1.377
ProLys: 4.222 ± 0.553
ProLeu: 3.619 ± 0.754
ProMet: 0.0 ± 0.0
ProAsn: 2.413 ± 1.958
ProPro: 3.619 ± 2.0
ProGln: 1.809 ± 1.472
ProArg: 1.809 ± 0.657
ProSer: 3.016 ± 2.306
ProThr: 3.619 ± 1.18
ProVal: 3.016 ± 1.9
ProTrp: 0.0 ± 0.0
ProTyr: 1.206 ± 0.76
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.619 ± 1.715
GlnCys: 0.603 ± 0.38
GlnAsp: 1.809 ± 0.59
GlnGlu: 0.603 ± 0.736
GlnPhe: 1.206 ± 0.873
GlnGly: 1.809 ± 0.59
GlnHis: 0.603 ± 0.38
GlnIle: 1.809 ± 0.657
GlnLys: 1.809 ± 0.59
GlnLeu: 4.222 ± 1.0
GlnMet: 3.016 ± 1.113
GlnAsn: 1.809 ± 0.657
GlnPro: 2.413 ± 1.17
GlnGln: 0.603 ± 0.38
GlnArg: 4.825 ± 0.893
GlnSer: 0.603 ± 0.736
GlnThr: 1.809 ± 0.768
GlnVal: 1.206 ± 0.608
GlnTrp: 2.413 ± 0.9
GlnTyr: 0.603 ± 0.38
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.825 ± 1.749
ArgCys: 1.206 ± 0.873
ArgAsp: 1.809 ± 1.14
ArgGlu: 2.413 ± 0.76
ArgPhe: 3.016 ± 1.113
ArgGly: 1.809 ± 0.59
ArgHis: 1.809 ± 0.718
ArgIle: 3.619 ± 1.252
ArgLys: 5.428 ± 1.841
ArgLeu: 4.222 ± 1.0
ArgMet: 1.809 ± 0.754
ArgAsn: 3.619 ± 0.754
ArgPro: 1.206 ± 0.678
ArgGln: 1.206 ± 0.585
ArgArg: 2.413 ± 0.76
ArgSer: 6.634 ± 0.378
ArgThr: 4.825 ± 0.507
ArgVal: 7.841 ± 0.864
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.016 ± 0.946
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.634 ± 1.949
SerCys: 0.0 ± 0.0
SerAsp: 3.619 ± 1.63
SerGlu: 3.619 ± 1.197
SerPhe: 1.809 ± 1.274
SerGly: 6.634 ± 3.408
SerHis: 1.206 ± 0.608
SerIle: 5.428 ± 1.148
SerLys: 7.238 ± 0.701
SerLeu: 8.444 ± 1.806
SerMet: 1.809 ± 1.213
SerAsn: 3.016 ± 1.886
SerPro: 0.603 ± 0.662
SerGln: 3.619 ± 1.056
SerArg: 5.428 ± 1.338
SerSer: 7.841 ± 4.268
SerThr: 3.619 ± 2.548
SerVal: 8.444 ± 3.302
SerTrp: 1.206 ± 0.608
SerTyr: 2.413 ± 0.473
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.016 ± 1.127
ThrCys: 1.206 ± 0.585
ThrAsp: 1.809 ± 1.987
ThrGlu: 1.206 ± 0.678
ThrPhe: 2.413 ± 0.473
ThrGly: 5.428 ± 1.34
ThrHis: 1.809 ± 0.59
ThrIle: 3.016 ± 0.262
ThrLys: 3.016 ± 1.802
ThrLeu: 6.031 ± 2.914
ThrMet: 0.0 ± 0.0
ThrAsn: 4.222 ± 0.602
ThrPro: 3.619 ± 1.435
ThrGln: 2.413 ± 1.17
ThrArg: 2.413 ± 0.76
ThrSer: 4.825 ± 2.246
ThrThr: 3.016 ± 2.306
ThrVal: 5.428 ± 3.827
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.603 ± 0.736
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.238 ± 1.729
ValCys: 1.206 ± 0.76
ValAsp: 7.841 ± 2.44
ValGlu: 7.841 ± 1.98
ValPhe: 1.809 ± 0.657
ValGly: 3.619 ± 2.0
ValHis: 2.413 ± 1.216
ValIle: 0.603 ± 0.736
ValLys: 2.413 ± 0.668
ValLeu: 4.825 ± 1.485
ValMet: 3.016 ± 0.262
ValAsn: 2.413 ± 0.593
ValPro: 3.016 ± 0.623
ValGln: 4.825 ± 3.098
ValArg: 4.825 ± 1.72
ValSer: 6.031 ± 0.524
ValThr: 3.619 ± 2.945
ValVal: 5.428 ± 0.333
ValTrp: 1.206 ± 0.678
ValTyr: 3.016 ± 1.187
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.809 ± 0.858
TrpCys: 0.0 ± 0.0
TrpAsp: 0.603 ± 0.736
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.809 ± 0.858
TrpGly: 0.603 ± 0.662
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.603 ± 0.662
TrpLeu: 1.809 ± 0.768
TrpMet: 0.0 ± 0.0
TrpAsn: 1.206 ± 0.678
TrpPro: 0.0 ± 0.0
TrpGln: 0.603 ± 0.38
TrpArg: 2.413 ± 0.9
TrpSer: 1.206 ± 0.608
TrpThr: 0.0 ± 0.0
TrpVal: 1.206 ± 0.678
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.413 ± 1.204
TyrCys: 0.0 ± 0.0
TyrAsp: 0.603 ± 0.736
TyrGlu: 1.809 ± 1.274
TyrPhe: 1.809 ± 1.14
TyrGly: 0.603 ± 0.736
TyrHis: 0.0 ± 0.0
TyrIle: 1.206 ± 0.585
TyrLys: 2.413 ± 0.76
TyrLeu: 3.016 ± 1.113
TyrMet: 0.0 ± 0.0
TyrAsn: 1.809 ± 0.858
TyrPro: 0.603 ± 0.38
TyrGln: 1.809 ± 0.768
TyrArg: 1.206 ± 0.585
TyrSer: 2.413 ± 0.9
TyrThr: 2.413 ± 1.204
TyrVal: 1.206 ± 0.873
TyrTrp: 0.603 ± 0.662
TyrTyr: 0.603 ± 0.662
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1659 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
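As a rough illustration of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement and reports the spread of one dipeptide's frequency across replicates. It is a sketch under assumptions, not the procedure used for the table: the helper names `pair_per_mille` and `bootstrap_error`, the use of the population standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    total = sum(max(len(s) - 1, 0) for s in sequences)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: each replicate draws len(sequences) proteins with
    replacement, and the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Made-up toy sequences, NOT the RCNMV proteins.
toy = ["MKLSAAV", "MALSK", "GGLSLS"]
error = bootstrap_error(toy, "LS")
```

With only a handful of proteins, as here, the replicates differ mainly in which proteins happen to be drawn more than once, so the error reflects how unevenly a dipeptide is spread across proteins.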

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski