Amino acid dipeptide frequency for Rubus yellow net virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
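The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the database's actual pipeline: the function names (`dipeptide_per_mille`, `classify`) and the toy sequences are hypothetical, and counting overlapping pairs within each protein separately is an assumption.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is mapped to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs, counted within each protein only (an assumption).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

def classify(freq_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"

toy = ["MAAEK", "AAL"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy)
```

Applied to values from the table, `classify(8.692)` (AlaAla) gives "over" and `classify(0.79)` (AlaCys) gives "under", since the threshold is 2.5‰.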

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Amino acid order (first and second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰), grouped by first residue
Ala
AlaAla: 8.692 ± 3.855
AlaCys: 0.79 ± 1.223
AlaAsp: 3.951 ± 1.986
AlaGlu: 6.322 ± 1.674
AlaPhe: 2.766 ± 0.922
AlaGly: 1.976 ± 1.706
AlaHis: 1.58 ± 0.836
AlaIle: 1.58 ± 1.258
AlaLys: 5.136 ± 1.666
AlaLeu: 4.741 ± 1.802
AlaMet: 1.185 ± 0.627
AlaAsn: 2.766 ± 1.397
AlaPro: 1.976 ± 1.045
AlaGln: 3.556 ± 2.489
AlaArg: 3.951 ± 2.334
AlaSer: 3.951 ± 1.024
AlaThr: 5.531 ± 4.699
AlaVal: 4.741 ± 3.298
AlaTrp: 0.395 ± 1.231
AlaTyr: 3.161 ± 0.837
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.58 ± 2.447
CysCys: 0.79 ± 2.711
CysAsp: 0.79 ± 0.418
CysGlu: 0.0 ± 0.0
CysPhe: 1.185 ± 1.115
CysGly: 1.185 ± 0.925
CysHis: 0.0 ± 0.0
CysIle: 0.79 ± 0.418
CysLys: 2.371 ± 0.947
CysLeu: 0.0 ± 0.0
CysMet: 0.79 ± 0.418
CysAsn: 0.79 ± 0.418
CysPro: 1.185 ± 1.003
CysGln: 0.395 ± 0.209
CysArg: 0.79 ± 0.418
CysSer: 1.58 ± 1.038
CysThr: 2.371 ± 1.578
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.185 ± 1.699
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.161 ± 1.382
AspCys: 0.79 ± 0.418
AspAsp: 5.927 ± 2.38
AspGlu: 5.531 ± 2.925
AspPhe: 3.161 ± 1.071
AspGly: 1.976 ± 1.547
AspHis: 0.395 ± 0.209
AspIle: 1.976 ± 0.92
AspLys: 1.58 ± 1.258
AspLeu: 5.531 ± 4.739
AspMet: 1.185 ± 0.627
AspAsn: 3.556 ± 1.254
AspPro: 2.371 ± 1.255
AspGln: 2.766 ± 0.922
AspArg: 2.766 ± 1.397
AspSer: 3.161 ± 1.067
AspThr: 3.161 ± 1.123
AspVal: 3.556 ± 1.323
AspTrp: 0.395 ± 0.209
AspTyr: 3.556 ± 1.254
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.136 ± 2.006
GluCys: 0.395 ± 0.209
GluAsp: 4.346 ± 0.962
GluGlu: 9.482 ± 2.687
GluPhe: 3.161 ± 2.119
GluGly: 4.741 ± 1.742
GluHis: 1.976 ± 1.045
GluIle: 4.346 ± 3.021
GluLys: 7.507 ± 2.868
GluLeu: 7.902 ± 3.361
GluMet: 1.58 ± 0.94
GluAsn: 1.58 ± 0.697
GluPro: 4.346 ± 1.62
GluGln: 1.976 ± 0.721
GluArg: 4.741 ± 2.43
GluSer: 4.346 ± 1.278
GluThr: 4.741 ± 1.557
GluVal: 4.741 ± 1.145
GluTrp: 0.79 ± 0.418
GluTyr: 1.976 ± 1.045
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.58 ± 0.94
PheCys: 1.185 ± 1.115
PheAsp: 1.976 ± 0.721
PheGlu: 1.185 ± 0.734
PhePhe: 0.0 ± 0.0
PheGly: 1.976 ± 0.88
PheHis: 0.79 ± 0.418
PheIle: 3.556 ± 1.07
PheLys: 1.185 ± 0.627
PheLeu: 2.766 ± 1.051
PheMet: 0.0 ± 0.0
PheAsn: 0.79 ± 1.104
PhePro: 1.58 ± 0.697
PheGln: 0.79 ± 0.418
PheArg: 2.766 ± 0.922
PheSer: 1.976 ± 2.079
PheThr: 1.185 ± 0.627
PheVal: 1.58 ± 1.038
PheTrp: 0.0 ± 0.0
PheTyr: 1.976 ± 1.045
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.766 ± 1.462
GlyCys: 1.58 ± 0.836
GlyAsp: 2.371 ± 1.255
GlyGlu: 3.556 ± 1.88
GlyPhe: 1.58 ± 0.878
GlyGly: 4.741 ± 1.154
GlyHis: 0.395 ± 0.209
GlyIle: 3.556 ± 1.88
GlyLys: 5.927 ± 4.728
GlyLeu: 3.161 ± 1.671
GlyMet: 2.766 ± 0.971
GlyAsn: 1.185 ± 1.003
GlyPro: 2.371 ± 1.027
GlyGln: 2.371 ± 3.041
GlyArg: 3.951 ± 1.405
GlySer: 2.766 ± 0.913
GlyThr: 3.951 ± 1.449
GlyVal: 5.136 ± 3.488
GlyTrp: 2.371 ± 1.254
GlyTyr: 2.371 ± 1.254
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.395 ± 0.209
HisCys: 0.0 ± 0.0
HisAsp: 1.185 ± 1.115
HisGlu: 1.185 ± 0.734
HisPhe: 0.395 ± 1.355
HisGly: 1.185 ± 0.627
HisHis: 0.0 ± 0.0
HisIle: 2.371 ± 0.947
HisLys: 0.395 ± 0.209
HisLeu: 0.79 ± 0.418
HisMet: 0.0 ± 0.0
HisAsn: 1.976 ± 0.721
HisPro: 0.79 ± 1.223
HisGln: 1.58 ± 0.836
HisArg: 1.58 ± 0.697
HisSer: 1.185 ± 0.925
HisThr: 1.976 ± 0.88
HisVal: 1.58 ± 0.836
HisTrp: 1.185 ± 0.627
HisTyr: 1.58 ± 0.697
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.161 ± 2.277
IleCys: 1.976 ± 1.045
IleAsp: 3.161 ± 1.671
IleGlu: 4.346 ± 2.08
IlePhe: 1.58 ± 0.94
IleGly: 4.346 ± 2.298
IleHis: 1.185 ± 0.734
IleIle: 3.161 ± 1.143
IleLys: 3.951 ± 0.871
IleLeu: 1.58 ± 0.697
IleMet: 0.79 ± 0.418
IleAsn: 1.976 ± 1.045
IlePro: 4.346 ± 1.6
IleGln: 3.951 ± 1.76
IleArg: 1.976 ± 2.099
IleSer: 5.531 ± 1.211
IleThr: 5.136 ± 1.598
IleVal: 1.976 ± 1.0
IleTrp: 0.0 ± 0.0
IleTyr: 0.395 ± 0.209
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.556 ± 2.911
LysCys: 1.58 ± 0.836
LysAsp: 5.136 ± 1.159
LysGlu: 7.507 ± 2.389
LysPhe: 2.371 ± 1.254
LysGly: 3.951 ± 1.76
LysHis: 1.58 ± 0.836
LysIle: 3.161 ± 1.394
LysLys: 4.741 ± 1.087
LysLeu: 5.927 ± 0.602
LysMet: 1.976 ± 1.045
LysAsn: 2.766 ± 1.021
LysPro: 3.951 ± 0.871
LysGln: 1.976 ± 1.108
LysArg: 2.371 ± 0.93
LysSer: 2.766 ± 1.462
LysThr: 2.766 ± 0.922
LysVal: 4.346 ± 2.731
LysTrp: 1.976 ± 1.045
LysTyr: 1.185 ± 0.627
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.112 ± 2.401
LeuCys: 0.79 ± 0.418
LeuAsp: 3.556 ± 3.241
LeuGlu: 7.902 ± 3.461
LeuPhe: 2.371 ± 1.912
LeuGly: 4.346 ± 2.282
LeuHis: 0.79 ± 1.014
LeuIle: 3.161 ± 1.071
LeuLys: 5.531 ± 1.844
LeuLeu: 5.136 ± 3.227
LeuMet: 2.371 ± 1.316
LeuAsn: 2.371 ± 0.801
LeuPro: 5.136 ± 1.246
LeuGln: 4.741 ± 1.602
LeuArg: 5.927 ± 1.845
LeuSer: 4.741 ± 1.252
LeuThr: 5.927 ± 2.691
LeuVal: 4.741 ± 1.087
LeuTrp: 0.395 ± 0.209
LeuTyr: 3.161 ± 1.671
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.371 ± 4.658
MetCys: 0.395 ± 1.355
MetAsp: 1.976 ± 1.045
MetGlu: 1.976 ± 1.045
MetPhe: 0.79 ± 0.418
MetGly: 0.79 ± 0.418
MetHis: 0.395 ± 0.209
MetIle: 1.58 ± 0.836
MetLys: 1.976 ± 1.045
MetLeu: 2.371 ± 1.254
MetMet: 0.79 ± 0.603
MetAsn: 1.185 ± 0.925
MetPro: 1.58 ± 0.836
MetGln: 1.58 ± 0.697
MetArg: 0.79 ± 0.418
MetSer: 1.976 ± 1.045
MetThr: 0.79 ± 0.418
MetVal: 1.185 ± 0.627
MetTrp: 0.395 ± 0.209
MetTyr: 0.395 ± 0.209
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.371 ± 1.254
AsnCys: 0.0 ± 0.0
AsnAsp: 1.58 ± 0.697
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.185 ± 0.627
AsnGly: 3.556 ± 1.288
AsnHis: 0.395 ± 0.209
AsnIle: 1.58 ± 0.697
AsnLys: 1.976 ± 1.045
AsnLeu: 5.136 ± 4.081
AsnMet: 0.79 ± 0.418
AsnAsn: 0.395 ± 1.134
AsnPro: 1.976 ± 1.621
AsnGln: 2.766 ± 1.754
AsnArg: 3.161 ± 1.123
AsnSer: 2.766 ± 1.017
AsnThr: 4.346 ± 1.293
AsnVal: 1.976 ± 0.92
AsnTrp: 0.79 ± 1.104
AsnTyr: 0.79 ± 0.418
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.371 ± 1.254
ProCys: 1.185 ± 2.597
ProAsp: 2.766 ± 0.922
ProGlu: 5.927 ± 1.572
ProPhe: 0.79 ± 0.418
ProGly: 3.556 ± 1.249
ProHis: 2.371 ± 0.979
ProIle: 2.766 ± 1.858
ProLys: 3.161 ± 1.071
ProLeu: 2.371 ± 0.979
ProMet: 1.185 ± 0.627
ProAsn: 1.185 ± 0.627
ProPro: 4.346 ± 1.701
ProGln: 2.766 ± 1.462
ProArg: 3.161 ± 1.257
ProSer: 4.346 ± 0.908
ProThr: 5.531 ± 1.317
ProVal: 3.556 ± 1.025
ProTrp: 1.185 ± 0.627
ProTyr: 1.185 ± 0.734
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.556 ± 1.463
GlnCys: 0.395 ± 0.209
GlnAsp: 1.58 ± 0.697
GlnGlu: 5.136 ± 1.681
GlnPhe: 1.185 ± 0.627
GlnGly: 3.161 ± 1.257
GlnHis: 1.976 ± 1.706
GlnIle: 0.79 ± 0.824
GlnLys: 2.371 ± 0.801
GlnLeu: 5.531 ± 2.88
GlnMet: 1.185 ± 0.781
GlnAsn: 3.161 ± 1.143
GlnPro: 2.766 ± 0.922
GlnGln: 1.58 ± 0.836
GlnArg: 3.556 ± 1.237
GlnSer: 1.185 ± 0.734
GlnThr: 0.79 ± 0.418
GlnVal: 3.556 ± 1.758
GlnTrp: 0.395 ± 0.209
GlnTyr: 2.371 ± 0.801
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.531 ± 4.25
ArgCys: 0.395 ± 1.355
ArgAsp: 3.161 ± 3.485
ArgGlu: 3.161 ± 2.015
ArgPhe: 0.79 ± 0.418
ArgGly: 3.161 ± 1.123
ArgHis: 1.976 ± 1.0
ArgIle: 2.766 ± 1.462
ArgLys: 3.161 ± 0.827
ArgLeu: 5.531 ± 0.524
ArgMet: 3.161 ± 1.671
ArgAsn: 2.371 ± 0.801
ArgPro: 1.976 ± 1.045
ArgGln: 2.766 ± 1.416
ArgArg: 6.717 ± 4.865
ArgSer: 5.927 ± 1.259
ArgThr: 4.346 ± 3.509
ArgVal: 3.556 ± 1.88
ArgTrp: 2.766 ± 0.882
ArgTyr: 1.185 ± 0.627
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.741 ± 1.141
SerCys: 0.79 ± 1.104
SerAsp: 4.346 ± 1.568
SerGlu: 4.346 ± 1.51
SerPhe: 1.185 ± 0.734
SerGly: 4.741 ± 2.507
SerHis: 0.79 ± 0.418
SerIle: 4.741 ± 1.8
SerLys: 3.951 ± 2.089
SerLeu: 7.507 ± 2.41
SerMet: 0.79 ± 0.418
SerAsn: 3.161 ± 1.257
SerPro: 3.951 ± 1.365
SerGln: 1.976 ± 1.167
SerArg: 4.346 ± 1.987
SerSer: 6.322 ± 1.301
SerThr: 4.741 ± 1.989
SerVal: 2.371 ± 1.254
SerTrp: 1.58 ± 0.697
SerTyr: 0.79 ± 0.418
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.556 ± 2.214
ThrCys: 2.371 ± 1.238
ThrAsp: 3.556 ± 1.249
ThrGlu: 3.161 ± 1.067
ThrPhe: 1.185 ± 0.627
ThrGly: 5.136 ± 2.816
ThrHis: 2.371 ± 1.254
ThrIle: 5.927 ± 1.572
ThrLys: 3.556 ± 1.463
ThrLeu: 4.741 ± 1.22
ThrMet: 1.58 ± 0.94
ThrAsn: 1.976 ± 1.368
ThrPro: 5.136 ± 2.121
ThrGln: 2.766 ± 1.416
ThrArg: 5.136 ± 1.266
ThrSer: 5.531 ± 2.042
ThrThr: 5.136 ± 2.24
ThrVal: 1.976 ± 2.219
ThrTrp: 1.976 ± 1.045
ThrTyr: 1.976 ± 3.239
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.556 ± 3.8
ValCys: 1.58 ± 1.038
ValAsp: 2.766 ± 1.468
ValGlu: 2.371 ± 1.254
ValPhe: 2.766 ± 1.468
ValGly: 1.976 ± 0.721
ValHis: 1.58 ± 1.537
ValIle: 3.951 ± 2.089
ValLys: 3.161 ± 0.827
ValLeu: 4.741 ± 2.103
ValMet: 1.58 ± 0.808
ValAsn: 2.371 ± 1.647
ValPro: 3.161 ± 1.671
ValGln: 1.976 ± 1.108
ValArg: 3.951 ± 1.385
ValSer: 3.951 ± 1.449
ValThr: 3.951 ± 2.371
ValVal: 1.976 ± 1.045
ValTrp: 0.0 ± 0.0
ValTyr: 1.976 ± 1.045
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.79 ± 0.418
TrpCys: 0.0 ± 0.0
TrpAsp: 1.185 ± 0.627
TrpGlu: 2.371 ± 1.468
TrpPhe: 0.0 ± 0.0
TrpGly: 0.79 ± 0.418
TrpHis: 0.395 ± 0.209
TrpIle: 0.0 ± 0.0
TrpLys: 1.976 ± 0.92
TrpLeu: 2.371 ± 1.254
TrpMet: 0.395 ± 0.209
TrpAsn: 0.395 ± 0.209
TrpPro: 0.79 ± 0.418
TrpGln: 1.58 ± 0.878
TrpArg: 0.79 ± 0.418
TrpSer: 1.58 ± 0.836
TrpThr: 0.79 ± 0.418
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.395 ± 1.231
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.766 ± 1.017
TyrCys: 0.79 ± 0.418
TyrAsp: 1.185 ± 0.627
TyrGlu: 4.346 ± 1.568
TyrPhe: 0.395 ± 0.209
TyrGly: 1.58 ± 0.836
TyrHis: 0.395 ± 0.952
TyrIle: 2.766 ± 1.021
TyrLys: 1.976 ± 1.045
TyrLeu: 2.371 ± 1.468
TyrMet: 1.185 ± 0.627
TyrAsn: 1.58 ± 0.697
TyrPro: 1.976 ± 0.721
TyrGln: 2.371 ± 1.004
TyrArg: 1.976 ± 1.388
TyrSer: 1.58 ± 0.94
TyrThr: 1.58 ± 0.878
TyrVal: 0.79 ± 0.418
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.185 ± 0.627
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2532 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
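A protein-level bootstrap of this kind might look roughly like the sketch below. It resamples whole proteins with replacement 100 times and recomputes the frequency of one dipeptide each time. Taking the sample standard deviation of the replicates as the error is an assumption, since the page does not say which statistic is reported; the function names and the fixed seed are likewise hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(values)

toy = ["MAAEK", "AAL", "KLLE"]  # made-up sequences, for illustration only
err_aa = bootstrap_error(toy, "AA")
```

With this scheme a dipeptide that never occurs in any protein has zero frequency in every resample, which matches the 0.0 ± 0.0 entries in the table.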

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski