Amino acid dipepetide frequency for Rubella virus (strain TO-336 vaccine) (RUBV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.138AlaAla: 20.138 ± 3.528
4.405AlaCys: 4.405 ± 1.279
5.664AlaAsp: 5.664 ± 1.664
5.979AlaGlu: 5.979 ± 1.849
4.72AlaPhe: 4.72 ± 0.008
12.901AlaGly: 12.901 ± 2.596
3.461AlaHis: 3.461 ± 0.729
3.461AlaIle: 3.461 ± 0.925
1.573AlaLys: 1.573 ± 0.37
16.362AlaLeu: 16.362 ± 2.419
1.573AlaMet: 1.573 ± 0.37
2.517AlaAsn: 2.517 ± 0.372
10.699AlaPro: 10.699 ± 3.107
5.349AlaGln: 5.349 ± 0.929
11.013AlaArg: 11.013 ± 2.593
6.293AlaSer: 6.293 ± 1.276
8.496AlaThr: 8.496 ± 0.015
7.552AlaVal: 7.552 ± 0.565
2.832AlaTrp: 2.832 ± 1.108
4.72AlaTyr: 4.72 ± 1.111
0.0AlaXaa: 0.0 ± 0.0
Cys
6.608CysAla: 6.608 ± 1.115
0.315CysCys: 0.315 ± 0.367
0.315CysAsp: 0.315 ± 0.184
1.888CysGlu: 1.888 ± 0.548
0.315CysPhe: 0.315 ± 0.367
5.035CysGly: 5.035 ± 1.462
1.888CysHis: 1.888 ± 1.1
0.629CysIle: 0.629 ± 0.183
0.629CysLys: 0.629 ± 0.369
0.944CysLeu: 0.944 ± 0.002
0.315CysMet: 0.315 ± 0.184
0.944CysAsn: 0.944 ± 0.55
2.832CysPro: 2.832 ± 1.108
0.629CysGln: 0.629 ± 0.183
3.461CysArg: 3.461 ± 0.374
0.944CysSer: 0.944 ± 0.55
2.203CysThr: 2.203 ± 0.188
1.573CysVal: 1.573 ± 0.181
0.944CysTrp: 0.944 ± 0.002
0.629CysTyr: 0.629 ± 0.734
0.0CysXaa: 0.0 ± 0.0
Asp
5.979AspAla: 5.979 ± 2.4
1.888AspCys: 1.888 ± 0.555
2.517AspAsp: 2.517 ± 0.372
1.888AspGlu: 1.888 ± 0.555
1.573AspPhe: 1.573 ± 0.181
4.72AspGly: 4.72 ± 0.008
1.888AspHis: 1.888 ± 0.555
1.573AspIle: 1.573 ± 0.922
0.0AspLys: 0.0 ± 0.0
4.405AspLeu: 4.405 ± 0.375
1.573AspMet: 1.573 ± 0.475
0.0AspAsn: 0.0 ± 0.0
6.293AspPro: 6.293 ± 0.379
0.315AspGln: 0.315 ± 0.184
4.72AspArg: 4.72 ± 1.663
2.203AspSer: 2.203 ± 1.467
2.832AspThr: 2.832 ± 0.005
4.091AspVal: 4.091 ± 0.191
1.259AspTrp: 1.259 ± 0.917
1.259AspTyr: 1.259 ± 0.738
0.0AspXaa: 0.0 ± 0.0
Glu
5.664GluAla: 5.664 ± 0.561
0.0GluCys: 0.0 ± 0.0
4.091GluAsp: 4.091 ± 0.191
2.517GluGlu: 2.517 ± 0.372
0.315GluPhe: 0.315 ± 0.184
4.091GluGly: 4.091 ± 0.191
1.259GluHis: 1.259 ± 0.365
2.203GluIle: 2.203 ± 0.739
0.315GluLys: 0.315 ± 0.184
5.035GluLeu: 5.035 ± 0.744
0.629GluMet: 0.629 ± 0.183
0.0GluAsn: 0.0 ± 0.0
3.147GluPro: 3.147 ± 0.741
1.259GluGln: 1.259 ± 0.186
6.923GluArg: 6.923 ± 1.851
1.573GluSer: 1.573 ± 0.733
2.203GluThr: 2.203 ± 0.364
5.979GluVal: 5.979 ± 1.849
2.203GluTrp: 2.203 ± 0.188
0.629GluTyr: 0.629 ± 0.183
0.0GluXaa: 0.0 ± 0.0
Phe
2.517PheAla: 2.517 ± 0.372
0.944PheCys: 0.944 ± 0.002
1.259PheAsp: 1.259 ± 0.738
0.629PheGlu: 0.629 ± 0.183
0.629PhePhe: 0.629 ± 0.369
1.888PheGly: 1.888 ± 0.548
0.944PheHis: 0.944 ± 0.002
0.0PheIle: 0.0 ± 0.0
0.944PheLys: 0.944 ± 0.002
2.203PheLeu: 2.203 ± 0.364
0.315PheMet: 0.315 ± 0.367
0.315PheAsn: 0.315 ± 0.367
1.259PhePro: 1.259 ± 0.365
0.944PheGln: 0.944 ± 0.002
1.573PheArg: 1.573 ± 0.37
0.629PheSer: 0.629 ± 0.369
2.517PheThr: 2.517 ± 0.731
1.573PheVal: 1.573 ± 0.181
0.629PheTrp: 0.629 ± 0.734
0.629PheTyr: 0.629 ± 0.183
0.0PheXaa: 0.0 ± 0.0
Gly
9.125GlyAla: 9.125 ± 1.271
4.72GlyCys: 4.72 ± 0.543
4.405GlyAsp: 4.405 ± 0.375
5.035GlyGlu: 5.035 ± 0.359
0.944GlyPhe: 0.944 ± 0.55
7.867GlyGly: 7.867 ± 2.009
3.776GlyHis: 3.776 ± 1.096
1.888GlyIle: 1.888 ± 1.106
1.259GlyLys: 1.259 ± 0.365
7.867GlyLeu: 7.867 ± 2.56
1.259GlyMet: 1.259 ± 0.738
1.573GlyAsn: 1.573 ± 0.733
8.181GlyPro: 8.181 ± 0.382
1.888GlyGln: 1.888 ± 1.1
5.035GlyArg: 5.035 ± 0.359
4.091GlySer: 4.091 ± 0.191
5.664GlyThr: 5.664 ± 1.645
3.776GlyVal: 3.776 ± 0.007
1.888GlyTrp: 1.888 ± 0.003
1.259GlyTyr: 1.259 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
3.776HisAla: 3.776 ± 0.558
0.629HisCys: 0.629 ± 0.369
1.888HisAsp: 1.888 ± 0.548
1.573HisGlu: 1.573 ± 0.922
1.573HisPhe: 1.573 ± 0.37
2.203HisGly: 2.203 ± 1.467
1.573HisHis: 1.573 ± 1.836
1.259HisIle: 1.259 ± 0.186
0.315HisLys: 0.315 ± 0.367
3.776HisLeu: 3.776 ± 1.661
0.629HisMet: 0.629 ± 0.183
0.315HisAsn: 0.315 ± 0.184
4.405HisPro: 4.405 ± 0.728
1.259HisGln: 1.259 ± 0.365
1.259HisArg: 1.259 ± 0.365
0.944HisSer: 0.944 ± 0.55
2.517HisThr: 2.517 ± 1.283
2.832HisVal: 2.832 ± 0.556
0.944HisTrp: 0.944 ± 0.55
1.888HisTyr: 1.888 ± 0.555
0.0HisXaa: 0.0 ± 0.0
Ile
1.259IleAla: 1.259 ± 0.186
1.573IleCys: 1.573 ± 0.733
2.203IleAsp: 2.203 ± 0.739
1.259IleGlu: 1.259 ± 0.186
1.573IlePhe: 1.573 ± 0.37
0.629IleGly: 0.629 ± 0.183
0.629IleHis: 0.629 ± 0.369
0.629IleIle: 0.629 ± 0.369
0.944IleLys: 0.944 ± 0.553
0.315IleLeu: 0.315 ± 0.184
1.259IleMet: 1.259 ± 0.186
0.0IleAsn: 0.0 ± 0.0
2.203IlePro: 2.203 ± 0.739
0.944IleGln: 0.944 ± 0.553
2.203IleArg: 2.203 ± 0.364
0.944IleSer: 0.944 ± 0.553
0.629IleThr: 0.629 ± 0.183
2.517IleVal: 2.517 ± 0.924
0.629IleTrp: 0.629 ± 0.183
0.629IleTyr: 0.629 ± 0.369
0.0IleXaa: 0.0 ± 0.0
Lys
1.259LysAla: 1.259 ± 0.186
1.259LysCys: 1.259 ± 0.365
0.0LysAsp: 0.0 ± 0.0
0.944LysGlu: 0.944 ± 0.553
1.573LysPhe: 1.573 ± 0.733
1.259LysGly: 1.259 ± 0.186
0.629LysHis: 0.629 ± 0.183
0.629LysIle: 0.629 ± 0.183
0.0LysLys: 0.0 ± 0.0
0.944LysLeu: 0.944 ± 0.553
0.0LysMet: 0.0 ± 0.0
0.944LysAsn: 0.944 ± 0.553
0.944LysPro: 0.944 ± 0.002
0.315LysGln: 0.315 ± 0.367
0.944LysArg: 0.944 ± 0.553
0.629LysSer: 0.629 ± 0.369
1.259LysThr: 1.259 ± 0.365
0.629LysVal: 0.629 ± 0.369
0.315LysTrp: 0.315 ± 0.184
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
13.531LeuAla: 13.531 ± 1.311
2.203LeuCys: 2.203 ± 0.739
4.72LeuAsp: 4.72 ± 1.111
5.035LeuGlu: 5.035 ± 1.296
0.944LeuPhe: 0.944 ± 0.553
5.349LeuGly: 5.349 ± 0.726
4.405LeuHis: 4.405 ± 0.927
1.573LeuIle: 1.573 ± 0.181
1.888LeuLys: 1.888 ± 0.555
7.237LeuLeu: 7.237 ± 0.723
0.944LeuMet: 0.944 ± 0.002
1.888LeuAsn: 1.888 ± 0.548
5.349LeuPro: 5.349 ± 0.175
2.203LeuGln: 2.203 ± 0.915
8.496LeuArg: 8.496 ± 0.015
5.349LeuSer: 5.349 ± 0.377
4.091LeuThr: 4.091 ± 0.191
4.405LeuVal: 4.405 ± 1.279
2.203LeuTrp: 2.203 ± 0.188
1.888LeuTyr: 1.888 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
1.573MetAla: 1.573 ± 0.181
1.573MetCys: 1.573 ± 0.922
0.629MetAsp: 0.629 ± 0.183
1.259MetGlu: 1.259 ± 0.186
0.0MetPhe: 0.0 ± 0.0
0.629MetGly: 0.629 ± 0.183
0.629MetHis: 0.629 ± 0.369
0.0MetIle: 0.0 ± 0.0
0.315MetLys: 0.315 ± 0.184
0.315MetLeu: 0.315 ± 0.184
0.0MetMet: 0.0 ± 0.0
0.629MetAsn: 0.629 ± 0.183
1.259MetPro: 1.259 ± 0.365
0.944MetGln: 0.944 ± 0.002
2.203MetArg: 2.203 ± 0.188
0.944MetSer: 0.944 ± 0.002
0.315MetThr: 0.315 ± 0.184
0.944MetVal: 0.944 ± 0.002
0.315MetTrp: 0.315 ± 0.184
0.315MetTyr: 0.315 ± 0.367
0.0MetXaa: 0.0 ± 0.0
Asn
4.091AsnAla: 4.091 ± 0.743
0.944AsnCys: 0.944 ± 0.55
0.629AsnAsp: 0.629 ± 0.183
0.315AsnGlu: 0.315 ± 0.184
0.0AsnPhe: 0.0 ± 0.0
0.944AsnGly: 0.944 ± 0.55
0.629AsnHis: 0.629 ± 0.369
0.315AsnIle: 0.315 ± 0.184
0.315AsnLys: 0.315 ± 0.184
1.888AsnLeu: 1.888 ± 0.003
0.0AsnMet: 0.0 ± 0.0
0.315AsnAsn: 0.315 ± 0.184
1.573AsnPro: 1.573 ± 1.284
0.944AsnGln: 0.944 ± 0.55
0.629AsnArg: 0.629 ± 0.183
0.315AsnSer: 0.315 ± 0.367
1.259AsnThr: 1.259 ± 0.365
0.944AsnVal: 0.944 ± 0.55
0.0AsnTrp: 0.0 ± 0.0
0.315AsnTyr: 0.315 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
11.643ProAla: 11.643 ± 0.347
1.888ProCys: 1.888 ± 0.548
5.035ProAsp: 5.035 ± 0.744
5.979ProGlu: 5.979 ± 0.357
2.203ProPhe: 2.203 ± 0.915
6.923ProGly: 6.923 ± 1.459
2.517ProHis: 2.517 ± 0.179
0.629ProIle: 0.629 ± 0.183
1.573ProLys: 1.573 ± 0.37
5.979ProLeu: 5.979 ± 0.746
0.944ProMet: 0.944 ± 0.55
0.944ProAsn: 0.944 ± 0.002
17.936ProPro: 17.936 ± 2.726
2.203ProGln: 2.203 ± 0.915
9.755ProArg: 9.755 ± 0.351
2.832ProSer: 2.832 ± 1.108
6.293ProThr: 6.293 ± 1.276
4.091ProVal: 4.091 ± 0.912
2.832ProTrp: 2.832 ± 1.098
1.888ProTyr: 1.888 ± 0.003
0.0ProXaa: 0.0 ± 0.0
Gln
3.776GlnAla: 3.776 ± 0.558
0.629GlnCys: 0.629 ± 0.183
0.944GlnAsp: 0.944 ± 0.002
0.944GlnGlu: 0.944 ± 0.002
0.315GlnPhe: 0.315 ± 0.184
2.832GlnGly: 2.832 ± 0.005
1.573GlnHis: 1.573 ± 0.733
0.315GlnIle: 0.315 ± 0.184
0.944GlnLys: 0.944 ± 0.002
2.203GlnLeu: 2.203 ± 2.018
0.944GlnMet: 0.944 ± 0.553
0.0GlnAsn: 0.0 ± 0.0
2.517GlnPro: 2.517 ± 1.283
1.259GlnGln: 1.259 ± 0.365
2.832GlnArg: 2.832 ± 0.005
2.203GlnSer: 2.203 ± 0.915
2.203GlnThr: 2.203 ± 0.915
2.517GlnVal: 2.517 ± 0.372
0.0GlnTrp: 0.0 ± 0.0
0.629GlnTyr: 0.629 ± 0.734
0.0GlnXaa: 0.0 ± 0.0
Arg
16.677ArgAla: 16.677 ± 1.5
3.147ArgCys: 3.147 ± 0.741
5.664ArgAsp: 5.664 ± 1.664
3.461ArgGlu: 3.461 ± 2.028
1.888ArgPhe: 1.888 ± 0.548
8.181ArgGly: 8.181 ± 0.721
4.091ArgHis: 4.091 ± 0.743
2.832ArgIle: 2.832 ± 0.005
0.944ArgLys: 0.944 ± 0.002
5.349ArgLeu: 5.349 ± 0.726
2.203ArgMet: 2.203 ± 0.209
0.629ArgAsn: 0.629 ± 0.734
7.237ArgPro: 7.237 ± 0.171
1.888ArgGln: 1.888 ± 0.548
7.552ArgArg: 7.552 ± 0.565
2.832ArgSer: 2.832 ± 0.556
4.091ArgThr: 4.091 ± 0.743
7.552ArgVal: 7.552 ± 1.116
3.461ArgTrp: 3.461 ± 0.178
1.888ArgTyr: 1.888 ± 1.106
0.0ArgXaa: 0.0 ± 0.0
Ser
6.293SerAla: 6.293 ± 0.93
1.573SerCys: 1.573 ± 1.284
3.461SerAsp: 3.461 ± 0.729
1.888SerGlu: 1.888 ± 0.003
0.629SerPhe: 0.629 ± 0.183
2.517SerGly: 2.517 ± 1.283
0.629SerHis: 0.629 ± 0.369
0.944SerIle: 0.944 ± 0.553
0.315SerLys: 0.315 ± 0.367
3.147SerLeu: 3.147 ± 1.292
0.0SerMet: 0.0 ± 0.0
0.629SerAsn: 0.629 ± 0.369
4.405SerPro: 4.405 ± 0.176
1.888SerGln: 1.888 ± 0.548
4.091SerArg: 4.091 ± 1.464
0.629SerSer: 0.629 ± 0.734
2.517SerThr: 2.517 ± 0.731
3.461SerVal: 3.461 ± 0.925
1.259SerTrp: 1.259 ± 0.186
1.888SerTyr: 1.888 ± 0.548
0.0SerXaa: 0.0 ± 0.0
Thr
9.44ThrAla: 9.44 ± 2.19
1.573ThrCys: 1.573 ± 0.181
2.203ThrAsp: 2.203 ± 0.188
4.405ThrGlu: 4.405 ± 0.176
0.629ThrPhe: 0.629 ± 0.183
3.461ThrGly: 3.461 ± 0.729
1.888ThrHis: 1.888 ± 0.003
0.944ThrIle: 0.944 ± 0.553
0.944ThrLys: 0.944 ± 0.002
5.664ThrLeu: 5.664 ± 1.113
0.315ThrMet: 0.315 ± 0.367
1.573ThrAsn: 1.573 ± 0.181
7.552ThrPro: 7.552 ± 2.193
1.259ThrGln: 1.259 ± 0.365
5.979ThrArg: 5.979 ± 0.357
2.517ThrSer: 2.517 ± 0.179
4.091ThrThr: 4.091 ± 2.015
4.091ThrVal: 4.091 ± 0.361
0.944ThrTrp: 0.944 ± 0.002
1.259ThrTyr: 1.259 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
9.755ValAla: 9.755 ± 3.51
2.517ValCys: 2.517 ± 0.179
2.517ValAsp: 2.517 ± 0.179
3.147ValGlu: 3.147 ± 0.914
1.888ValPhe: 1.888 ± 0.003
5.349ValGly: 5.349 ± 0.377
1.259ValHis: 1.259 ± 0.738
1.573ValIle: 1.573 ± 0.181
0.629ValLys: 0.629 ± 0.183
5.979ValLeu: 5.979 ± 0.746
1.259ValMet: 1.259 ± 0.186
0.944ValAsn: 0.944 ± 0.002
5.035ValPro: 5.035 ± 0.193
1.573ValGln: 1.573 ± 0.181
9.125ValArg: 9.125 ± 0.935
3.776ValSer: 3.776 ± 0.007
3.147ValThr: 3.147 ± 0.914
3.776ValVal: 3.776 ± 0.007
1.573ValTrp: 1.573 ± 0.733
1.573ValTyr: 1.573 ± 0.37
0.0ValXaa: 0.0 ± 0.0
Trp
3.776TrpAla: 3.776 ± 0.007
0.315TrpCys: 0.315 ± 0.367
2.203TrpAsp: 2.203 ± 0.188
1.259TrpGlu: 1.259 ± 0.186
0.315TrpPhe: 0.315 ± 0.184
2.203TrpGly: 2.203 ± 1.467
1.573TrpHis: 1.573 ± 0.733
0.944TrpIle: 0.944 ± 0.002
0.315TrpLys: 0.315 ± 0.184
2.832TrpLeu: 2.832 ± 0.005
0.0TrpMet: 0.0 ± 0.0
0.315TrpAsn: 0.315 ± 0.367
0.629TrpPro: 0.629 ± 0.369
1.573TrpGln: 1.573 ± 0.181
1.573TrpArg: 1.573 ± 0.37
1.259TrpSer: 1.259 ± 0.365
0.944TrpThr: 0.944 ± 0.002
2.203TrpVal: 2.203 ± 0.364
0.629TrpTrp: 0.629 ± 0.183
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.832TyrAla: 2.832 ± 0.556
0.944TyrCys: 0.944 ± 0.553
0.944TyrAsp: 0.944 ± 0.553
0.629TyrGlu: 0.629 ± 0.369
0.315TyrPhe: 0.315 ± 0.367
2.517TyrGly: 2.517 ± 0.731
0.315TyrHis: 0.315 ± 0.367
0.629TyrIle: 0.629 ± 0.183
0.315TyrLys: 0.315 ± 0.367
1.259TyrLeu: 1.259 ± 0.365
0.315TyrMet: 0.315 ± 0.184
1.888TyrAsn: 1.888 ± 1.1
0.629TyrPro: 0.629 ± 0.183
0.944TyrGln: 0.944 ± 0.55
2.203TyrArg: 2.203 ± 0.739
1.259TyrSer: 1.259 ± 0.186
3.461TyrThr: 3.461 ± 1.477
1.888TyrVal: 1.888 ± 0.555
0.0TyrTrp: 0.0 ± 0.0
1.888TyrTyr: 1.888 ± 0.003
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3179 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski