Amino acid dipepetide frequency for Ceratobasidium endornavirus G

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.709AlaAla: 7.709 ± 0.744
2.045AlaCys: 2.045 ± 1.03
5.035AlaAsp: 5.035 ± 2.144
5.821AlaGlu: 5.821 ± 0.833
0.629AlaPhe: 0.629 ± 0.029
4.405AlaGly: 4.405 ± 1.565
2.203AlaHis: 2.203 ± 0.273
4.091AlaIle: 4.091 ± 0.021
4.091AlaLys: 4.091 ± 0.021
7.867AlaLeu: 7.867 ± 0.142
1.888AlaMet: 1.888 ± 0.223
4.248AlaAsn: 4.248 ± 0.396
2.517AlaPro: 2.517 ± 0.458
3.147AlaGln: 3.147 ± 0.193
4.877AlaArg: 4.877 ± 0.707
4.877AlaSer: 4.877 ± 0.027
5.035AlaThr: 5.035 ± 1.124
4.405AlaVal: 4.405 ± 1.494
1.416AlaTrp: 1.416 ± 0.321
1.416AlaTyr: 1.416 ± 0.019
0.0AlaXaa: 0.0 ± 0.0
Cys
1.416CysAla: 1.416 ± 0.661
0.787CysCys: 0.787 ± 0.292
0.944CysAsp: 0.944 ± 0.554
0.787CysGlu: 0.787 ± 0.388
0.315CysPhe: 0.315 ± 0.185
1.573CysGly: 1.573 ± 0.583
0.315CysHis: 0.315 ± 0.185
0.944CysIle: 0.944 ± 0.214
1.259CysLys: 1.259 ± 0.281
1.259CysLeu: 1.259 ± 0.059
0.629CysMet: 0.629 ± 0.369
2.045CysAsn: 2.045 ± 0.351
1.259CysPro: 1.259 ± 0.281
1.101CysGln: 1.101 ± 0.203
0.472CysArg: 0.472 ± 0.233
0.629CysSer: 0.629 ± 0.31
0.787CysThr: 0.787 ± 0.388
1.416CysVal: 1.416 ± 0.661
0.629CysTrp: 0.629 ± 0.029
1.101CysTyr: 1.101 ± 0.136
0.0CysXaa: 0.0 ± 0.0
Asp
3.461AspAla: 3.461 ± 0.688
1.101AspCys: 1.101 ± 0.136
2.989AspAsp: 2.989 ± 0.115
2.989AspGlu: 2.989 ± 0.565
1.573AspPhe: 1.573 ± 0.243
4.405AspGly: 4.405 ± 0.474
1.573AspHis: 1.573 ± 0.096
3.933AspIle: 3.933 ± 0.099
2.989AspLys: 2.989 ± 0.455
5.035AspLeu: 5.035 ± 0.784
2.045AspMet: 2.045 ± 0.329
3.304AspAsn: 3.304 ± 1.089
2.517AspPro: 2.517 ± 1.242
2.36AspGln: 2.36 ± 0.145
2.675AspArg: 2.675 ± 0.3
2.989AspSer: 2.989 ± 1.135
3.461AspThr: 3.461 ± 0.672
3.147AspVal: 3.147 ± 0.533
2.675AspTrp: 2.675 ± 0.3
1.731AspTyr: 1.731 ± 1.186
0.0AspXaa: 0.0 ± 0.0
Glu
4.248GluAla: 4.248 ± 0.736
1.101GluCys: 1.101 ± 0.476
2.675GluAsp: 2.675 ± 0.04
2.517GluGlu: 2.517 ± 0.902
2.203GluPhe: 2.203 ± 0.407
4.248GluGly: 4.248 ± 0.396
2.203GluHis: 2.203 ± 0.613
2.517GluIle: 2.517 ± 0.118
1.573GluLys: 1.573 ± 0.096
5.349GluLeu: 5.349 ± 0.42
2.045GluMet: 2.045 ± 0.351
2.989GluAsn: 2.989 ± 0.565
2.517GluPro: 2.517 ± 0.902
2.832GluGln: 2.832 ± 1.057
2.517GluArg: 2.517 ± 0.118
2.675GluSer: 2.675 ± 0.72
3.304GluThr: 3.304 ± 0.61
4.72GluVal: 4.72 ± 0.629
1.888GluTrp: 1.888 ± 0.428
1.416GluTyr: 1.416 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
2.36PheAla: 2.36 ± 0.145
0.944PheCys: 0.944 ± 0.554
1.416PheAsp: 1.416 ± 0.019
2.36PheGlu: 2.36 ± 0.485
0.472PhePhe: 0.472 ± 0.107
2.045PheGly: 2.045 ± 0.69
0.472PheHis: 0.472 ± 0.107
2.045PheIle: 2.045 ± 0.011
2.045PheLys: 2.045 ± 0.011
1.259PheLeu: 1.259 ± 0.059
0.787PheMet: 0.787 ± 0.053
1.416PheAsn: 1.416 ± 0.019
0.787PhePro: 0.787 ± 0.388
1.888PheGln: 1.888 ± 0.252
0.944PheArg: 0.944 ± 0.214
1.101PheSer: 1.101 ± 0.543
1.888PheThr: 1.888 ± 0.088
2.045PheVal: 2.045 ± 0.351
0.315PheTrp: 0.315 ± 0.185
0.944PheTyr: 0.944 ± 0.126
0.0PheXaa: 0.0 ± 0.0
Gly
4.091GlyAla: 4.091 ± 0.021
1.259GlyCys: 1.259 ± 0.399
2.989GlyAsp: 2.989 ± 0.225
2.832GlyGlu: 2.832 ± 0.302
2.675GlyPhe: 2.675 ± 0.38
2.36GlyGly: 2.36 ± 0.145
2.203GlyHis: 2.203 ± 0.613
3.461GlyIle: 3.461 ± 0.332
4.405GlyLys: 4.405 ± 1.154
5.507GlyLeu: 5.507 ± 1.017
1.101GlyMet: 1.101 ± 0.203
3.776GlyAsn: 3.776 ± 0.176
4.248GlyPro: 4.248 ± 0.396
2.203GlyGln: 2.203 ± 0.407
4.248GlyArg: 4.248 ± 0.623
4.405GlySer: 4.405 ± 1.154
4.248GlyThr: 4.248 ± 0.056
4.72GlyVal: 4.72 ± 0.73
1.416GlyTrp: 1.416 ± 0.321
2.203GlyTyr: 2.203 ± 0.953
0.0GlyXaa: 0.0 ± 0.0
His
1.416HisAla: 1.416 ± 0.019
0.787HisCys: 0.787 ± 0.048
2.832HisAsp: 2.832 ± 0.717
1.259HisGlu: 1.259 ± 0.399
1.101HisPhe: 1.101 ± 0.203
1.731HisGly: 1.731 ± 0.506
0.787HisHis: 0.787 ± 0.632
1.573HisIle: 1.573 ± 0.583
1.888HisLys: 1.888 ± 0.428
2.675HisLeu: 2.675 ± 0.3
0.944HisMet: 0.944 ± 0.126
1.888HisAsn: 1.888 ± 0.088
1.416HisPro: 1.416 ± 1.001
1.731HisGln: 1.731 ± 0.174
2.517HisArg: 2.517 ± 1.137
1.573HisSer: 1.573 ± 0.096
2.36HisThr: 2.36 ± 0.145
1.731HisVal: 1.731 ± 0.846
0.787HisTrp: 0.787 ± 0.632
0.944HisTyr: 0.944 ± 0.466
0.0HisXaa: 0.0 ± 0.0
Ile
4.877IleAla: 4.877 ± 1.333
0.787IleCys: 0.787 ± 0.048
3.619IleAsp: 3.619 ± 0.594
3.304IleGlu: 3.304 ± 0.27
0.629IlePhe: 0.629 ± 0.31
4.405IleGly: 4.405 ± 0.814
2.045IleHis: 2.045 ± 0.011
4.405IleIle: 4.405 ± 0.134
2.36IleLys: 2.36 ± 0.145
3.619IleLeu: 3.619 ± 0.254
1.416IleMet: 1.416 ± 0.321
3.933IleAsn: 3.933 ± 0.241
1.259IlePro: 1.259 ± 0.739
2.045IleGln: 2.045 ± 0.69
2.517IleArg: 2.517 ± 0.118
2.36IleSer: 2.36 ± 0.535
5.192IleThr: 5.192 ± 0.522
3.304IleVal: 3.304 ± 0.409
0.629IleTrp: 0.629 ± 0.709
2.517IleTyr: 2.517 ± 0.458
0.0IleXaa: 0.0 ± 0.0
Lys
3.147LysAla: 3.147 ± 0.873
0.787LysCys: 0.787 ± 0.632
1.259LysAsp: 1.259 ± 0.621
2.517LysGlu: 2.517 ± 0.797
2.832LysPhe: 2.832 ± 0.038
2.675LysGly: 2.675 ± 0.64
1.101LysHis: 1.101 ± 0.136
2.203LysIle: 2.203 ± 0.407
1.416LysLys: 1.416 ± 0.321
5.979LysLeu: 5.979 ± 0.91
1.259LysMet: 1.259 ± 0.621
2.045LysAsn: 2.045 ± 0.351
3.147LysPro: 3.147 ± 0.193
2.36LysGln: 2.36 ± 0.145
1.888LysArg: 1.888 ± 0.088
2.045LysSer: 2.045 ± 0.011
4.248LysThr: 4.248 ± 0.056
3.619LysVal: 3.619 ± 0.426
0.787LysTrp: 0.787 ± 0.292
2.517LysTyr: 2.517 ± 0.562
0.157LysXaa: 0.157 ± 0.078
Leu
8.181LeuAla: 8.181 ± 1.657
1.259LeuCys: 1.259 ± 0.621
6.293LeuAsp: 6.293 ± 0.294
5.035LeuGlu: 5.035 ± 1.124
2.203LeuPhe: 2.203 ± 0.407
5.349LeuGly: 5.349 ± 0.26
3.461LeuHis: 3.461 ± 0.332
4.405LeuIle: 4.405 ± 1.226
3.933LeuLys: 3.933 ± 0.099
9.283LeuLeu: 9.283 ± 0.859
2.36LeuMet: 2.36 ± 0.485
4.248LeuAsn: 4.248 ± 0.284
4.877LeuPro: 4.877 ± 0.313
3.933LeuGln: 3.933 ± 0.241
5.507LeuArg: 5.507 ± 0.677
5.821LeuSer: 5.821 ± 0.527
6.451LeuThr: 6.451 ± 1.143
7.237LeuVal: 7.237 ± 0.851
1.259LeuTrp: 1.259 ± 0.059
3.304LeuTyr: 3.304 ± 0.27
0.0LeuXaa: 0.0 ± 0.0
Met
3.304MetAla: 3.304 ± 0.95
0.629MetCys: 0.629 ± 0.369
1.731MetAsp: 1.731 ± 0.174
1.259MetGlu: 1.259 ± 0.059
0.944MetPhe: 0.944 ± 0.214
1.731MetGly: 1.731 ± 0.514
0.944MetHis: 0.944 ± 0.466
1.888MetIle: 1.888 ± 0.088
1.101MetLys: 1.101 ± 0.136
4.091MetLeu: 4.091 ± 0.021
0.472MetMet: 0.472 ± 0.233
1.101MetAsn: 1.101 ± 0.136
1.416MetPro: 1.416 ± 0.019
1.101MetGln: 1.101 ± 0.203
0.944MetArg: 0.944 ± 0.126
1.416MetSer: 1.416 ± 0.321
1.573MetThr: 1.573 ± 0.583
1.573MetVal: 1.573 ± 0.436
0.472MetTrp: 0.472 ± 0.447
1.573MetTyr: 1.573 ± 0.583
0.0MetXaa: 0.0 ± 0.0
Asn
3.776AsnAla: 3.776 ± 1.183
1.416AsnCys: 1.416 ± 0.321
2.832AsnAsp: 2.832 ± 0.642
1.888AsnGlu: 1.888 ± 0.088
1.888AsnPhe: 1.888 ± 0.768
3.619AsnGly: 3.619 ± 0.426
1.731AsnHis: 1.731 ± 0.174
2.675AsnIle: 2.675 ± 2.079
2.36AsnLys: 2.36 ± 0.145
5.035AsnLeu: 5.035 ± 1.255
1.731AsnMet: 1.731 ± 0.514
2.989AsnAsn: 2.989 ± 0.904
2.832AsnPro: 2.832 ± 0.377
1.416AsnGln: 1.416 ± 0.019
1.259AsnArg: 1.259 ± 0.059
3.619AsnSer: 3.619 ± 0.086
5.192AsnThr: 5.192 ± 1.857
4.248AsnVal: 4.248 ± 0.963
1.731AsnTrp: 1.731 ± 0.166
1.573AsnTyr: 1.573 ± 0.096
0.0AsnXaa: 0.0 ± 0.0
Pro
3.147ProAla: 3.147 ± 0.487
0.0ProCys: 0.0 ± 0.0
2.989ProAsp: 2.989 ± 0.795
2.675ProGlu: 2.675 ± 0.3
0.944ProPhe: 0.944 ± 0.466
4.563ProGly: 4.563 ± 0.552
0.472ProHis: 0.472 ± 0.447
5.035ProIle: 5.035 ± 0.105
1.731ProLys: 1.731 ± 0.514
4.563ProLeu: 4.563 ± 1.828
1.259ProMet: 1.259 ± 0.281
3.304ProAsn: 3.304 ± 0.069
1.259ProPro: 1.259 ± 0.621
2.045ProGln: 2.045 ± 0.329
1.888ProArg: 1.888 ± 0.252
2.045ProSer: 2.045 ± 0.351
2.675ProThr: 2.675 ± 0.3
3.933ProVal: 3.933 ± 0.099
0.787ProTrp: 0.787 ± 0.048
1.888ProTyr: 1.888 ± 0.592
0.0ProXaa: 0.0 ± 0.0
Gln
3.304GlnAla: 3.304 ± 0.27
0.315GlnCys: 0.315 ± 0.155
1.416GlnAsp: 1.416 ± 0.699
1.731GlnGlu: 1.731 ± 0.174
1.416GlnPhe: 1.416 ± 0.019
2.203GlnGly: 2.203 ± 0.273
1.888GlnHis: 1.888 ± 0.088
2.989GlnIle: 2.989 ± 0.455
0.944GlnLys: 0.944 ± 0.126
4.72GlnLeu: 4.72 ± 0.289
1.416GlnMet: 1.416 ± 0.321
1.259GlnAsn: 1.259 ± 0.281
2.832GlnPro: 2.832 ± 0.642
2.203GlnGln: 2.203 ± 0.407
2.517GlnArg: 2.517 ± 1.137
3.304GlnSer: 3.304 ± 0.61
1.888GlnThr: 1.888 ± 0.252
2.675GlnVal: 2.675 ± 0.98
0.629GlnTrp: 0.629 ± 0.369
1.101GlnTyr: 1.101 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
3.776ArgAla: 3.776 ± 0.503
1.259ArgCys: 1.259 ± 0.059
3.776ArgAsp: 3.776 ± 0.176
2.675ArgGlu: 2.675 ± 0.38
1.416ArgPhe: 1.416 ± 0.019
2.36ArgGly: 2.36 ± 0.145
2.045ArgHis: 2.045 ± 1.37
2.203ArgIle: 2.203 ± 0.407
2.517ArgLys: 2.517 ± 0.222
5.349ArgLeu: 5.349 ± 0.6
1.731ArgMet: 1.731 ± 0.174
2.675ArgAsn: 2.675 ± 0.38
2.989ArgPro: 2.989 ± 1.135
2.045ArgGln: 2.045 ± 0.351
2.517ArgArg: 2.517 ± 0.797
1.731ArgSer: 1.731 ± 0.166
2.36ArgThr: 2.36 ± 0.824
3.619ArgVal: 3.619 ± 0.426
0.629ArgTrp: 0.629 ± 0.369
2.203ArgTyr: 2.203 ± 0.953
0.0ArgXaa: 0.0 ± 0.0
Ser
3.461SerAla: 3.461 ± 0.672
1.101SerCys: 1.101 ± 0.136
4.405SerAsp: 4.405 ± 0.886
3.776SerGlu: 3.776 ± 0.516
1.888SerPhe: 1.888 ± 0.592
3.776SerGly: 3.776 ± 0.503
1.888SerHis: 1.888 ± 0.088
2.517SerIle: 2.517 ± 1.242
2.517SerLys: 2.517 ± 0.562
7.552SerLeu: 7.552 ± 1.347
1.888SerMet: 1.888 ± 0.088
2.36SerAsn: 2.36 ± 1.215
0.944SerPro: 0.944 ± 0.126
2.045SerGln: 2.045 ± 0.351
2.832SerArg: 2.832 ± 1.397
4.091SerSer: 4.091 ± 0.319
2.832SerThr: 2.832 ± 0.717
4.248SerVal: 4.248 ± 0.396
0.944SerTrp: 0.944 ± 0.126
2.36SerTyr: 2.36 ± 0.875
0.0SerXaa: 0.0 ± 0.0
Thr
5.507ThrAla: 5.507 ± 1.017
1.573ThrCys: 1.573 ± 0.776
2.832ThrAsp: 2.832 ± 0.302
3.933ThrGlu: 3.933 ± 0.241
1.731ThrPhe: 1.731 ± 0.506
4.563ThrGly: 4.563 ± 0.128
2.203ThrHis: 2.203 ± 0.407
3.304ThrIle: 3.304 ± 0.409
3.461ThrLys: 3.461 ± 0.332
5.035ThrLeu: 5.035 ± 0.575
2.832ThrMet: 2.832 ± 0.302
2.989ThrAsn: 2.989 ± 0.455
2.832ThrPro: 2.832 ± 0.717
2.045ThrGln: 2.045 ± 0.329
2.989ThrArg: 2.989 ± 0.225
4.405ThrSer: 4.405 ± 0.134
5.192ThrThr: 5.192 ± 0.158
4.877ThrVal: 4.877 ± 0.707
1.101ThrTrp: 1.101 ± 0.136
2.989ThrTyr: 2.989 ± 0.795
0.0ThrXaa: 0.0 ± 0.0
Val
6.451ValAla: 6.451 ± 1.236
0.472ValCys: 0.472 ± 0.447
2.989ValAsp: 2.989 ± 0.455
5.192ValGlu: 5.192 ± 0.522
0.944ValPhe: 0.944 ± 0.466
4.877ValGly: 4.877 ± 0.027
2.36ValHis: 2.36 ± 0.145
2.832ValIle: 2.832 ± 0.038
4.563ValLys: 4.563 ± 0.468
5.349ValLeu: 5.349 ± 2.639
1.573ValMet: 1.573 ± 0.096
3.304ValAsn: 3.304 ± 0.409
4.563ValPro: 4.563 ± 0.212
2.36ValGln: 2.36 ± 0.195
3.304ValArg: 3.304 ± 0.61
5.979ValSer: 5.979 ± 0.91
4.72ValThr: 4.72 ± 0.73
4.563ValVal: 4.563 ± 1.488
1.573ValTrp: 1.573 ± 0.096
1.101ValTyr: 1.101 ± 0.476
0.0ValXaa: 0.0 ± 0.0
Trp
1.416TrpAla: 1.416 ± 0.321
0.787TrpCys: 0.787 ± 0.388
1.573TrpAsp: 1.573 ± 0.096
0.472TrpGlu: 0.472 ± 0.107
1.101TrpPhe: 1.101 ± 0.816
1.101TrpGly: 1.101 ± 0.816
0.944TrpHis: 0.944 ± 0.554
0.472TrpIle: 0.472 ± 0.447
0.472TrpLys: 0.472 ± 0.233
1.731TrpLeu: 1.731 ± 0.166
0.472TrpMet: 0.472 ± 0.447
1.259TrpAsn: 1.259 ± 0.059
1.731TrpPro: 1.731 ± 1.186
0.944TrpGln: 0.944 ± 0.126
1.259TrpArg: 1.259 ± 0.059
1.573TrpSer: 1.573 ± 0.583
1.101TrpThr: 1.101 ± 0.203
1.259TrpVal: 1.259 ± 0.059
0.315TrpTrp: 0.315 ± 0.185
0.472TrpTyr: 0.472 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.832TyrAla: 2.832 ± 0.302
1.259TyrCys: 1.259 ± 0.621
2.36TyrAsp: 2.36 ± 0.485
2.517TyrGlu: 2.517 ± 0.118
0.629TyrPhe: 0.629 ± 0.029
2.36TyrGly: 2.36 ± 0.875
1.101TyrHis: 1.101 ± 0.203
1.731TyrIle: 1.731 ± 0.846
2.36TyrLys: 2.36 ± 0.824
3.147TyrLeu: 3.147 ± 1.213
1.259TyrMet: 1.259 ± 0.399
2.675TyrAsn: 2.675 ± 1.4
1.259TyrPro: 1.259 ± 0.399
0.944TyrGln: 0.944 ± 0.554
2.203TyrArg: 2.203 ± 0.273
0.787TyrSer: 0.787 ± 0.388
1.888TyrThr: 1.888 ± 0.252
1.573TyrVal: 1.573 ± 0.583
0.472TyrTrp: 0.472 ± 0.447
0.944TyrTyr: 0.944 ± 0.894
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.157XaaVal: 0.157 ± 0.078
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (6357 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski