Amino acid dipeptide frequency for Pea streak virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
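The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used to build the database: it assumes one-letter amino acid codes (the table uses three-letter codes), counts overlapping pairs within each protein, maps any non-standard letter to X (Xaa), and compares each value with the uniform expectation of 1/400. The function names `dipeptide_permille` and `classify` are hypothetical, and the database may normalise its counts slightly differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # the 20 standard amino acids
ALPHABET = STANDARD + "X"              # X stands for Xaa (non-standard/unknown)
EXPECTED_PERMILLE = 1000.0 / 400       # uniform expectation: 2.5 per mille = 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted within each protein only,
    and the total number of counted pairs is used as the denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X before counting.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"       # would be marked in red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"      # would be marked in blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAAC", "CA"])   # toy sequences, not real data
    print(freqs["AA"], classify(freqs["AA"]))
    print(4.863 * 1e-3)                          # AlaAla from the table as a fraction
```

With real data, the list passed to `dipeptide_permille` would hold the protein sequences of the proteome, for example read from a FASTA file.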

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency in ‰ ± the estimated error.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.863 ± 0.877 | 0.748 ± 0.56 | 7.482 ± 0.894 | 5.612 ± 2.336 | 2.993 ± 0.766 | 3.367 ± 1.172 | 0.374 ± 0.178 | 7.108 ± 1.608 | 5.612 ± 2.133 | 8.605 ± 2.234 | 1.122 ± 0.503 | 1.871 ± 0.568 | 1.496 ± 0.9 | 1.122 ± 0.447 | 4.863 ± 2.134 | 4.115 ± 2.285 | 2.245 ± 0.309 | 4.115 ± 0.859 | 0.748 ± 0.482 | 1.122 ± 0.534 | 0.0 ± 0.0 |
| Cys | 2.245 ± 1.068 | 0.748 ± 1.424 | 1.122 ± 0.908 | 2.619 ± 0.838 | 1.496 ± 0.48 | 2.245 ± 0.309 | 0.374 ± 0.758 | 0.748 ± 0.356 | 0.748 ± 0.356 | 0.748 ± 0.356 | 0.0 ± 0.529 | 1.122 ± 0.626 | 0.374 ± 0.661 | 0.748 ± 0.356 | 1.496 ± 0.506 | 2.245 ± 2.287 | 2.619 ± 1.563 | 2.993 ± 1.345 | 0.0 ± 0.0 | 0.748 ± 0.356 | 0.0 ± 0.0 |
| Asp | 2.619 ± 1.605 | 2.245 ± 1.068 | 2.619 ± 0.803 | 2.993 ± 0.403 | 4.115 ± 1.05 | 2.993 ± 1.171 | 1.496 ± 0.48 | 2.619 ± 0.91 | 2.619 ± 0.803 | 5.612 ± 1.134 | 1.496 ± 0.712 | 3.367 ± 1.298 | 1.496 ± 0.48 | 1.871 ± 0.89 | 2.619 ± 0.993 | 1.496 ± 0.964 | 2.619 ± 1.104 | 4.115 ± 0.712 | 1.496 ± 0.48 | 2.245 ± 0.694 | 0.0 ± 0.0 |
| Glu | 5.612 ± 1.195 | 1.871 ± 0.396 | 2.245 ± 1.068 | 8.605 ± 2.956 | 4.863 ± 3.102 | 5.986 ± 0.797 | 1.122 ± 0.503 | 5.238 ± 1.615 | 5.238 ± 1.025 | 5.986 ± 1.229 | 0.748 ± 0.356 | 2.245 ± 1.006 | 2.619 ± 1.615 | 3.367 ± 1.509 | 4.489 ± 1.028 | 7.482 ± 2.093 | 0.748 ± 0.356 | 5.238 ± 1.676 | 0.374 ± 0.178 | 3.367 ± 0.865 | 0.0 ± 0.0 |
| Phe | 2.245 ± 0.979 | 0.374 ± 0.178 | 2.993 ± 0.766 | 5.238 ± 1.82 | 1.871 ± 0.568 | 4.863 ± 0.594 | 0.748 ± 0.482 | 3.367 ± 1.955 | 1.496 ± 0.629 | 8.23 ± 2.854 | 1.122 ± 0.534 | 2.619 ± 0.993 | 1.496 ± 0.48 | 1.871 ± 0.396 | 2.245 ± 0.694 | 4.489 ± 1.338 | 3.741 ± 0.922 | 2.245 ± 0.672 | 0.748 ± 0.56 | 2.993 ± 1.12 | 0.0 ± 0.0 |
| Gly | 3.367 ± 1.091 | 1.871 ± 1.356 | 4.489 ± 1.096 | 4.115 ± 1.471 | 2.993 ± 1.1 | 3.367 ± 1.12 | 0.374 ± 0.178 | 2.619 ± 0.708 | 6.36 ± 2.146 | 7.482 ± 0.802 | 0.748 ± 0.56 | 2.245 ± 1.068 | 1.122 ± 0.626 | 1.122 ± 0.534 | 1.871 ± 0.569 | 6.36 ± 2.217 | 2.245 ± 0.77 | 5.612 ± 2.099 | 1.122 ± 1.212 | 1.871 ± 0.89 | 0.0 ± 0.0 |
| His | 1.496 ± 0.629 | 0.0 ± 0.0 | 1.122 ± 0.534 | 0.374 ± 0.178 | 0.374 ± 0.178 | 0.748 ± 0.672 | 0.748 ± 0.482 | 0.748 ± 0.356 | 1.496 ± 1.183 | 3.741 ± 1.779 | 0.748 ± 0.579 | 1.496 ± 0.964 | 0.374 ± 0.178 | 0.0 ± 0.0 | 2.245 ± 0.894 | 2.993 ± 1.258 | 0.374 ± 0.178 | 1.496 ± 1.505 | 0.374 ± 0.178 | 0.748 ± 0.356 | 0.0 ± 0.0 |
| Ile | 6.36 ± 1.871 | 1.496 ± 0.48 | 3.367 ± 1.038 | 5.612 ± 2.078 | 2.619 ± 1.368 | 4.489 ± 2.04 | 1.496 ± 1.344 | 5.986 ± 2.927 | 3.741 ± 1.321 | 5.986 ± 2.034 | 2.245 ± 1.346 | 2.245 ± 0.845 | 1.871 ± 0.89 | 2.619 ± 1.403 | 3.367 ± 1.941 | 4.115 ± 1.155 | 1.871 ± 0.984 | 3.367 ± 1.166 | 0.374 ± 0.661 | 1.496 ± 0.712 | 0.0 ± 0.0 |
| Lys | 4.489 ± 1.344 | 1.496 ± 1.367 | 2.619 ± 0.887 | 3.367 ± 1.509 | 3.741 ± 0.695 | 3.367 ± 1.367 | 0.748 ± 0.356 | 5.238 ± 1.195 | 3.741 ± 1.266 | 7.856 ± 1.247 | 1.871 ± 1.405 | 2.245 ± 0.898 | 2.245 ± 1.698 | 2.245 ± 0.672 | 3.367 ± 1.105 | 5.612 ± 0.815 | 5.238 ± 1.152 | 3.367 ± 0.865 | 0.374 ± 0.178 | 0.748 ± 0.356 | 0.0 ± 0.0 |
| Leu | 5.986 ± 1.926 | 2.245 ± 1.068 | 5.238 ± 0.623 | 8.605 ± 1.139 | 4.115 ± 1.957 | 7.856 ± 1.29 | 4.115 ± 1.471 | 8.605 ± 2.042 | 10.849 ± 1.772 | 9.727 ± 3.399 | 2.993 ± 0.993 | 4.863 ± 2.313 | 5.612 ± 1.833 | 3.367 ± 1.397 | 5.986 ± 1.222 | 3.741 ± 1.766 | 6.36 ± 1.351 | 6.36 ± 1.626 | 0.374 ± 1.236 | 1.496 ± 0.712 | 0.0 ± 0.0 |
| Met | 3.367 ± 1.06 | 0.374 ± 0.178 | 1.122 ± 0.447 | 2.245 ± 1.068 | 0.748 ± 0.56 | 0.374 ± 0.178 | 0.374 ± 0.661 | 0.0 ± 0.0 | 0.374 ± 0.661 | 2.245 ± 0.694 | 0.748 ± 1.148 | 0.748 ± 0.356 | 1.496 ± 0.9 | 1.122 ± 0.908 | 1.871 ± 2.224 | 1.122 ± 1.082 | 0.748 ± 0.356 | 0.374 ± 0.178 | 0.0 ± 0.0 | 0.374 ± 0.661 | 0.0 ± 0.0 |
| Asn | 2.993 ± 0.766 | 2.993 ± 1.033 | 1.122 ± 1.212 | 2.619 ± 1.246 | 2.245 ± 1.068 | 2.619 ± 0.803 | 1.496 ± 0.48 | 1.871 ± 1.586 | 2.245 ± 0.706 | 5.986 ± 1.721 | 0.748 ± 0.56 | 2.245 ± 2.935 | 2.993 ± 1.012 | 1.122 ± 1.344 | 2.619 ± 1.246 | 2.993 ± 0.967 | 1.122 ± 1.198 | 0.374 ± 0.178 | 1.871 ± 1.45 | 2.993 ± 1.012 | 0.0 ± 0.0 |
| Pro | 2.619 ± 1.368 | 0.374 ± 0.178 | 1.871 ± 0.748 | 3.367 ± 2.056 | 1.871 ± 0.569 | 2.245 ± 0.694 | 1.496 ± 1.344 | 2.245 ± 0.845 | 2.993 ± 1.179 | 2.619 ± 0.708 | 0.374 ± 0.178 | 1.122 ± 0.626 | 2.245 ± 1.34 | 1.122 ± 0.503 | 1.871 ± 1.034 | 1.122 ± 1.464 | 1.871 ± 1.889 | 1.122 ± 1.082 | 0.374 ± 0.178 | 1.496 ± 0.712 | 0.0 ± 0.0 |
| Gln | 1.871 ± 1.356 | 1.496 ± 0.506 | 2.993 ± 0.403 | 1.496 ± 0.48 | 1.496 ± 0.712 | 1.871 ± 0.568 | 0.374 ± 0.178 | 2.619 ± 1.057 | 1.496 ± 1.871 | 3.367 ± 1.172 | 0.374 ± 0.178 | 2.245 ± 0.923 | 0.748 ± 0.56 | 1.496 ± 1.871 | 0.748 ± 0.482 | 2.993 ± 0.949 | 1.122 ± 0.534 | 1.122 ± 1.212 | 0.374 ± 0.178 | 0.374 ± 0.758 | 0.0 ± 0.0 |
| Arg | 6.36 ± 1.189 | 1.496 ± 0.712 | 2.245 ± 1.006 | 3.367 ± 0.539 | 5.238 ± 0.987 | 3.367 ± 1.155 | 0.374 ± 0.178 | 1.496 ± 0.712 | 1.871 ± 1.08 | 5.612 ± 1.93 | 0.748 ± 0.356 | 2.993 ± 0.766 | 1.122 ± 1.212 | 1.871 ± 0.396 | 2.993 ± 1.562 | 4.863 ± 1.357 | 2.619 ± 1.676 | 2.993 ± 1.533 | 0.748 ± 0.356 | 3.367 ± 1.06 | 0.0 ± 0.0 |
| Ser | 4.489 ± 0.727 | 1.122 ± 2.025 | 4.115 ± 1.231 | 5.238 ± 2.036 | 2.619 ± 0.739 | 4.489 ± 1.388 | 2.245 ± 0.694 | 3.741 ± 2.815 | 7.108 ± 1.518 | 7.482 ± 1.113 | 0.748 ± 0.404 | 3.741 ± 1.716 | 3.367 ± 1.329 | 1.496 ± 0.878 | 4.489 ± 0.804 | 5.612 ± 1.783 | 2.619 ± 2.398 | 3.741 ± 1.533 | 0.0 ± 0.0 | 1.871 ± 0.89 | 0.0 ± 0.0 |
| Thr | 1.871 ± 1.769 | 0.374 ± 0.573 | 1.496 ± 0.712 | 3.741 ± 1.246 | 6.734 ± 2.073 | 2.993 ± 0.839 | 1.496 ± 0.506 | 2.993 ± 0.868 | 2.245 ± 0.77 | 5.986 ± 2.615 | 0.748 ± 0.356 | 2.245 ± 1.34 | 0.748 ± 0.482 | 1.122 ± 0.908 | 2.619 ± 1.612 | 2.245 ± 2.453 | 1.871 ± 0.89 | 1.496 ± 0.48 | 0.0 ± 0.0 | 2.245 ± 1.068 | 0.0 ± 0.0 |
| Val | 3.367 ± 0.705 | 2.619 ± 1.178 | 2.619 ± 1.246 | 5.612 ± 1.193 | 2.993 ± 1.354 | 1.871 ± 0.812 | 1.496 ± 0.48 | 4.489 ± 1.327 | 2.619 ± 1.925 | 5.986 ± 1.344 | 0.748 ± 1.148 | 2.993 ± 1.012 | 1.871 ± 0.984 | 2.619 ± 0.994 | 2.993 ± 1.185 | 3.367 ± 0.565 | 2.993 ± 0.403 | 4.863 ± 2.846 | 0.374 ± 0.178 | 0.374 ± 0.178 | 0.0 ± 0.0 |
| Trp | 0.374 ± 0.661 | 0.374 ± 0.178 | 0.374 ± 0.661 | 0.748 ± 0.849 | 1.122 ± 0.626 | 0.0 ± 0.0 | 0.374 ± 0.178 | 0.374 ± 0.178 | 0.0 ± 0.0 | 1.496 ± 1.043 | 0.374 ± 0.178 | 1.122 ± 1.212 | 0.374 ± 0.178 | 0.0 ± 0.0 | 0.374 ± 0.661 | 1.122 ± 0.534 | 0.374 ± 0.573 | 1.122 ± 0.534 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 2.993 ± 0.839 | 1.496 ± 0.506 | 1.496 ± 0.712 | 1.871 ± 0.89 | 0.748 ± 0.356 | 2.245 ± 1.068 | 0.374 ± 0.758 | 2.619 ± 0.312 | 1.122 ± 0.626 | 3.741 ± 1.332 | 0.748 ± 0.56 | 1.496 ± 0.48 | 0.748 ± 0.56 | 0.374 ± 0.178 | 2.619 ± 1.246 | 2.245 ± 1.068 | 1.871 ± 0.568 | 0.748 ± 0.356 | 0.374 ± 0.178 | 1.122 ± 0.534 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 6 proteins (2674 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
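Protein-level bootstrapping can be illustrated with the following minimal sketch. It is written under assumptions and is not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the choice of the standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequency (per mille) of every dipeptide observed in `proteins`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over resamples.
    """
    rng = random.Random(seed)            # fixed seed so the sketch is repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAC", "CAAC", "ACAC"]   # toy sequences, not real data
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with a proteome of only 6 proteins the estimated errors can be large relative to the frequencies themselves.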

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski