Amino acid dipepetide frequency for Brazoran virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.317AlaAla: 2.317 ± 1.04
0.927AlaCys: 0.927 ± 0.272
3.476AlaAsp: 3.476 ± 0.646
2.549AlaGlu: 2.549 ± 0.885
2.086AlaPhe: 2.086 ± 0.556
2.086AlaGly: 2.086 ± 2.51
0.463AlaHis: 0.463 ± 0.136
4.635AlaIle: 4.635 ± 0.5
5.794AlaLys: 5.794 ± 0.465
5.562AlaLeu: 5.562 ± 0.772
2.781AlaMet: 2.781 ± 0.959
2.781AlaAsn: 2.781 ± 0.543
1.39AlaPro: 1.39 ± 1.66
1.159AlaGln: 1.159 ± 0.272
3.708AlaArg: 3.708 ± 0.589
3.013AlaSer: 3.013 ± 0.87
2.317AlaThr: 2.317 ± 0.774
2.317AlaVal: 2.317 ± 0.275
0.232AlaTrp: 0.232 ± 0.147
2.317AlaTyr: 2.317 ± 0.774
0.0AlaXaa: 0.0 ± 0.0
Cys
1.159CysAla: 1.159 ± 0.302
0.0CysCys: 0.0 ± 0.0
0.695CysAsp: 0.695 ± 0.615
0.463CysGlu: 0.463 ± 0.41
1.622CysPhe: 1.622 ± 0.828
2.781CysGly: 2.781 ± 1.262
1.159CysHis: 1.159 ± 0.272
3.013CysIle: 3.013 ± 0.803
2.549CysLys: 2.549 ± 1.67
1.854CysLeu: 1.854 ± 0.813
0.695CysMet: 0.695 ± 0.315
1.854CysAsn: 1.854 ± 0.49
1.159CysPro: 1.159 ± 0.879
2.086CysGln: 2.086 ± 0.761
1.159CysArg: 1.159 ± 0.302
1.854CysSer: 1.854 ± 0.594
1.39CysThr: 1.39 ± 0.783
1.854CysVal: 1.854 ± 1.153
0.232CysTrp: 0.232 ± 0.147
0.927CysTyr: 0.927 ± 0.317
0.0CysXaa: 0.0 ± 0.0
Asp
3.013AspAla: 3.013 ± 0.525
0.927AspCys: 0.927 ± 0.272
3.476AspAsp: 3.476 ± 0.416
3.476AspGlu: 3.476 ± 1.274
4.635AspPhe: 4.635 ± 2.09
1.622AspGly: 1.622 ± 0.199
0.463AspHis: 0.463 ± 0.136
6.952AspIle: 6.952 ± 1.348
3.94AspLys: 3.94 ± 0.725
5.562AspLeu: 5.562 ± 1.376
1.622AspMet: 1.622 ± 0.199
3.244AspAsn: 3.244 ± 0.724
2.781AspPro: 2.781 ± 0.735
2.317AspGln: 2.317 ± 0.241
3.013AspArg: 3.013 ± 1.044
1.39AspSer: 1.39 ± 0.39
2.086AspThr: 2.086 ± 0.703
3.013AspVal: 3.013 ± 0.675
0.695AspTrp: 0.695 ± 0.411
2.086AspTyr: 2.086 ± 0.584
0.0AspXaa: 0.0 ± 0.0
Glu
3.476GluAla: 3.476 ± 0.416
2.086GluCys: 2.086 ± 0.946
3.94GluAsp: 3.94 ± 1.019
3.013GluGlu: 3.013 ± 0.358
3.244GluPhe: 3.244 ± 0.652
2.549GluGly: 2.549 ± 0.445
1.39GluHis: 1.39 ± 0.783
6.025GluIle: 6.025 ± 1.78
5.33GluLys: 5.33 ± 0.801
5.098GluLeu: 5.098 ± 0.828
2.086GluMet: 2.086 ± 0.739
2.317GluAsn: 2.317 ± 1.434
1.39GluPro: 1.39 ± 0.88
3.244GluGln: 3.244 ± 1.425
2.781GluArg: 2.781 ± 0.543
3.708GluSer: 3.708 ± 0.588
2.317GluThr: 2.317 ± 0.604
2.781GluVal: 2.781 ± 0.907
0.927GluTrp: 0.927 ± 0.317
2.317GluTyr: 2.317 ± 1.174
0.0GluXaa: 0.0 ± 0.0
Phe
0.927PheAla: 0.927 ± 0.469
2.317PheCys: 2.317 ± 0.908
3.244PheAsp: 3.244 ± 0.998
3.244PheGlu: 3.244 ± 0.985
2.781PhePhe: 2.781 ± 0.543
2.086PheGly: 2.086 ± 1.023
0.695PheHis: 0.695 ± 0.195
3.708PheIle: 3.708 ± 0.348
4.867PheLys: 4.867 ± 1.718
4.867PheLeu: 4.867 ± 1.466
1.854PheMet: 1.854 ± 0.544
2.549PheAsn: 2.549 ± 1.048
1.622PhePro: 1.622 ± 0.85
1.622PheGln: 1.622 ± 0.923
2.317PheArg: 2.317 ± 0.46
3.708PheSer: 3.708 ± 0.839
3.244PheThr: 3.244 ± 0.951
2.317PheVal: 2.317 ± 0.275
0.463PheTrp: 0.463 ± 0.293
2.549PheTyr: 2.549 ± 1.345
0.0PheXaa: 0.0 ± 0.0
Gly
3.244GlyAla: 3.244 ± 2.201
1.622GlyCys: 1.622 ± 0.828
2.549GlyAsp: 2.549 ± 0.648
3.244GlyGlu: 3.244 ± 0.828
1.854GlyPhe: 1.854 ± 0.602
0.927GlyGly: 0.927 ± 0.272
0.695GlyHis: 0.695 ± 0.195
3.476GlyIle: 3.476 ± 0.274
1.854GlyLys: 1.854 ± 0.432
4.171GlyLeu: 4.171 ± 1.02
0.927GlyMet: 0.927 ± 0.886
3.013GlyAsn: 3.013 ± 0.79
1.854GlyPro: 1.854 ± 0.564
1.854GlyGln: 1.854 ± 0.561
1.854GlyArg: 1.854 ± 0.381
2.781GlySer: 2.781 ± 1.372
2.549GlyThr: 2.549 ± 1.51
2.317GlyVal: 2.317 ± 0.68
0.695GlyTrp: 0.695 ± 0.195
1.854GlyTyr: 1.854 ± 1.283
0.0GlyXaa: 0.0 ± 0.0
His
0.232HisAla: 0.232 ± 0.147
0.232HisCys: 0.232 ± 0.205
0.927HisAsp: 0.927 ± 0.514
0.0HisGlu: 0.0 ± 0.0
1.159HisPhe: 1.159 ± 0.454
0.927HisGly: 0.927 ± 0.317
1.39HisHis: 1.39 ± 0.188
1.622HisIle: 1.622 ± 0.57
1.622HisLys: 1.622 ± 0.679
0.927HisLeu: 0.927 ± 0.587
0.463HisMet: 0.463 ± 0.136
1.39HisAsn: 1.39 ± 0.595
1.159HisPro: 1.159 ± 0.529
0.463HisGln: 0.463 ± 0.136
0.463HisArg: 0.463 ± 0.136
3.013HisSer: 3.013 ± 0.962
2.086HisThr: 2.086 ± 0.536
0.927HisVal: 0.927 ± 0.587
0.463HisTrp: 0.463 ± 0.434
1.159HisTyr: 1.159 ± 0.44
0.0HisXaa: 0.0 ± 0.0
Ile
4.635IleAla: 4.635 ± 1.26
1.622IleCys: 1.622 ± 0.828
5.098IleAsp: 5.098 ± 1.854
5.33IleGlu: 5.33 ± 0.568
3.94IlePhe: 3.94 ± 1.024
2.086IleGly: 2.086 ± 0.421
1.622IleHis: 1.622 ± 1.027
5.562IleIle: 5.562 ± 1.57
7.184IleLys: 7.184 ± 1.405
6.489IleLeu: 6.489 ± 1.902
3.476IleMet: 3.476 ± 0.305
5.562IleAsn: 5.562 ± 0.605
4.403IlePro: 4.403 ± 1.206
5.33IleGln: 5.33 ± 1.287
3.244IleArg: 3.244 ± 1.01
6.952IleSer: 6.952 ± 1.521
3.013IleThr: 3.013 ± 0.433
4.403IleVal: 4.403 ± 0.733
0.463IleTrp: 0.463 ± 0.293
3.476IleTyr: 3.476 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
3.94LysAla: 3.94 ± 1.024
2.549LysCys: 2.549 ± 1.432
4.403LysAsp: 4.403 ± 0.976
5.794LysGlu: 5.794 ± 0.867
2.317LysPhe: 2.317 ± 0.979
3.708LysGly: 3.708 ± 1.255
2.086LysHis: 2.086 ± 0.584
5.33LysIle: 5.33 ± 0.673
3.708LysLys: 3.708 ± 0.98
6.025LysLeu: 6.025 ± 1.239
3.708LysMet: 3.708 ± 0.579
4.171LysAsn: 4.171 ± 0.686
3.708LysPro: 3.708 ± 0.335
2.317LysGln: 2.317 ± 0.604
2.549LysArg: 2.549 ± 0.587
5.562LysSer: 5.562 ± 1.578
4.171LysThr: 4.171 ± 1.092
3.94LysVal: 3.94 ± 0.738
0.463LysTrp: 0.463 ± 0.293
3.708LysTyr: 3.708 ± 0.713
0.0LysXaa: 0.0 ± 0.0
Leu
3.476LeuAla: 3.476 ± 0.457
2.086LeuCys: 2.086 ± 1.271
5.098LeuAsp: 5.098 ± 1.843
5.794LeuGlu: 5.794 ± 0.361
3.94LeuPhe: 3.94 ± 1.145
3.244LeuGly: 3.244 ± 0.48
2.317LeuHis: 2.317 ± 0.881
8.111LeuIle: 8.111 ± 1.52
5.098LeuLys: 5.098 ± 0.989
8.806LeuLeu: 8.806 ± 1.759
1.854LeuMet: 1.854 ± 0.295
3.708LeuAsn: 3.708 ± 0.588
4.867LeuPro: 4.867 ± 1.741
4.403LeuGln: 4.403 ± 2.076
3.94LeuArg: 3.94 ± 1.187
6.952LeuSer: 6.952 ± 1.961
7.416LeuThr: 7.416 ± 1.607
3.94LeuVal: 3.94 ± 0.711
0.463LeuTrp: 0.463 ± 0.401
3.013LeuTyr: 3.013 ± 0.256
0.0LeuXaa: 0.0 ± 0.0
Met
1.39MetAla: 1.39 ± 0.188
0.927MetCys: 0.927 ± 0.514
2.317MetAsp: 2.317 ± 1.174
1.854MetGlu: 1.854 ± 0.634
0.927MetPhe: 0.927 ± 0.617
0.927MetGly: 0.927 ± 0.272
0.0MetHis: 0.0 ± 0.0
2.317MetIle: 2.317 ± 0.508
2.549MetLys: 2.549 ± 1.048
2.086MetLeu: 2.086 ± 0.421
0.927MetMet: 0.927 ± 0.317
2.549MetAsn: 2.549 ± 0.241
1.39MetPro: 1.39 ± 0.408
1.622MetGln: 1.622 ± 0.263
1.854MetArg: 1.854 ± 0.602
2.549MetSer: 2.549 ± 0.405
3.013MetThr: 3.013 ± 0.554
2.086MetVal: 2.086 ± 0.769
0.0MetTrp: 0.0 ± 0.0
1.159MetTyr: 1.159 ± 1.218
0.0MetXaa: 0.0 ± 0.0
Asn
2.549AsnAla: 2.549 ± 1.013
1.159AsnCys: 1.159 ± 0.838
3.94AsnAsp: 3.94 ± 1.401
3.244AsnGlu: 3.244 ± 0.852
2.781AsnPhe: 2.781 ± 0.779
2.086AsnGly: 2.086 ± 0.946
1.39AsnHis: 1.39 ± 0.39
3.013AsnIle: 3.013 ± 0.89
5.794AsnLys: 5.794 ± 1.63
5.098AsnLeu: 5.098 ± 0.515
1.39AsnMet: 1.39 ± 0.595
2.086AsnAsn: 2.086 ± 0.736
2.317AsnPro: 2.317 ± 0.241
1.854AsnGln: 1.854 ± 0.97
1.622AsnArg: 1.622 ± 0.709
3.244AsnSer: 3.244 ± 0.794
1.622AsnThr: 1.622 ± 0.199
1.39AsnVal: 1.39 ± 0.391
0.463AsnTrp: 0.463 ± 0.293
3.013AsnTyr: 3.013 ± 1.334
0.0AsnXaa: 0.0 ± 0.0
Pro
3.708ProAla: 3.708 ± 0.906
0.232ProCys: 0.232 ± 0.451
3.013ProAsp: 3.013 ± 0.35
2.781ProGlu: 2.781 ± 0.543
1.622ProPhe: 1.622 ± 0.421
2.549ProGly: 2.549 ± 1.547
1.159ProHis: 1.159 ± 0.717
3.94ProIle: 3.94 ± 0.872
1.39ProLys: 1.39 ± 0.188
3.013ProLeu: 3.013 ± 1.474
0.927ProMet: 0.927 ± 0.778
1.159ProAsn: 1.159 ± 0.293
1.159ProPro: 1.159 ± 0.293
2.781ProGln: 2.781 ± 3.319
1.622ProArg: 1.622 ± 0.674
1.854ProSer: 1.854 ± 0.602
2.781ProThr: 2.781 ± 0.833
2.317ProVal: 2.317 ± 0.68
0.463ProTrp: 0.463 ± 0.293
1.39ProTyr: 1.39 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
2.317GlnAla: 2.317 ± 1.431
1.39GlnCys: 1.39 ± 0.672
2.317GlnAsp: 2.317 ± 0.832
3.013GlnGlu: 3.013 ± 0.783
1.854GlnPhe: 1.854 ± 0.168
2.317GlnGly: 2.317 ± 1.782
0.927GlnHis: 0.927 ± 0.728
2.549GlnIle: 2.549 ± 0.94
3.013GlnLys: 3.013 ± 0.49
4.867GlnLeu: 4.867 ± 1.721
1.39GlnMet: 1.39 ± 0.592
2.086GlnAsn: 2.086 ± 0.83
2.086GlnPro: 2.086 ± 1.66
3.244GlnGln: 3.244 ± 5.183
1.622GlnArg: 1.622 ± 0.739
3.244GlnSer: 3.244 ± 1.708
3.476GlnThr: 3.476 ± 1.119
2.086GlnVal: 2.086 ± 1.023
0.695GlnTrp: 0.695 ± 0.341
1.854GlnTyr: 1.854 ± 1.047
0.0GlnXaa: 0.0 ± 0.0
Arg
1.622ArgAla: 1.622 ± 0.674
2.086ArgCys: 2.086 ± 0.536
3.708ArgAsp: 3.708 ± 1.502
2.549ArgGlu: 2.549 ± 0.241
1.854ArgPhe: 1.854 ± 0.168
1.159ArgGly: 1.159 ± 0.524
0.695ArgHis: 0.695 ± 0.466
4.867ArgIle: 4.867 ± 1.693
3.013ArgLys: 3.013 ± 0.358
4.867ArgLeu: 4.867 ± 1.005
1.39ArgMet: 1.39 ± 0.39
1.622ArgAsn: 1.622 ± 0.739
0.463ArgPro: 0.463 ± 0.41
1.39ArgGln: 1.39 ± 1.662
2.086ArgArg: 2.086 ± 0.174
2.781ArgSer: 2.781 ± 0.782
2.317ArgThr: 2.317 ± 0.746
2.781ArgVal: 2.781 ± 0.546
0.463ArgTrp: 0.463 ± 0.478
2.086ArgTyr: 2.086 ± 0.596
0.0ArgXaa: 0.0 ± 0.0
Ser
5.098SerAla: 5.098 ± 1.374
3.013SerCys: 3.013 ± 1.805
3.013SerAsp: 3.013 ± 0.685
4.171SerGlu: 4.171 ± 0.686
3.244SerPhe: 3.244 ± 0.813
2.549SerGly: 2.549 ± 0.807
0.927SerHis: 0.927 ± 0.317
6.721SerIle: 6.721 ± 0.595
5.562SerLys: 5.562 ± 1.161
8.806SerLeu: 8.806 ± 1.599
1.854SerMet: 1.854 ± 0.853
3.013SerAsn: 3.013 ± 1.194
2.086SerPro: 2.086 ± 0.342
2.549SerGln: 2.549 ± 0.94
2.317SerArg: 2.317 ± 0.697
5.33SerSer: 5.33 ± 1.572
4.171SerThr: 4.171 ± 0.663
3.244SerVal: 3.244 ± 0.842
0.463SerTrp: 0.463 ± 0.136
2.781SerTyr: 2.781 ± 0.88
0.0SerXaa: 0.0 ± 0.0
Thr
3.013ThrAla: 3.013 ± 0.554
2.086ThrCys: 2.086 ± 0.536
1.854ThrAsp: 1.854 ± 0.436
2.781ThrGlu: 2.781 ± 0.732
3.94ThrPhe: 3.94 ± 0.744
4.635ThrGly: 4.635 ± 1.265
1.622ThrHis: 1.622 ± 0.426
4.635ThrIle: 4.635 ± 0.5
3.476ThrLys: 3.476 ± 0.833
1.854ThrLeu: 1.854 ± 1.119
1.854ThrMet: 1.854 ± 0.634
1.622ThrAsn: 1.622 ± 0.505
2.781ThrPro: 2.781 ± 1.159
3.244ThrGln: 3.244 ± 1.349
3.244ThrArg: 3.244 ± 0.562
5.562ThrSer: 5.562 ± 2.058
3.244ThrThr: 3.244 ± 0.852
2.317ThrVal: 2.317 ± 0.586
0.695ThrTrp: 0.695 ± 0.195
3.708ThrTyr: 3.708 ± 1.11
0.0ThrXaa: 0.0 ± 0.0
Val
3.013ValAla: 3.013 ± 0.525
1.854ValCys: 1.854 ± 0.594
1.159ValAsp: 1.159 ± 0.302
3.476ValGlu: 3.476 ± 0.699
4.403ValPhe: 4.403 ± 0.436
2.549ValGly: 2.549 ± 0.688
0.695ValHis: 0.695 ± 0.44
3.013ValIle: 3.013 ± 0.525
3.476ValLys: 3.476 ± 1.478
4.171ValLeu: 4.171 ± 0.728
1.159ValMet: 1.159 ± 0.302
2.317ValAsn: 2.317 ± 1.597
1.854ValPro: 1.854 ± 0.544
2.549ValGln: 2.549 ± 0.534
1.622ValArg: 1.622 ± 0.508
4.867ValSer: 4.867 ± 1.234
2.317ValThr: 2.317 ± 0.832
2.317ValVal: 2.317 ± 0.68
0.463ValTrp: 0.463 ± 0.41
1.159ValTyr: 1.159 ± 0.454
0.0ValXaa: 0.0 ± 0.0
Trp
0.695TrpAla: 0.695 ± 0.315
0.0TrpCys: 0.0 ± 0.0
0.463TrpAsp: 0.463 ± 0.401
0.463TrpGlu: 0.463 ± 0.293
0.463TrpPhe: 0.463 ± 0.136
0.695TrpGly: 0.695 ± 0.315
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.695TrpLys: 0.695 ± 0.341
1.39TrpLeu: 1.39 ± 0.391
0.0TrpMet: 0.0 ± 0.0
0.232TrpAsn: 0.232 ± 0.147
0.0TrpPro: 0.0 ± 0.0
0.695TrpGln: 0.695 ± 0.44
0.0TrpArg: 0.0 ± 0.0
0.927TrpSer: 0.927 ± 0.587
0.927TrpThr: 0.927 ± 0.469
0.463TrpVal: 0.463 ± 0.293
0.0TrpTrp: 0.0 ± 0.0
0.695TrpTyr: 0.695 ± 0.466
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.086TyrAla: 2.086 ± 0.623
1.39TyrCys: 1.39 ± 0.92
1.622TyrAsp: 1.622 ± 0.828
2.781TyrGlu: 2.781 ± 0.853
2.317TyrPhe: 2.317 ± 0.46
1.854TyrGly: 1.854 ± 0.602
0.927TyrHis: 0.927 ± 0.728
4.403TyrIle: 4.403 ± 0.672
3.244TyrLys: 3.244 ± 0.404
3.013TyrLeu: 3.013 ± 0.785
1.854TyrMet: 1.854 ± 0.634
2.781TyrAsn: 2.781 ± 0.951
1.39TyrPro: 1.39 ± 0.524
1.622TyrGln: 1.622 ± 1.108
2.781TyrArg: 2.781 ± 1.344
1.854TyrSer: 1.854 ± 0.49
3.708TyrThr: 3.708 ± 0.589
1.622TyrVal: 1.622 ± 0.459
0.0TyrTrp: 0.0 ± 0.0
1.622TyrTyr: 1.622 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4316 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski