Amino acid dipepetide frequency for Suffolk virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.324AlaAla: 5.324 ± 1.548
3.253AlaCys: 3.253 ± 0.94
1.775AlaAsp: 1.775 ± 0.803
3.253AlaGlu: 3.253 ± 1.574
3.549AlaPhe: 3.549 ± 1.605
5.915AlaGly: 5.915 ± 0.941
1.775AlaHis: 1.775 ± 0.558
5.028AlaIle: 5.028 ± 2.246
2.958AlaLys: 2.958 ± 0.667
6.507AlaLeu: 6.507 ± 2.241
0.887AlaMet: 0.887 ± 0.642
2.366AlaAsn: 2.366 ± 0.578
4.437AlaPro: 4.437 ± 2.977
2.958AlaGln: 2.958 ± 0.416
3.845AlaArg: 3.845 ± 2.42
6.507AlaSer: 6.507 ± 0.956
5.915AlaThr: 5.915 ± 2.661
5.915AlaVal: 5.915 ± 0.546
1.183AlaTrp: 1.183 ± 0.298
3.253AlaTyr: 3.253 ± 0.925
0.0AlaXaa: 0.0 ± 0.0
Cys
0.592CysAla: 0.592 ± 0.272
0.296CysCys: 0.296 ± 0.161
1.775CysAsp: 1.775 ± 0.868
0.592CysGlu: 0.592 ± 0.323
0.296CysPhe: 0.296 ± 0.381
0.887CysGly: 0.887 ± 0.642
0.592CysHis: 0.592 ± 0.272
0.887CysIle: 0.887 ± 0.484
1.183CysLys: 1.183 ± 0.298
2.662CysLeu: 2.662 ± 1.018
0.887CysMet: 0.887 ± 0.235
0.887CysAsn: 0.887 ± 0.235
1.775CysPro: 1.775 ± 0.558
0.592CysGln: 0.592 ± 0.404
0.887CysArg: 0.887 ± 0.401
1.479CysSer: 1.479 ± 0.91
0.887CysThr: 0.887 ± 0.235
1.775CysVal: 1.775 ± 0.22
0.296CysTrp: 0.296 ± 0.161
1.479CysTyr: 1.479 ± 0.806
0.0CysXaa: 0.0 ± 0.0
Asp
3.845AspAla: 3.845 ± 0.612
1.183AspCys: 1.183 ± 0.545
4.437AspAsp: 4.437 ± 2.18
3.253AspGlu: 3.253 ± 0.576
2.366AspPhe: 2.366 ± 1.184
2.662AspGly: 2.662 ± 1.115
2.07AspHis: 2.07 ± 0.845
2.07AspIle: 2.07 ± 0.821
1.775AspLys: 1.775 ± 0.599
3.845AspLeu: 3.845 ± 1.052
0.592AspMet: 0.592 ± 0.323
0.592AspAsn: 0.592 ± 0.323
4.141AspPro: 4.141 ± 0.388
1.775AspGln: 1.775 ± 0.566
2.958AspArg: 2.958 ± 1.613
0.887AspSer: 0.887 ± 0.401
3.549AspThr: 3.549 ± 1.368
4.141AspVal: 4.141 ± 1.079
0.887AspTrp: 0.887 ± 0.484
1.775AspTyr: 1.775 ± 0.558
0.0AspXaa: 0.0 ± 0.0
Glu
5.915GluAla: 5.915 ± 1.8
0.592GluCys: 0.592 ± 0.323
3.253GluAsp: 3.253 ± 1.007
3.253GluGlu: 3.253 ± 0.397
2.662GluPhe: 2.662 ± 0.217
4.732GluGly: 4.732 ± 0.76
0.887GluHis: 0.887 ± 0.401
2.07GluIle: 2.07 ± 0.759
2.366GluLys: 2.366 ± 0.578
8.282GluLeu: 8.282 ± 3.19
1.479GluMet: 1.479 ± 0.516
1.183GluAsn: 1.183 ± 0.571
1.479GluPro: 1.479 ± 0.806
1.479GluGln: 1.479 ± 0.417
4.141GluArg: 4.141 ± 0.939
2.07GluSer: 2.07 ± 0.819
3.549GluThr: 3.549 ± 0.881
5.028GluVal: 5.028 ± 0.971
1.183GluTrp: 1.183 ± 0.298
1.183GluTyr: 1.183 ± 0.46
0.0GluXaa: 0.0 ± 0.0
Phe
1.775PheAla: 1.775 ± 1.284
0.592PheCys: 0.592 ± 0.323
2.07PheAsp: 2.07 ± 0.512
2.662PheGlu: 2.662 ± 1.204
1.183PhePhe: 1.183 ± 0.298
1.479PheGly: 1.479 ± 0.208
0.296PheHis: 0.296 ± 0.381
1.183PheIle: 1.183 ± 0.298
0.592PheLys: 0.592 ± 0.323
5.915PheLeu: 5.915 ± 0.831
1.479PheMet: 1.479 ± 0.483
3.253PheAsn: 3.253 ± 0.973
2.662PhePro: 2.662 ± 0.603
1.775PheGln: 1.775 ± 1.779
2.366PheArg: 2.366 ± 0.862
0.887PheSer: 0.887 ± 0.235
1.479PheThr: 1.479 ± 0.417
2.07PheVal: 2.07 ± 0.326
0.592PheTrp: 0.592 ± 0.582
1.479PheTyr: 1.479 ± 0.555
0.0PheXaa: 0.0 ± 0.0
Gly
4.732GlyAla: 4.732 ± 0.76
0.887GlyCys: 0.887 ± 0.235
1.183GlyAsp: 1.183 ± 0.3
3.549GlyGlu: 3.549 ± 1.083
4.141GlyPhe: 4.141 ± 0.941
2.958GlyGly: 2.958 ± 1.111
2.366GlyHis: 2.366 ± 0.965
3.253GlyIle: 3.253 ± 0.583
2.958GlyLys: 2.958 ± 0.972
7.394GlyLeu: 7.394 ± 1.26
0.592GlyMet: 0.592 ± 0.272
2.366GlyAsn: 2.366 ± 0.185
2.366GlyPro: 2.366 ± 1.048
2.662GlyGln: 2.662 ± 1.017
2.958GlyArg: 2.958 ± 0.768
2.662GlySer: 2.662 ± 0.995
5.028GlyThr: 5.028 ± 2.43
5.62GlyVal: 5.62 ± 0.771
1.479GlyTrp: 1.479 ± 0.417
1.479GlyTyr: 1.479 ± 0.627
0.0GlyXaa: 0.0 ± 0.0
His
0.887HisAla: 0.887 ± 0.484
0.887HisCys: 0.887 ± 0.235
2.07HisAsp: 2.07 ± 1.129
0.592HisGlu: 0.592 ± 0.323
1.183HisPhe: 1.183 ± 0.545
0.0HisGly: 0.0 ± 0.0
0.592HisHis: 0.592 ± 0.323
1.183HisIle: 1.183 ± 0.645
1.183HisLys: 1.183 ± 0.645
3.549HisLeu: 3.549 ± 0.942
1.183HisMet: 1.183 ± 0.638
1.775HisAsn: 1.775 ± 0.684
1.775HisPro: 1.775 ± 0.904
1.183HisGln: 1.183 ± 0.545
1.183HisArg: 1.183 ± 0.875
1.183HisSer: 1.183 ± 1.02
2.07HisThr: 2.07 ± 0.708
2.07HisVal: 2.07 ± 0.708
0.0HisTrp: 0.0 ± 0.0
0.887HisTyr: 0.887 ± 0.401
0.0HisXaa: 0.0 ± 0.0
Ile
3.253IleAla: 3.253 ± 1.583
0.592IleCys: 0.592 ± 0.323
2.366IleAsp: 2.366 ± 0.185
3.253IleGlu: 3.253 ± 1.154
1.775IlePhe: 1.775 ± 0.22
1.775IleGly: 1.775 ± 0.657
1.183IleHis: 1.183 ± 0.645
3.253IleIle: 3.253 ± 0.478
2.07IleLys: 2.07 ± 0.271
4.732IleLeu: 4.732 ± 1.358
2.366IleMet: 2.366 ± 0.653
0.592IleAsn: 0.592 ± 0.404
1.479IlePro: 1.479 ± 1.311
2.07IleGln: 2.07 ± 0.512
4.437IleArg: 4.437 ± 1.969
3.845IleSer: 3.845 ± 0.426
5.324IleThr: 5.324 ± 1.57
3.253IleVal: 3.253 ± 0.756
0.592IleTrp: 0.592 ± 0.323
1.479IleTyr: 1.479 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
2.958LysAla: 2.958 ± 0.938
2.07LysCys: 2.07 ± 0.748
1.479LysAsp: 1.479 ± 0.56
3.845LysGlu: 3.845 ± 0.962
1.479LysPhe: 1.479 ± 0.56
2.07LysGly: 2.07 ± 0.271
0.592LysHis: 0.592 ± 0.323
2.366LysIle: 2.366 ± 0.919
2.07LysLys: 2.07 ± 1.122
4.732LysLeu: 4.732 ± 0.929
0.592LysMet: 0.592 ± 0.323
0.887LysAsn: 0.887 ± 0.434
1.479LysPro: 1.479 ± 0.627
2.07LysGln: 2.07 ± 0.326
0.887LysArg: 0.887 ± 0.401
1.775LysSer: 1.775 ± 0.684
2.958LysThr: 2.958 ± 1.175
3.845LysVal: 3.845 ± 0.478
0.592LysTrp: 0.592 ± 0.323
1.479LysTyr: 1.479 ± 0.483
0.0LysXaa: 0.0 ± 0.0
Leu
9.169LeuAla: 9.169 ± 1.739
3.549LeuCys: 3.549 ± 0.441
5.324LeuAsp: 5.324 ± 1.674
7.098LeuGlu: 7.098 ± 1.877
3.549LeuPhe: 3.549 ± 0.395
6.507LeuGly: 6.507 ± 1.851
2.958LeuHis: 2.958 ± 0.334
3.845LeuIle: 3.845 ± 0.612
1.775LeuLys: 1.775 ± 0.22
9.465LeuLeu: 9.465 ± 1.584
1.183LeuMet: 1.183 ± 0.298
4.437LeuAsn: 4.437 ± 1.177
6.507LeuPro: 6.507 ± 1.991
3.845LeuGln: 3.845 ± 1.245
7.394LeuArg: 7.394 ± 0.907
6.507LeuSer: 6.507 ± 1.329
6.211LeuThr: 6.211 ± 0.832
8.282LeuVal: 8.282 ± 1.547
2.662LeuTrp: 2.662 ± 0.614
3.845LeuTyr: 3.845 ± 0.672
0.0LeuXaa: 0.0 ± 0.0
Met
1.775MetAla: 1.775 ± 0.904
0.296MetCys: 0.296 ± 0.381
0.592MetAsp: 0.592 ± 0.272
2.07MetGlu: 2.07 ± 0.708
1.775MetPhe: 1.775 ± 0.904
2.07MetGly: 2.07 ± 0.728
0.592MetHis: 0.592 ± 0.272
0.296MetIle: 0.296 ± 0.161
1.183MetLys: 1.183 ± 0.645
2.366MetLeu: 2.366 ± 0.464
1.479MetMet: 1.479 ± 0.624
0.0MetAsn: 0.0 ± 0.0
1.479MetPro: 1.479 ± 0.624
1.183MetGln: 1.183 ± 0.858
1.775MetArg: 1.775 ± 0.566
2.07MetSer: 2.07 ± 0.613
2.07MetThr: 2.07 ± 0.597
2.366MetVal: 2.366 ± 0.709
0.296MetTrp: 0.296 ± 0.161
0.296MetTyr: 0.296 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
3.549AsnAla: 3.549 ± 0.798
0.887AsnCys: 0.887 ± 0.401
2.07AsnAsp: 2.07 ± 0.596
0.887AsnGlu: 0.887 ± 0.235
0.887AsnPhe: 0.887 ± 0.484
1.775AsnGly: 1.775 ± 1.416
0.887AsnHis: 0.887 ± 0.235
1.479AsnIle: 1.479 ± 0.789
1.183AsnLys: 1.183 ± 0.645
2.07AsnLeu: 2.07 ± 0.326
2.07AsnMet: 2.07 ± 1.187
0.887AsnAsn: 0.887 ± 0.484
2.07AsnPro: 2.07 ± 0.326
1.183AsnGln: 1.183 ± 0.807
2.07AsnArg: 2.07 ± 0.708
1.183AsnSer: 1.183 ± 0.3
2.366AsnThr: 2.366 ± 0.185
5.324AsnVal: 5.324 ± 0.878
0.296AsnTrp: 0.296 ± 0.381
1.479AsnTyr: 1.479 ± 0.789
0.0AsnXaa: 0.0 ± 0.0
Pro
3.253ProAla: 3.253 ± 3.303
0.592ProCys: 0.592 ± 0.272
4.141ProAsp: 4.141 ± 1.434
3.549ProGlu: 3.549 ± 0.63
1.775ProPhe: 1.775 ± 0.22
2.662ProGly: 2.662 ± 0.587
1.479ProHis: 1.479 ± 0.555
4.732ProIle: 4.732 ± 1.334
2.07ProLys: 2.07 ± 0.271
5.028ProLeu: 5.028 ± 0.617
1.479ProMet: 1.479 ± 0.624
2.07ProAsn: 2.07 ± 0.848
5.915ProPro: 5.915 ± 1.97
1.775ProGln: 1.775 ± 0.618
3.253ProArg: 3.253 ± 0.996
3.253ProSer: 3.253 ± 0.94
3.253ProThr: 3.253 ± 2.58
3.845ProVal: 3.845 ± 1.833
0.887ProTrp: 0.887 ± 0.856
2.958ProTyr: 2.958 ± 1.267
0.0ProXaa: 0.0 ± 0.0
Gln
2.958GlnAla: 2.958 ± 1.111
0.592GlnCys: 0.592 ± 0.323
1.183GlnAsp: 1.183 ± 0.46
2.366GlnGlu: 2.366 ± 0.623
0.296GlnPhe: 0.296 ± 0.161
2.07GlnGly: 2.07 ± 0.271
2.07GlnHis: 2.07 ± 0.512
0.887GlnIle: 0.887 ± 0.856
2.366GlnLys: 2.366 ± 0.709
4.732GlnLeu: 4.732 ± 0.594
0.887GlnMet: 0.887 ± 0.434
0.592GlnAsn: 0.592 ± 0.404
1.775GlnPro: 1.775 ± 1.025
2.07GlnGln: 2.07 ± 1.207
3.253GlnArg: 3.253 ± 0.397
1.775GlnSer: 1.775 ± 1.452
1.775GlnThr: 1.775 ± 0.471
3.845GlnVal: 3.845 ± 0.691
0.296GlnTrp: 0.296 ± 0.161
0.592GlnTyr: 0.592 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
6.803ArgAla: 6.803 ± 1.184
0.296ArgCys: 0.296 ± 0.161
2.958ArgAsp: 2.958 ± 0.765
2.662ArgGlu: 2.662 ± 0.707
1.479ArgPhe: 1.479 ± 0.417
5.028ArgGly: 5.028 ± 1.44
2.366ArgHis: 2.366 ± 0.943
3.845ArgIle: 3.845 ± 0.532
3.253ArgLys: 3.253 ± 1.239
5.915ArgLeu: 5.915 ± 1.499
1.183ArgMet: 1.183 ± 0.3
1.775ArgAsn: 1.775 ± 0.22
4.141ArgPro: 4.141 ± 0.982
1.479ArgGln: 1.479 ± 0.555
3.549ArgArg: 3.549 ± 1.144
2.366ArgSer: 2.366 ± 0.623
5.028ArgThr: 5.028 ± 0.617
3.253ArgVal: 3.253 ± 0.972
1.183ArgTrp: 1.183 ± 0.645
2.366ArgTyr: 2.366 ± 0.665
0.0ArgXaa: 0.0 ± 0.0
Ser
4.437SerAla: 4.437 ± 1.491
1.775SerCys: 1.775 ± 0.22
3.253SerAsp: 3.253 ± 1.097
2.07SerGlu: 2.07 ± 0.512
1.479SerPhe: 1.479 ± 1.399
4.141SerGly: 4.141 ± 1.906
2.662SerHis: 2.662 ± 0.706
2.07SerIle: 2.07 ± 0.596
3.845SerLys: 3.845 ± 1.652
5.915SerLeu: 5.915 ± 1.066
2.07SerMet: 2.07 ± 0.75
1.183SerAsn: 1.183 ± 0.46
3.549SerPro: 3.549 ± 2.085
2.07SerGln: 2.07 ± 0.708
2.366SerArg: 2.366 ± 0.185
5.028SerSer: 5.028 ± 2.904
3.253SerThr: 3.253 ± 0.429
3.845SerVal: 3.845 ± 0.971
0.887SerTrp: 0.887 ± 0.235
3.549SerTyr: 3.549 ± 0.671
0.0SerXaa: 0.0 ± 0.0
Thr
5.028ThrAla: 5.028 ± 0.655
0.592ThrCys: 0.592 ± 0.272
3.845ThrAsp: 3.845 ± 0.771
4.141ThrGlu: 4.141 ± 0.303
3.845ThrPhe: 3.845 ± 1.19
3.549ThrGly: 3.549 ± 0.798
1.183ThrHis: 1.183 ± 0.545
3.549ThrIle: 3.549 ± 1.282
2.662ThrLys: 2.662 ± 0.762
9.76ThrLeu: 9.76 ± 1.095
1.183ThrMet: 1.183 ± 0.3
2.07ThrAsn: 2.07 ± 0.271
3.549ThrPro: 3.549 ± 1.235
2.366ThrGln: 2.366 ± 1.157
4.141ThrArg: 4.141 ± 1.434
5.62ThrSer: 5.62 ± 2.302
5.324ThrThr: 5.324 ± 2.341
3.845ThrVal: 3.845 ± 1.017
0.592ThrTrp: 0.592 ± 0.323
4.437ThrTyr: 4.437 ± 1.65
0.0ThrXaa: 0.0 ± 0.0
Val
5.62ValAla: 5.62 ± 0.913
1.479ValCys: 1.479 ± 0.56
2.662ValAsp: 2.662 ± 0.706
6.211ValGlu: 6.211 ± 0.764
0.887ValPhe: 0.887 ± 0.235
5.028ValGly: 5.028 ± 0.608
0.592ValHis: 0.592 ± 0.323
5.028ValIle: 5.028 ± 2.298
2.662ValLys: 2.662 ± 0.502
7.098ValLeu: 7.098 ± 1.787
2.662ValMet: 2.662 ± 0.706
3.845ValAsn: 3.845 ± 1.083
4.437ValPro: 4.437 ± 1.252
1.775ValGln: 1.775 ± 0.22
6.211ValArg: 6.211 ± 1.445
5.324ValSer: 5.324 ± 0.434
5.62ValThr: 5.62 ± 1.133
6.211ValVal: 6.211 ± 2.148
0.887ValTrp: 0.887 ± 0.401
4.732ValTyr: 4.732 ± 1.192
0.0ValXaa: 0.0 ± 0.0
Trp
2.366TrpAla: 2.366 ± 0.464
0.296TrpCys: 0.296 ± 0.161
0.887TrpAsp: 0.887 ± 0.484
0.0TrpGlu: 0.0 ± 0.0
0.296TrpPhe: 0.296 ± 0.161
1.479TrpGly: 1.479 ± 0.208
0.0TrpHis: 0.0 ± 0.0
0.887TrpIle: 0.887 ± 0.235
0.296TrpLys: 0.296 ± 0.161
1.183TrpLeu: 1.183 ± 0.46
0.0TrpMet: 0.0 ± 0.0
0.887TrpAsn: 0.887 ± 0.401
0.887TrpPro: 0.887 ± 0.484
0.296TrpGln: 0.296 ± 0.161
1.183TrpArg: 1.183 ± 0.645
1.775TrpSer: 1.775 ± 0.558
0.887TrpThr: 0.887 ± 0.642
1.775TrpVal: 1.775 ± 0.406
0.887TrpTrp: 0.887 ± 0.642
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.366TyrAla: 2.366 ± 0.919
0.0TyrCys: 0.0 ± 0.0
1.479TyrAsp: 1.479 ± 0.806
1.775TyrGlu: 1.775 ± 0.599
0.887TyrPhe: 0.887 ± 0.235
3.845TyrGly: 3.845 ± 0.566
0.296TyrHis: 0.296 ± 0.161
1.775TyrIle: 1.775 ± 0.22
1.775TyrLys: 1.775 ± 0.904
2.958TyrLeu: 2.958 ± 0.41
1.479TyrMet: 1.479 ± 0.417
2.958TyrAsn: 2.958 ± 0.768
2.07TyrPro: 2.07 ± 0.517
1.775TyrGln: 1.775 ± 0.657
2.07TyrArg: 2.07 ± 0.512
3.253TyrSer: 3.253 ± 0.478
4.732TyrThr: 4.732 ± 0.675
2.662TyrVal: 2.662 ± 0.603
0.592TyrTrp: 0.592 ± 0.272
0.592TyrTyr: 0.592 ± 0.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3382 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski