Amino acid dipepetide frequency for Sindbis virus (SINV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.645AlaAla: 7.645 ± 1.083
1.747AlaCys: 1.747 ± 0.18
5.679AlaAsp: 5.679 ± 0.574
4.15AlaGlu: 4.15 ± 1.57
3.495AlaPhe: 3.495 ± 0.36
2.621AlaGly: 2.621 ± 0.356
1.092AlaHis: 1.092 ± 0.226
4.587AlaIle: 4.587 ± 0.416
3.713AlaLys: 3.713 ± 0.116
7.645AlaLeu: 7.645 ± 0.228
3.713AlaMet: 3.713 ± 0.658
2.84AlaAsn: 2.84 ± 0.324
6.116AlaPro: 6.116 ± 1.23
3.713AlaGln: 3.713 ± 0.658
4.806AlaArg: 4.806 ± 1.146
4.806AlaSer: 4.806 ± 0.885
6.116AlaThr: 6.116 ± 0.469
6.99AlaVal: 6.99 ± 0.405
0.218AlaTrp: 0.218 ± 0.127
3.713AlaTyr: 3.713 ± 0.96
0.0AlaXaa: 0.0 ± 0.0
Cys
1.747CysAla: 1.747 ± 0.174
1.529CysCys: 1.529 ± 1.082
1.311CysAsp: 1.311 ± 0.382
0.874CysGlu: 0.874 ± 0.257
1.311CysPhe: 1.311 ± 0.111
2.184CysGly: 2.184 ± 0.451
0.874CysHis: 0.874 ± 0.087
0.655CysIle: 0.655 ± 0.191
2.621CysLys: 2.621 ± 0.867
3.058CysLeu: 3.058 ± 0.24
0.655CysMet: 0.655 ± 0.191
1.092CysAsn: 1.092 ± 0.371
1.311CysPro: 1.311 ± 0.111
0.655CysGln: 0.655 ± 0.165
1.529CysArg: 1.529 ± 0.29
1.966CysSer: 1.966 ± 0.828
2.403CysThr: 2.403 ± 0.575
1.966CysVal: 1.966 ± 0.146
0.218CysTrp: 0.218 ± 0.127
0.655CysTyr: 0.655 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
2.621AspAla: 2.621 ± 0.983
0.874AspCys: 0.874 ± 0.288
3.495AspAsp: 3.495 ± 0.349
3.058AspGlu: 3.058 ± 0.681
1.311AspPhe: 1.311 ± 0.433
1.966AspGly: 1.966 ± 0.146
3.495AspHis: 3.495 ± 1.347
3.277AspIle: 3.277 ± 0.286
1.529AspLys: 1.529 ± 0.892
3.277AspLeu: 3.277 ± 0.246
1.747AspMet: 1.747 ± 0.466
2.403AspAsn: 2.403 ± 0.317
1.966AspPro: 1.966 ± 0.592
2.184AspGln: 2.184 ± 0.138
3.932AspArg: 3.932 ± 0.279
2.84AspSer: 2.84 ± 0.245
2.621AspThr: 2.621 ± 0.382
3.713AspVal: 3.713 ± 0.755
0.218AspTrp: 0.218 ± 0.127
0.874AspTyr: 0.874 ± 0.349
0.0AspXaa: 0.0 ± 0.0
Glu
5.461GluAla: 5.461 ± 0.672
2.184GluCys: 2.184 ± 0.774
2.621GluAsp: 2.621 ± 0.973
4.15GluGlu: 4.15 ± 0.774
1.529GluPhe: 1.529 ± 0.078
4.587GluGly: 4.587 ± 0.805
1.966GluHis: 1.966 ± 0.302
2.84GluIle: 2.84 ± 0.245
3.495GluLys: 3.495 ± 0.627
3.058GluLeu: 3.058 ± 1.497
0.655GluMet: 0.655 ± 0.165
1.747GluAsn: 1.747 ± 0.18
3.058GluPro: 3.058 ± 0.24
1.529GluGln: 1.529 ± 0.34
3.058GluArg: 3.058 ± 1.497
2.184GluSer: 2.184 ± 0.138
3.713GluThr: 3.713 ± 0.209
3.495GluVal: 3.495 ± 1.479
1.092GluTrp: 1.092 ± 0.502
2.403GluTyr: 2.403 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
2.184PheAla: 2.184 ± 0.243
0.655PheCys: 0.655 ± 0.165
1.966PheAsp: 1.966 ± 0.302
1.092PheGlu: 1.092 ± 0.371
1.311PhePhe: 1.311 ± 0.382
3.058PheGly: 3.058 ± 0.667
0.655PheHis: 0.655 ± 0.382
1.529PheIle: 1.529 ± 0.327
1.747PheLys: 1.747 ± 0.174
1.966PheLeu: 1.966 ± 0.265
0.437PheMet: 0.437 ± 0.144
1.747PheAsn: 1.747 ± 0.174
2.84PhePro: 2.84 ± 0.245
1.092PheGln: 1.092 ± 0.371
1.311PheArg: 1.311 ± 0.765
3.495PheSer: 3.495 ± 1.188
3.495PheThr: 3.495 ± 1.91
1.966PheVal: 1.966 ± 0.592
0.437PheTrp: 0.437 ± 0.313
0.874PheTyr: 0.874 ± 0.087
0.0PheXaa: 0.0 ± 0.0
Gly
4.587GlyAla: 4.587 ± 1.43
1.529GlyCys: 1.529 ± 0.268
3.495GlyAsp: 3.495 ± 1.074
2.621GlyGlu: 2.621 ± 1.281
3.058GlyPhe: 3.058 ± 0.202
3.495GlyGly: 3.495 ± 0.881
1.092GlyHis: 1.092 ± 0.106
1.966GlyIle: 1.966 ± 1.147
5.898GlyLys: 5.898 ± 1.397
3.495GlyLeu: 3.495 ± 0.276
0.874GlyMet: 0.874 ± 0.257
1.529GlyAsn: 1.529 ± 0.824
2.403GlyPro: 2.403 ± 0.553
0.874GlyGln: 0.874 ± 0.349
5.024GlyArg: 5.024 ± 1.647
2.84GlySer: 2.84 ± 0.807
6.116GlyThr: 6.116 ± 0.302
3.277GlyVal: 3.277 ± 0.246
0.655GlyTrp: 0.655 ± 0.191
2.621GlyTyr: 2.621 ± 0.449
0.0GlyXaa: 0.0 ± 0.0
His
3.932HisAla: 3.932 ± 0.332
0.437HisCys: 0.437 ± 0.211
0.437HisAsp: 0.437 ± 0.313
2.403HisGlu: 2.403 ± 0.35
0.874HisPhe: 0.874 ± 0.257
2.184HisGly: 2.184 ± 1.005
1.529HisHis: 1.529 ± 0.268
0.874HisIle: 0.874 ± 0.349
1.092HisLys: 1.092 ± 0.282
2.621HisLeu: 2.621 ± 0.435
0.0HisMet: 0.0 ± 0.0
1.092HisAsn: 1.092 ± 0.637
2.403HisPro: 2.403 ± 0.35
0.655HisGln: 0.655 ± 0.382
1.747HisArg: 1.747 ± 0.393
1.311HisSer: 1.311 ± 0.33
2.184HisThr: 2.184 ± 0.701
1.311HisVal: 1.311 ± 0.433
0.218HisTrp: 0.218 ± 0.127
0.655HisTyr: 0.655 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
3.277IleAla: 3.277 ± 1.113
0.655IleCys: 0.655 ± 0.382
2.403IleAsp: 2.403 ± 0.011
2.403IleGlu: 2.403 ± 0.846
0.874IlePhe: 0.874 ± 0.257
3.932IleGly: 3.932 ± 0.331
1.966IleHis: 1.966 ± 0.302
2.403IleIle: 2.403 ± 1.117
3.932IleLys: 3.932 ± 0.53
2.621IleLeu: 2.621 ± 0.262
0.874IleMet: 0.874 ± 0.087
0.874IleAsn: 0.874 ± 0.51
4.15IlePro: 4.15 ± 1.257
1.311IleGln: 1.311 ± 0.647
2.184IleArg: 2.184 ± 0.451
3.277IleSer: 3.277 ± 0.286
4.806IleThr: 4.806 ± 0.543
4.15IleVal: 4.15 ± 1.257
0.437IleTrp: 0.437 ± 0.313
1.092IleTyr: 1.092 ± 0.637
0.0IleXaa: 0.0 ± 0.0
Lys
2.403LysAla: 2.403 ± 0.553
1.747LysCys: 1.747 ± 0.955
1.747LysAsp: 1.747 ± 0.514
3.495LysGlu: 3.495 ± 0.932
2.184LysPhe: 2.184 ± 0.138
3.932LysGly: 3.932 ± 0.331
1.747LysHis: 1.747 ± 0.466
3.932LysIle: 3.932 ± 0.53
6.116LysLys: 6.116 ± 0.302
5.461LysLeu: 5.461 ± 0.991
0.874LysMet: 0.874 ± 0.481
2.84LysAsn: 2.84 ± 0.438
5.461LysPro: 5.461 ± 1.612
3.713LysGln: 3.713 ± 0.725
2.621LysArg: 2.621 ± 0.435
2.84LysSer: 2.84 ± 0.557
4.806LysThr: 4.806 ± 0.585
5.898LysVal: 5.898 ± 0.889
0.874LysTrp: 0.874 ± 0.087
2.184LysTyr: 2.184 ± 0.212
0.0LysXaa: 0.0 ± 0.0
Leu
7.645LeuAla: 7.645 ± 0.533
3.058LeuCys: 3.058 ± 0.202
4.587LeuAsp: 4.587 ± 1.264
4.806LeuGlu: 4.806 ± 0.339
3.058LeuPhe: 3.058 ± 0.671
3.713LeuGly: 3.713 ± 0.725
1.747LeuHis: 1.747 ± 0.174
2.403LeuIle: 2.403 ± 0.332
5.461LeuLys: 5.461 ± 0.362
5.461LeuLeu: 5.461 ± 0.213
1.092LeuMet: 1.092 ± 0.226
2.84LeuAsn: 2.84 ± 0.807
4.587LeuPro: 4.587 ± 0.378
4.369LeuGln: 4.369 ± 1.162
3.495LeuArg: 3.495 ± 0.627
3.932LeuSer: 3.932 ± 0.292
5.679LeuThr: 5.679 ± 0.107
5.461LeuVal: 5.461 ± 0.984
0.655LeuTrp: 0.655 ± 0.191
1.966LeuTyr: 1.966 ± 1.147
0.0LeuXaa: 0.0 ± 0.0
Met
1.966MetAla: 1.966 ± 0.302
0.437MetCys: 0.437 ± 0.313
1.966MetAsp: 1.966 ± 0.146
1.966MetGlu: 1.966 ± 0.146
0.655MetPhe: 0.655 ± 0.382
0.0MetGly: 0.0 ± 0.0
0.655MetHis: 0.655 ± 0.191
1.092MetIle: 1.092 ± 0.226
1.966MetLys: 1.966 ± 0.206
1.092MetLeu: 1.092 ± 0.226
1.529MetMet: 1.529 ± 0.078
0.437MetAsn: 0.437 ± 0.313
0.874MetPro: 0.874 ± 0.087
1.311MetGln: 1.311 ± 0.765
1.747MetArg: 1.747 ± 0.174
2.403MetSer: 2.403 ± 0.332
0.874MetThr: 0.874 ± 0.51
0.655MetVal: 0.655 ± 0.191
0.874MetTrp: 0.874 ± 0.669
0.437MetTyr: 0.437 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
2.84AsnAla: 2.84 ± 0.611
1.092AsnCys: 1.092 ± 0.637
1.311AsnAsp: 1.311 ± 0.491
1.529AsnGlu: 1.529 ± 0.34
0.874AsnPhe: 0.874 ± 0.349
1.311AsnGly: 1.311 ± 0.111
1.311AsnHis: 1.311 ± 0.382
2.84AsnIle: 2.84 ± 0.807
0.874AsnLys: 0.874 ± 0.087
1.311AsnLeu: 1.311 ± 0.491
1.529AsnMet: 1.529 ± 0.268
0.655AsnAsn: 0.655 ± 0.165
2.403AsnPro: 2.403 ± 0.553
1.092AsnGln: 1.092 ± 0.371
1.092AsnArg: 1.092 ± 0.106
4.806AsnSer: 4.806 ± 0.585
2.403AsnThr: 2.403 ± 0.553
3.932AsnVal: 3.932 ± 0.069
0.655AsnTrp: 0.655 ± 0.191
1.311AsnTyr: 1.311 ± 0.491
0.0AsnXaa: 0.0 ± 0.0
Pro
5.461ProAla: 5.461 ± 0.923
2.184ProCys: 2.184 ± 1.005
1.747ProAsp: 1.747 ± 0.466
3.713ProGlu: 3.713 ± 0.209
2.84ProPhe: 2.84 ± 1.467
5.024ProGly: 5.024 ± 0.427
2.403ProHis: 2.403 ± 0.575
1.966ProIle: 1.966 ± 0.206
5.024ProLys: 5.024 ± 0.775
5.461ProLeu: 5.461 ± 0.477
1.747ProMet: 1.747 ± 0.466
2.403ProAsn: 2.403 ± 0.332
5.898ProPro: 5.898 ± 1.397
2.184ProGln: 2.184 ± 0.243
3.713ProArg: 3.713 ± 1.265
3.713ProSer: 3.713 ± 0.44
4.369ProThr: 4.369 ± 0.423
5.898ProVal: 5.898 ± 0.85
0.437ProTrp: 0.437 ± 0.255
2.184ProTyr: 2.184 ± 0.451
0.0ProXaa: 0.0 ± 0.0
Gln
4.587GlnAla: 4.587 ± 0.67
1.529GlnCys: 1.529 ± 0.327
1.311GlnAsp: 1.311 ± 0.217
2.84GlnGlu: 2.84 ± 0.757
1.747GlnPhe: 1.747 ± 0.466
1.311GlnGly: 1.311 ± 0.217
0.655GlnHis: 0.655 ± 0.191
1.311GlnIle: 1.311 ± 0.111
1.966GlnLys: 1.966 ± 0.592
3.932GlnLeu: 3.932 ± 0.292
0.874GlnMet: 0.874 ± 0.257
1.092GlnAsn: 1.092 ± 0.637
3.713GlnPro: 3.713 ± 0.393
0.874GlnGln: 0.874 ± 0.087
0.874GlnArg: 0.874 ± 0.087
1.747GlnSer: 1.747 ± 0.466
1.311GlnThr: 1.311 ± 0.382
1.529GlnVal: 1.529 ± 0.34
0.218GlnTrp: 0.218 ± 0.127
1.311GlnTyr: 1.311 ± 0.111
0.0GlnXaa: 0.0 ± 0.0
Arg
2.84ArgAla: 2.84 ± 0.551
1.966ArgCys: 1.966 ± 0.146
1.092ArgAsp: 1.092 ± 0.106
3.277ArgGlu: 3.277 ± 0.097
1.092ArgPhe: 1.092 ± 0.106
2.403ArgGly: 2.403 ± 0.575
0.874ArgHis: 0.874 ± 0.626
1.966ArgIle: 1.966 ± 0.302
4.15ArgLys: 4.15 ± 1.864
5.024ArgLeu: 5.024 ± 0.584
1.529ArgMet: 1.529 ± 0.142
2.403ArgAsn: 2.403 ± 0.317
5.898ArgPro: 5.898 ± 1.954
2.621ArgGln: 2.621 ± 0.262
6.553ArgArg: 6.553 ± 1.207
3.495ArgSer: 3.495 ± 0.349
3.932ArgThr: 3.932 ± 0.652
2.84ArgVal: 2.84 ± 0.557
0.218ArgTrp: 0.218 ± 0.127
1.529ArgTyr: 1.529 ± 0.34
0.0ArgXaa: 0.0 ± 0.0
Ser
8.519SerAla: 8.519 ± 1.241
1.529SerCys: 1.529 ± 0.268
1.966SerAsp: 1.966 ± 0.452
4.369SerGlu: 4.369 ± 0.485
3.495SerPhe: 3.495 ± 0.917
6.553SerGly: 6.553 ± 1.587
0.437SerHis: 0.437 ± 0.144
1.747SerIle: 1.747 ± 0.18
3.713SerLys: 3.713 ± 0.658
4.587SerLeu: 4.587 ± 0.382
0.655SerMet: 0.655 ± 0.165
1.747SerAsn: 1.747 ± 0.174
3.713SerPro: 3.713 ± 0.116
1.529SerGln: 1.529 ± 0.52
3.058SerArg: 3.058 ± 1.227
5.898SerSer: 5.898 ± 0.27
5.461SerThr: 5.461 ± 1.227
4.15SerVal: 4.15 ± 0.774
0.874SerTrp: 0.874 ± 0.288
3.495SerTyr: 3.495 ± 0.514
0.0SerXaa: 0.0 ± 0.0
Thr
7.427ThrAla: 7.427 ± 0.758
3.058ThrCys: 3.058 ± 1.04
3.058ThrAsp: 3.058 ± 0.681
4.587ThrGlu: 4.587 ± 0.524
1.311ThrPhe: 1.311 ± 0.217
3.932ThrGly: 3.932 ± 1.184
0.655ThrHis: 0.655 ± 0.382
3.713ThrIle: 3.713 ± 0.725
3.713ThrLys: 3.713 ± 1.058
5.461ThrLeu: 5.461 ± 0.362
1.747ThrMet: 1.747 ± 0.693
1.966ThrAsn: 1.966 ± 0.865
4.587ThrPro: 4.587 ± 0.73
1.747ThrGln: 1.747 ± 0.18
4.587ThrArg: 4.587 ± 0.469
6.99ThrSer: 6.99 ± 1.948
5.024ThrThr: 5.024 ± 1.058
7.864ThrVal: 7.864 ± 2.473
0.655ThrTrp: 0.655 ± 0.191
2.184ThrTyr: 2.184 ± 0.138
0.0ThrXaa: 0.0 ± 0.0
Val
5.679ValAla: 5.679 ± 0.445
1.747ValCys: 1.747 ± 1.02
3.058ValAsp: 3.058 ± 0.671
1.966ValGlu: 1.966 ± 0.625
1.529ValPhe: 1.529 ± 0.078
3.058ValGly: 3.058 ± 0.202
2.403ValHis: 2.403 ± 0.881
4.587ValIle: 4.587 ± 0.712
4.369ValLys: 4.369 ± 0.149
7.208ValLeu: 7.208 ± 1.382
1.747ValMet: 1.747 ± 0.125
3.277ValAsn: 3.277 ± 0.246
4.587ValPro: 4.587 ± 0.563
2.621ValGln: 2.621 ± 0.262
2.621ValArg: 2.621 ± 0.262
5.898ValSer: 5.898 ± 0.888
6.335ValThr: 6.335 ± 0.872
5.461ValVal: 5.461 ± 1.218
1.092ValTrp: 1.092 ± 0.106
3.713ValTyr: 3.713 ± 1.273
0.0ValXaa: 0.0 ± 0.0
Trp
0.874TrpAla: 0.874 ± 0.51
0.0TrpCys: 0.0 ± 0.0
0.655TrpAsp: 0.655 ± 0.191
0.437TrpGlu: 0.437 ± 0.255
0.218TrpPhe: 0.218 ± 0.127
0.655TrpGly: 0.655 ± 0.475
0.655TrpHis: 0.655 ± 0.191
1.311TrpIle: 1.311 ± 0.217
0.655TrpLys: 0.655 ± 0.165
0.437TrpLeu: 0.437 ± 0.144
0.0TrpMet: 0.0 ± 0.0
0.437TrpAsn: 0.437 ± 0.313
0.874TrpPro: 0.874 ± 0.087
0.0TrpGln: 0.0 ± 0.0
0.437TrpArg: 0.437 ± 0.313
1.529TrpSer: 1.529 ± 0.52
0.218TrpThr: 0.218 ± 0.127
1.092TrpVal: 1.092 ± 0.502
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.713TyrAla: 3.713 ± 0.209
0.655TyrCys: 0.655 ± 0.191
3.495TyrAsp: 3.495 ± 1.074
1.092TyrGlu: 1.092 ± 0.226
1.092TyrPhe: 1.092 ± 0.106
1.529TyrGly: 1.529 ± 0.416
1.747TyrHis: 1.747 ± 0.466
2.621TyrIle: 2.621 ± 0.117
2.84TyrLys: 2.84 ± 0.324
2.621TyrLeu: 2.621 ± 0.449
0.437TyrMet: 0.437 ± 0.313
1.747TyrAsn: 1.747 ± 0.18
1.529TyrPro: 1.529 ± 0.615
0.655TyrGln: 0.655 ± 0.382
1.311TyrArg: 1.311 ± 0.217
1.529TyrSer: 1.529 ± 0.078
2.403TyrThr: 2.403 ± 1.422
1.529TyrVal: 1.529 ± 0.416
0.437TyrTrp: 0.437 ± 0.255
0.437TyrTyr: 0.437 ± 0.313
0.218TyrXaa: 0.218 ± 0.127
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.218XaaLeu: 0.218 ± 0.127
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4579 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski