Amino acid dipepetide frequency for Dunaliella viridis virus SI2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.009AlaAla: 18.009 ± 2.112
1.044AlaCys: 1.044 ± 0.368
8.439AlaAsp: 8.439 ± 0.916
9.222AlaGlu: 9.222 ± 1.467
3.567AlaPhe: 3.567 ± 0.526
10.005AlaGly: 10.005 ± 0.999
1.914AlaHis: 1.914 ± 0.407
6.525AlaIle: 6.525 ± 0.66
2.871AlaLys: 2.871 ± 0.636
11.571AlaLeu: 11.571 ± 1.231
3.654AlaMet: 3.654 ± 0.605
3.219AlaAsn: 3.219 ± 0.345
4.263AlaPro: 4.263 ± 0.516
4.611AlaGln: 4.611 ± 0.692
8.613AlaArg: 8.613 ± 1.346
7.047AlaSer: 7.047 ± 0.812
6.003AlaThr: 6.003 ± 0.85
6.003AlaVal: 6.003 ± 0.75
1.479AlaTrp: 1.479 ± 0.323
2.784AlaTyr: 2.784 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.435CysAla: 0.435 ± 0.181
0.087CysCys: 0.087 ± 0.1
0.609CysAsp: 0.609 ± 0.222
0.696CysGlu: 0.696 ± 0.275
0.435CysPhe: 0.435 ± 0.206
0.435CysGly: 0.435 ± 0.186
0.174CysHis: 0.174 ± 0.158
0.261CysIle: 0.261 ± 0.124
0.435CysLys: 0.435 ± 0.189
0.435CysLeu: 0.435 ± 0.187
0.174CysMet: 0.174 ± 0.121
0.0CysAsn: 0.0 ± 0.0
0.522CysPro: 0.522 ± 0.222
0.174CysGln: 0.174 ± 0.106
1.479CysArg: 1.479 ± 0.355
0.609CysSer: 0.609 ± 0.245
0.435CysThr: 0.435 ± 0.31
0.087CysVal: 0.087 ± 0.1
0.087CysTrp: 0.087 ± 0.091
0.087CysTyr: 0.087 ± 0.075
0.0CysXaa: 0.0 ± 0.0
Asp
7.569AspAla: 7.569 ± 0.885
0.609AspCys: 0.609 ± 0.237
3.045AspAsp: 3.045 ± 0.423
4.698AspGlu: 4.698 ± 0.609
2.784AspPhe: 2.784 ± 0.457
6.525AspGly: 6.525 ± 0.932
1.305AspHis: 1.305 ± 0.26
3.306AspIle: 3.306 ± 0.671
0.957AspLys: 0.957 ± 0.257
6.264AspLeu: 6.264 ± 0.652
1.653AspMet: 1.653 ± 0.369
2.001AspAsn: 2.001 ± 0.34
3.567AspPro: 3.567 ± 0.608
2.958AspGln: 2.958 ± 0.649
5.307AspArg: 5.307 ± 0.648
2.001AspSer: 2.001 ± 0.476
4.002AspThr: 4.002 ± 0.675
3.567AspVal: 3.567 ± 0.486
1.218AspTrp: 1.218 ± 0.273
1.479AspTyr: 1.479 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
9.222GluAla: 9.222 ± 1.28
0.609GluCys: 0.609 ± 0.223
4.089GluAsp: 4.089 ± 0.664
5.394GluGlu: 5.394 ± 0.85
2.349GluPhe: 2.349 ± 0.408
4.611GluGly: 4.611 ± 0.69
0.87GluHis: 0.87 ± 0.286
4.611GluIle: 4.611 ± 0.672
2.088GluLys: 2.088 ± 0.393
6.09GluLeu: 6.09 ± 0.918
2.001GluMet: 2.001 ± 0.582
2.349GluAsn: 2.349 ± 0.403
2.523GluPro: 2.523 ± 0.659
3.132GluGln: 3.132 ± 0.387
5.829GluArg: 5.829 ± 0.884
2.262GluSer: 2.262 ± 0.506
3.567GluThr: 3.567 ± 0.598
5.655GluVal: 5.655 ± 0.59
1.044GluTrp: 1.044 ± 0.259
1.566GluTyr: 1.566 ± 0.352
0.0GluXaa: 0.0 ± 0.0
Phe
3.741PheAla: 3.741 ± 0.468
0.261PheCys: 0.261 ± 0.147
2.697PheAsp: 2.697 ± 0.411
2.784PheGlu: 2.784 ± 0.451
2.001PhePhe: 2.001 ± 0.441
4.002PheGly: 4.002 ± 0.556
0.522PheHis: 0.522 ± 0.211
1.653PheIle: 1.653 ± 0.365
0.783PheLys: 0.783 ± 0.242
1.74PheLeu: 1.74 ± 0.487
0.957PheMet: 0.957 ± 0.249
1.131PheAsn: 1.131 ± 0.351
1.74PhePro: 1.74 ± 0.365
1.392PheGln: 1.392 ± 0.334
3.219PheArg: 3.219 ± 0.58
1.827PheSer: 1.827 ± 0.358
2.349PheThr: 2.349 ± 0.429
2.175PheVal: 2.175 ± 0.498
0.696PheTrp: 0.696 ± 0.202
0.87PheTyr: 0.87 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
10.005GlyAla: 10.005 ± 0.917
0.609GlyCys: 0.609 ± 0.226
5.394GlyAsp: 5.394 ± 1.08
5.394GlyGlu: 5.394 ± 0.7
2.784GlyPhe: 2.784 ± 0.409
8.613GlyGly: 8.613 ± 1.178
2.001GlyHis: 2.001 ± 0.403
2.61GlyIle: 2.61 ± 0.572
2.784GlyLys: 2.784 ± 0.517
6.525GlyLeu: 6.525 ± 0.618
3.48GlyMet: 3.48 ± 0.636
2.436GlyAsn: 2.436 ± 0.569
3.828GlyPro: 3.828 ± 0.596
4.176GlyGln: 4.176 ± 0.485
6.525GlyArg: 6.525 ± 0.967
5.307GlySer: 5.307 ± 0.85
6.351GlyThr: 6.351 ± 1.274
5.829GlyVal: 5.829 ± 0.651
1.044GlyTrp: 1.044 ± 0.257
1.827GlyTyr: 1.827 ± 0.352
0.0GlyXaa: 0.0 ± 0.0
His
1.74HisAla: 1.74 ± 0.321
0.435HisCys: 0.435 ± 0.202
0.87HisAsp: 0.87 ± 0.259
1.218HisGlu: 1.218 ± 0.397
0.87HisPhe: 0.87 ± 0.349
1.131HisGly: 1.131 ± 0.309
0.783HisHis: 0.783 ± 0.251
0.696HisIle: 0.696 ± 0.268
0.522HisLys: 0.522 ± 0.216
1.914HisLeu: 1.914 ± 0.322
0.609HisMet: 0.609 ± 0.214
0.348HisAsn: 0.348 ± 0.187
0.696HisPro: 0.696 ± 0.258
0.435HisGln: 0.435 ± 0.208
1.479HisArg: 1.479 ± 0.316
0.348HisSer: 0.348 ± 0.149
0.783HisThr: 0.783 ± 0.322
1.218HisVal: 1.218 ± 0.341
0.435HisTrp: 0.435 ± 0.245
0.522HisTyr: 0.522 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
5.655IleAla: 5.655 ± 0.681
0.696IleCys: 0.696 ± 0.27
4.263IleAsp: 4.263 ± 0.641
2.61IleGlu: 2.61 ± 0.439
0.957IlePhe: 0.957 ± 0.281
4.263IleGly: 4.263 ± 0.703
0.609IleHis: 0.609 ± 0.206
1.827IleIle: 1.827 ± 0.329
1.044IleLys: 1.044 ± 0.272
3.567IleLeu: 3.567 ± 0.504
1.131IleMet: 1.131 ± 0.344
1.131IleAsn: 1.131 ± 0.244
1.827IlePro: 1.827 ± 0.371
1.653IleGln: 1.653 ± 0.228
3.567IleArg: 3.567 ± 0.623
3.219IleSer: 3.219 ± 0.562
3.219IleThr: 3.219 ± 0.728
4.089IleVal: 4.089 ± 0.496
0.609IleTrp: 0.609 ± 0.211
0.957IleTyr: 0.957 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
2.784LysAla: 2.784 ± 0.612
0.174LysCys: 0.174 ± 0.117
1.392LysAsp: 1.392 ± 0.379
1.392LysGlu: 1.392 ± 0.428
0.957LysPhe: 0.957 ± 0.349
2.088LysGly: 2.088 ± 0.426
0.696LysHis: 0.696 ± 0.247
1.044LysIle: 1.044 ± 0.232
0.435LysLys: 0.435 ± 0.207
2.001LysLeu: 2.001 ± 0.517
0.957LysMet: 0.957 ± 0.29
0.957LysAsn: 0.957 ± 0.347
1.827LysPro: 1.827 ± 0.455
1.131LysGln: 1.131 ± 0.382
1.827LysArg: 1.827 ± 0.424
2.001LysSer: 2.001 ± 0.41
1.74LysThr: 1.74 ± 0.443
1.914LysVal: 1.914 ± 0.428
0.609LysTrp: 0.609 ± 0.229
0.435LysTyr: 0.435 ± 0.187
0.0LysXaa: 0.0 ± 0.0
Leu
9.918LeuAla: 9.918 ± 0.965
0.87LeuCys: 0.87 ± 0.331
5.394LeuAsp: 5.394 ± 0.72
7.743LeuGlu: 7.743 ± 0.804
2.61LeuPhe: 2.61 ± 0.401
5.916LeuGly: 5.916 ± 0.612
1.044LeuHis: 1.044 ± 0.295
2.697LeuIle: 2.697 ± 0.338
1.566LeuLys: 1.566 ± 0.384
4.611LeuLeu: 4.611 ± 0.784
2.001LeuMet: 2.001 ± 0.422
1.566LeuAsn: 1.566 ± 0.322
4.524LeuPro: 4.524 ± 0.692
3.132LeuGln: 3.132 ± 0.521
5.394LeuArg: 5.394 ± 0.59
4.959LeuSer: 4.959 ± 0.684
4.698LeuThr: 4.698 ± 0.581
4.437LeuVal: 4.437 ± 0.719
1.653LeuTrp: 1.653 ± 0.36
2.001LeuTyr: 2.001 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
4.002MetAla: 4.002 ± 0.762
0.087MetCys: 0.087 ± 0.091
1.653MetAsp: 1.653 ± 0.382
2.001MetGlu: 2.001 ± 0.426
0.957MetPhe: 0.957 ± 0.352
1.479MetGly: 1.479 ± 0.249
0.696MetHis: 0.696 ± 0.254
0.87MetIle: 0.87 ± 0.249
1.479MetLys: 1.479 ± 0.395
1.566MetLeu: 1.566 ± 0.422
0.696MetMet: 0.696 ± 0.229
1.044MetAsn: 1.044 ± 0.305
1.653MetPro: 1.653 ± 0.427
1.305MetGln: 1.305 ± 0.301
2.001MetArg: 2.001 ± 0.462
2.001MetSer: 2.001 ± 0.447
2.436MetThr: 2.436 ± 0.428
2.001MetVal: 2.001 ± 0.356
0.174MetTrp: 0.174 ± 0.111
0.435MetTyr: 0.435 ± 0.17
0.0MetXaa: 0.0 ± 0.0
Asn
4.698AsnAla: 4.698 ± 0.742
0.087AsnCys: 0.087 ± 0.081
1.914AsnAsp: 1.914 ± 0.446
1.653AsnGlu: 1.653 ± 0.312
0.696AsnPhe: 0.696 ± 0.25
3.045AsnGly: 3.045 ± 0.477
0.261AsnHis: 0.261 ± 0.13
1.305AsnIle: 1.305 ± 0.355
0.957AsnLys: 0.957 ± 0.309
2.958AsnLeu: 2.958 ± 0.472
0.957AsnMet: 0.957 ± 0.261
0.348AsnAsn: 0.348 ± 0.161
2.349AsnPro: 2.349 ± 0.545
0.87AsnGln: 0.87 ± 0.213
2.784AsnArg: 2.784 ± 0.657
1.74AsnSer: 1.74 ± 0.36
1.74AsnThr: 1.74 ± 0.442
1.479AsnVal: 1.479 ± 0.453
0.261AsnTrp: 0.261 ± 0.13
0.696AsnTyr: 0.696 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
4.698ProAla: 4.698 ± 0.554
0.174ProCys: 0.174 ± 0.114
4.611ProAsp: 4.611 ± 0.656
2.784ProGlu: 2.784 ± 0.608
1.74ProPhe: 1.74 ± 0.361
4.35ProGly: 4.35 ± 0.532
1.044ProHis: 1.044 ± 0.287
2.871ProIle: 2.871 ± 0.445
1.218ProLys: 1.218 ± 0.263
2.958ProLeu: 2.958 ± 0.522
1.305ProMet: 1.305 ± 0.338
1.392ProAsn: 1.392 ± 0.293
2.61ProPro: 2.61 ± 0.513
2.262ProGln: 2.262 ± 0.444
3.567ProArg: 3.567 ± 0.519
2.784ProSer: 2.784 ± 0.453
2.61ProThr: 2.61 ± 0.56
3.219ProVal: 3.219 ± 0.508
0.696ProTrp: 0.696 ± 0.233
0.87ProTyr: 0.87 ± 0.299
0.0ProXaa: 0.0 ± 0.0
Gln
5.133GlnAla: 5.133 ± 0.774
0.087GlnCys: 0.087 ± 0.079
1.914GlnAsp: 1.914 ± 0.512
3.045GlnGlu: 3.045 ± 0.45
1.479GlnPhe: 1.479 ± 0.388
3.567GlnGly: 3.567 ± 0.505
0.174GlnHis: 0.174 ± 0.119
2.958GlnIle: 2.958 ± 0.42
1.218GlnLys: 1.218 ± 0.413
2.175GlnLeu: 2.175 ± 0.414
1.479GlnMet: 1.479 ± 0.351
1.218GlnAsn: 1.218 ± 0.297
2.088GlnPro: 2.088 ± 0.357
1.566GlnGln: 1.566 ± 0.316
4.002GlnArg: 4.002 ± 0.487
1.74GlnSer: 1.74 ± 0.321
1.827GlnThr: 1.827 ± 0.466
3.219GlnVal: 3.219 ± 0.511
0.522GlnTrp: 0.522 ± 0.25
0.957GlnTyr: 0.957 ± 0.344
0.0GlnXaa: 0.0 ± 0.0
Arg
10.527ArgAla: 10.527 ± 1.233
0.522ArgCys: 0.522 ± 0.204
5.655ArgAsp: 5.655 ± 0.794
5.742ArgGlu: 5.742 ± 0.747
3.132ArgPhe: 3.132 ± 0.526
5.829ArgGly: 5.829 ± 0.774
1.392ArgHis: 1.392 ± 0.397
3.741ArgIle: 3.741 ± 0.475
2.61ArgLys: 2.61 ± 0.604
6.264ArgLeu: 6.264 ± 0.708
2.697ArgMet: 2.697 ± 0.435
3.045ArgAsn: 3.045 ± 0.499
2.523ArgPro: 2.523 ± 0.549
2.697ArgGln: 2.697 ± 0.523
6.177ArgArg: 6.177 ± 0.857
5.133ArgSer: 5.133 ± 0.631
3.915ArgThr: 3.915 ± 0.589
4.785ArgVal: 4.785 ± 0.616
0.957ArgTrp: 0.957 ± 0.276
2.349ArgTyr: 2.349 ± 0.496
0.0ArgXaa: 0.0 ± 0.0
Ser
6.699SerAla: 6.699 ± 0.843
0.174SerCys: 0.174 ± 0.117
3.219SerAsp: 3.219 ± 0.436
3.045SerGlu: 3.045 ± 0.516
2.436SerPhe: 2.436 ± 0.434
6.699SerGly: 6.699 ± 0.905
0.696SerHis: 0.696 ± 0.238
2.784SerIle: 2.784 ± 0.491
1.827SerLys: 1.827 ± 0.348
3.48SerLeu: 3.48 ± 0.703
1.218SerMet: 1.218 ± 0.311
2.349SerAsn: 2.349 ± 0.473
2.871SerPro: 2.871 ± 0.445
2.436SerGln: 2.436 ± 0.573
4.263SerArg: 4.263 ± 0.55
3.828SerSer: 3.828 ± 0.745
4.089SerThr: 4.089 ± 0.587
3.045SerVal: 3.045 ± 0.599
0.435SerTrp: 0.435 ± 0.184
1.305SerTyr: 1.305 ± 0.337
0.0SerXaa: 0.0 ± 0.0
Thr
7.221ThrAla: 7.221 ± 1.099
0.174ThrCys: 0.174 ± 0.117
4.002ThrAsp: 4.002 ± 0.68
3.654ThrGlu: 3.654 ± 0.579
2.262ThrPhe: 2.262 ± 0.589
6.351ThrGly: 6.351 ± 0.604
1.044ThrHis: 1.044 ± 0.266
3.132ThrIle: 3.132 ± 0.549
1.566ThrLys: 1.566 ± 0.503
5.394ThrLeu: 5.394 ± 0.733
1.044ThrMet: 1.044 ± 0.309
2.436ThrAsn: 2.436 ± 0.428
4.524ThrPro: 4.524 ± 0.495
1.827ThrGln: 1.827 ± 0.365
3.828ThrArg: 3.828 ± 0.604
3.132ThrSer: 3.132 ± 0.717
4.089ThrThr: 4.089 ± 0.786
4.176ThrVal: 4.176 ± 0.763
0.783ThrTrp: 0.783 ± 0.208
1.392ThrTyr: 1.392 ± 0.281
0.0ThrXaa: 0.0 ± 0.0
Val
5.742ValAla: 5.742 ± 0.604
0.348ValCys: 0.348 ± 0.14
4.176ValAsp: 4.176 ± 0.64
4.437ValGlu: 4.437 ± 0.779
2.784ValPhe: 2.784 ± 0.5
5.22ValGly: 5.22 ± 0.517
0.609ValHis: 0.609 ± 0.204
2.436ValIle: 2.436 ± 0.483
1.131ValLys: 1.131 ± 0.332
4.089ValLeu: 4.089 ± 0.49
1.566ValMet: 1.566 ± 0.28
2.349ValAsn: 2.349 ± 0.431
2.871ValPro: 2.871 ± 0.426
2.697ValGln: 2.697 ± 0.488
5.394ValArg: 5.394 ± 0.948
5.046ValSer: 5.046 ± 0.781
5.307ValThr: 5.307 ± 0.878
3.828ValVal: 3.828 ± 0.414
1.131ValTrp: 1.131 ± 0.339
1.305ValTyr: 1.305 ± 0.307
0.0ValXaa: 0.0 ± 0.0
Trp
1.218TrpAla: 1.218 ± 0.33
0.261TrpCys: 0.261 ± 0.156
0.783TrpAsp: 0.783 ± 0.24
0.609TrpGlu: 0.609 ± 0.214
0.783TrpPhe: 0.783 ± 0.34
0.783TrpGly: 0.783 ± 0.26
0.522TrpHis: 0.522 ± 0.243
0.609TrpIle: 0.609 ± 0.225
0.609TrpLys: 0.609 ± 0.24
1.479TrpLeu: 1.479 ± 0.355
0.609TrpMet: 0.609 ± 0.249
0.435TrpAsn: 0.435 ± 0.203
0.348TrpPro: 0.348 ± 0.199
0.696TrpGln: 0.696 ± 0.301
2.436TrpArg: 2.436 ± 0.474
0.522TrpSer: 0.522 ± 0.199
0.609TrpThr: 0.609 ± 0.196
0.696TrpVal: 0.696 ± 0.224
0.087TrpTrp: 0.087 ± 0.085
0.261TrpTyr: 0.261 ± 0.192
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.827TyrAla: 1.827 ± 0.429
0.348TyrCys: 0.348 ± 0.198
0.783TyrAsp: 0.783 ± 0.256
1.74TyrGlu: 1.74 ± 0.356
0.957TyrPhe: 0.957 ± 0.278
2.523TyrGly: 2.523 ± 0.442
0.696TyrHis: 0.696 ± 0.213
0.957TyrIle: 0.957 ± 0.274
0.261TyrLys: 0.261 ± 0.148
1.653TyrLeu: 1.653 ± 0.345
0.348TyrMet: 0.348 ± 0.194
1.044TyrAsn: 1.044 ± 0.297
0.609TyrPro: 0.609 ± 0.244
1.218TyrGln: 1.218 ± 0.328
2.001TyrArg: 2.001 ± 0.533
1.392TyrSer: 1.392 ± 0.335
2.262TyrThr: 2.262 ± 0.515
1.044TyrVal: 1.044 ± 0.368
0.435TyrTrp: 0.435 ± 0.235
0.696TyrTyr: 0.696 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11495 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski