Amino acid dipeptide frequency for Nam Dinh virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
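As a rough illustration of what such a calculation might look like, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of dipeptide windows are assumptions made for this example; the exact counting convention used by the database is not stated here.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    `proteins` is an iterable of one-letter amino acid strings.
    Assumption: windows overlap (ABC -> AB, BC) and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Multiply by 1000 to express each fraction in per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input, "LL" occurs twice among seven dipeptide windows, so it gets 2/7 of the total, or roughly 285.7‰.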

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
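The uniform expectation and the over/under labelling can be sketched as follows. The helper names and the plain threshold comparison are assumptions for illustration; the precise rule used to colour the table cells is not described on this page. The two example values are the LeuLeu and CysCys entries from the table.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if every ordered pair were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


expected = uniform_expectation_permille()   # 20 amino acids -> 400 pairs
print(expected)                             # 2.5
print(classify(10.023, expected))           # LeuLeu -> over
print(classify(0.111, expected))            # CysCys -> under
```

With Xaa included, `uniform_expectation_permille(21)` gives 1000/441, or about 2.27‰.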

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 3.23 ± 1.3 | 1.336 ± 0.566 | 3.118 ± 0.419 | 3.23 ± 0.916 | 2.45 ± 0.902 | 1.114 ± 0.272 | 1.559 ± 0.294 | 6.125 ± 1.195 | 3.118 ± 0.733 | 6.125 ± 1.168 | 1.225 ± 1.22 | 3.453 ± 0.694 | 1.671 ± 0.273 | 2.339 ± 0.477 | 2.562 ± 0.577 | 3.118 ± 1.265 | 4.678 ± 0.816 | 2.116 ± 0.659 | 0.445 ± 0.189 | 4.343 ± 0.96 | 0.0 ± 0.0 |
| Cys | 0.668 ± 0.229 | 0.111 ± 0.069 | 1.114 ± 0.386 | 0.891 ± 0.321 | 0.78 ± 0.274 | 1.336 ± 0.304 | 0.445 ± 0.163 | 1.671 ± 0.49 | 2.116 ± 0.327 | 1.448 ± 0.435 | 0.668 ± 0.229 | 1.336 ± 0.336 | 1.002 ± 0.485 | 0.334 ± 0.207 | 0.557 ± 0.14 | 0.668 ± 1.347 | 2.562 ± 0.67 | 1.559 ± 0.357 | 0.0 ± 0.0 | 1.336 ± 0.356 | 0.0 ± 0.0 |
| Asp | 3.118 ± 0.954 | 1.114 ± 0.447 | 3.564 ± 0.519 | 2.005 ± 0.385 | 3.341 ± 0.76 | 1.336 ± 0.362 | 1.114 ± 0.366 | 4.566 ± 0.864 | 2.005 ± 0.579 | 5.791 ± 0.814 | 0.334 ± 0.189 | 4.121 ± 1.212 | 2.562 ± 0.6 | 1.893 ± 0.287 | 1.893 ± 0.52 | 3.564 ± 0.469 | 5.346 ± 0.985 | 2.673 ± 0.475 | 0.334 ± 0.114 | 3.341 ± 0.508 | 0.0 ± 0.0 |
| Glu | 1.893 ± 0.328 | 0.891 ± 0.208 | 2.005 ± 0.333 | 1.448 ± 0.25 | 3.898 ± 0.705 | 0.557 ± 0.185 | 2.45 ± 0.457 | 3.675 ± 0.913 | 2.784 ± 0.63 | 5.791 ± 1.277 | 0.891 ± 0.327 | 2.339 ± 0.395 | 2.339 ± 0.459 | 1.671 ± 0.395 | 0.668 ± 0.318 | 2.227 ± 0.481 | 2.896 ± 0.46 | 1.559 ± 0.398 | 0.111 ± 0.069 | 1.893 ± 0.489 | 0.0 ± 0.0 |
| Phe | 2.784 ± 0.734 | 1.002 ± 0.638 | 3.118 ± 0.608 | 2.116 ± 0.41 | 0.223 ± 0.094 | 2.562 ± 0.466 | 0.111 ± 0.069 | 3.23 ± 0.722 | 1.559 ± 0.577 | 3.341 ± 0.504 | 1.225 ± 0.422 | 3.453 ± 0.736 | 0.334 ± 0.189 | 0.78 ± 0.314 | 0.891 ± 0.466 | 2.45 ± 1.348 | 3.564 ± 1.406 | 3.564 ± 0.695 | 0.445 ± 0.189 | 1.893 ± 0.406 | 0.223 ± 0.094 |
| Gly | 1.448 ± 0.506 | 0.557 ± 0.198 | 1.225 ± 0.332 | 1.336 ± 0.431 | 1.782 ± 0.447 | 1.448 ± 0.292 | 0.557 ± 0.198 | 1.893 ± 0.407 | 2.562 ± 0.499 | 2.673 ± 0.914 | 0.445 ± 0.163 | 1.336 ± 0.343 | 1.559 ± 0.291 | 0.891 ± 0.329 | 1.114 ± 0.37 | 2.45 ± 1.134 | 2.45 ± 0.281 | 1.671 ± 0.765 | 0.445 ± 0.399 | 2.116 ± 0.397 | 0.0 ± 0.0 |
| His | 2.005 ± 0.384 | 0.891 ± 0.321 | 1.782 ± 0.461 | 1.002 ± 0.333 | 1.336 ± 0.311 | 0.891 ± 0.543 | 1.225 ± 0.281 | 2.562 ± 0.486 | 2.896 ± 0.828 | 3.564 ± 0.766 | 0.78 ± 0.292 | 3.007 ± 0.368 | 2.562 ± 0.535 | 1.559 ± 0.372 | 0.78 ± 0.314 | 0.891 ± 0.319 | 3.898 ± 0.75 | 1.448 ± 0.257 | 0.223 ± 0.138 | 1.893 ± 0.713 | 0.0 ± 0.0 |
| Ile | 4.121 ± 0.898 | 1.225 ± 0.45 | 4.566 ± 0.532 | 3.23 ± 0.638 | 2.562 ± 0.625 | 1.782 ± 0.398 | 1.559 ± 0.679 | 4.789 ± 0.911 | 4.343 ± 0.705 | 7.462 ± 0.375 | 3.118 ± 0.486 | 6.794 ± 1.23 | 3.787 ± 1.017 | 3.118 ± 0.449 | 3.453 ± 0.877 | 3.453 ± 0.68 | 6.125 ± 0.939 | 3.675 ± 0.498 | 0.668 ± 0.229 | 4.343 ± 1.077 | 0.0 ± 0.0 |
| Lys | 3.341 ± 0.326 | 1.114 ± 0.274 | 3.787 ± 0.529 | 1.782 ± 0.49 | 1.225 ± 0.411 | 1.336 ± 0.354 | 3.007 ± 0.865 | 3.898 ± 0.73 | 1.448 ± 0.418 | 6.571 ± 1.82 | 1.336 ± 0.569 | 3.787 ± 0.248 | 4.566 ± 0.741 | 4.009 ± 1.252 | 2.673 ± 0.429 | 3.341 ± 0.595 | 4.343 ± 0.509 | 3.341 ± 0.403 | 0.445 ± 0.144 | 4.343 ± 0.518 | 0.0 ± 0.0 |
| Leu | 5.791 ± 0.928 | 1.893 ± 0.526 | 5.569 ± 1.246 | 4.566 ± 0.665 | 4.343 ± 1.164 | 2.784 ± 0.608 | 3.453 ± 0.44 | 6.46 ± 0.738 | 7.907 ± 0.801 | 10.023 ± 0.878 | 3.118 ± 0.697 | 8.019 ± 1.499 | 3.118 ± 0.731 | 5.68 ± 0.688 | 5.68 ± 0.346 | 7.462 ± 1.24 | 6.794 ± 0.743 | 4.455 ± 0.494 | 0.78 ± 0.586 | 5.791 ± 1.104 | 0.0 ± 0.0 |
| Met | 1.559 ± 0.78 | 0.334 ± 0.114 | 1.559 ± 0.398 | 1.336 ± 0.303 | 1.114 ± 0.286 | 0.78 ± 0.561 | 0.445 ± 0.144 | 1.002 ± 0.312 | 0.891 ± 0.172 | 3.23 ± 1.615 | 0.668 ± 0.283 | 1.782 ± 0.461 | 1.114 ± 0.392 | 1.559 ± 0.294 | 0.557 ± 0.223 | 0.891 ± 0.288 | 1.336 ± 0.49 | 0.78 ± 0.147 | 0.0 ± 0.0 | 1.002 ± 0.203 | 0.0 ± 0.0 |
| Asn | 5.123 ± 2.176 | 1.336 ± 1.135 | 2.784 ± 0.681 | 2.784 ± 0.815 | 4.232 ± 1.598 | 2.896 ± 1.013 | 2.896 ± 0.578 | 5.012 ± 1.129 | 4.789 ± 0.831 | 7.239 ± 1.07 | 1.559 ± 0.465 | 6.46 ± 1.378 | 4.343 ± 1.12 | 3.341 ± 0.665 | 2.116 ± 0.493 | 2.673 ± 1.71 | 6.125 ± 1.176 | 4.343 ± 0.668 | 0.668 ± 0.229 | 4.789 ± 0.731 | 0.0 ± 0.0 |
| Pro | 3.118 ± 0.465 | 0.334 ± 0.176 | 2.005 ± 0.611 | 2.116 ± 0.297 | 0.78 ± 0.314 | 1.671 ± 0.544 | 1.671 ± 0.249 | 3.118 ± 0.7 | 2.896 ± 2.262 | 7.016 ± 0.825 | 0.223 ± 0.138 | 2.227 ± 0.648 | 1.782 ± 0.398 | 1.671 ± 0.519 | 1.782 ± 0.477 | 3.007 ± 0.867 | 4.343 ± 0.323 | 2.116 ± 0.421 | 0.334 ± 0.114 | 2.227 ± 0.992 | 0.0 ± 0.0 |
| Gln | 2.673 ± 1.041 | 0.891 ± 0.309 | 1.671 ± 0.519 | 1.671 ± 0.455 | 1.559 ± 0.542 | 1.114 ± 0.272 | 2.896 ± 0.34 | 2.227 ± 0.425 | 1.893 ± 0.237 | 5.68 ± 0.552 | 0.334 ± 0.114 | 3.007 ± 0.693 | 2.339 ± 1.986 | 1.893 ± 1.358 | 1.559 ± 0.584 | 2.227 ± 2.125 | 2.673 ± 0.378 | 2.005 ± 0.556 | 0.334 ± 0.176 | 2.784 ± 0.759 | 0.0 ± 0.0 |
| Arg | 1.782 ± 0.618 | 0.445 ± 0.163 | 2.562 ± 0.473 | 1.671 ± 0.371 | 0.891 ± 0.272 | 0.557 ± 0.198 | 0.891 ± 0.272 | 3.564 ± 1.223 | 3.564 ± 0.92 | 2.227 ± 0.403 | 0.445 ± 0.163 | 3.453 ± 1.091 | 1.114 ± 0.262 | 2.227 ± 0.726 | 2.339 ± 0.619 | 1.671 ± 1.082 | 3.118 ± 0.426 | 1.782 ± 0.583 | 0.445 ± 0.671 | 4.566 ± 1.435 | 0.0 ± 0.0 |
| Ser | 2.896 ± 0.631 | 1.225 ± 0.372 | 3.564 ± 0.579 | 4.009 ± 0.402 | 1.448 ± 0.357 | 1.336 ± 0.885 | 2.116 ± 0.871 | 4.121 ± 1.151 | 3.118 ± 1.02 | 4.232 ± 1.393 | 1.336 ± 0.918 | 4.232 ± 0.956 | 2.005 ± 1.711 | 2.116 ± 0.443 | 2.005 ± 0.581 | 4.566 ± 2.564 | 4.009 ± 1.59 | 2.45 ± 0.501 | 0.0 ± 0.0 | 4.455 ± 1.524 | 0.0 ± 0.0 |
| Thr | 4.455 ± 1.226 | 2.227 ± 0.403 | 3.675 ± 0.756 | 2.562 ± 0.539 | 2.116 ± 0.533 | 3.007 ± 0.741 | 2.562 ± 0.686 | 6.682 ± 1.076 | 5.234 ± 0.728 | 9.689 ± 0.982 | 1.782 ± 0.342 | 4.678 ± 0.78 | 4.343 ± 0.542 | 2.896 ± 1.148 | 3.453 ± 0.501 | 5.012 ± 1.404 | 9.912 ± 2.145 | 5.903 ± 1.178 | 0.445 ± 0.435 | 4.455 ± 0.596 | 0.111 ± 0.214 |
| Val | 3.118 ± 0.65 | 1.671 ± 0.395 | 2.339 ± 0.596 | 1.671 ± 0.594 | 3.118 ± 0.47 | 0.668 ± 0.298 | 3.007 ± 0.686 | 3.564 ± 0.82 | 3.23 ± 0.495 | 5.903 ± 1.35 | 0.668 ± 0.343 | 4.455 ± 0.969 | 1.893 ± 0.385 | 1.225 ± 1.243 | 1.893 ± 0.332 | 2.784 ± 0.419 | 4.009 ± 0.473 | 2.339 ± 0.432 | 0.111 ± 0.069 | 3.453 ± 0.396 | 0.0 ± 0.0 |
| Trp | 0.557 ± 0.393 | 0.0 ± 0.0 | 0.557 ± 0.372 | 0.0 ± 0.0 | 0.111 ± 0.409 | 0.111 ± 0.069 | 0.445 ± 0.379 | 0.557 ± 0.565 | 0.0 ± 0.0 | 0.78 ± 0.57 | 0.111 ± 0.069 | 0.334 ± 0.207 | 0.111 ± 0.214 | 0.334 ± 0.114 | 0.557 ± 0.319 | 0.334 ± 0.114 | 0.78 ± 0.288 | 0.223 ± 0.094 | 0.0 ± 0.0 | 0.668 ± 0.168 | 0.0 ± 0.0 |
| Tyr | 3.675 ± 0.796 | 2.005 ± 0.578 | 3.453 ± 0.381 | 2.562 ± 0.686 | 1.559 ± 0.503 | 2.673 ± 0.34 | 2.784 ± 0.912 | 5.457 ± 0.809 | 3.341 ± 0.665 | 4.9 ± 0.649 | 1.559 ± 0.375 | 7.35 ± 1.203 | 1.893 ± 0.649 | 1.893 ± 0.412 | 2.784 ± 0.546 | 2.339 ± 0.515 | 6.237 ± 0.588 | 2.896 ± 0.566 | 0.334 ± 0.634 | 3.675 ± 1.55 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.111 ± 0.214 | 0.0 ± 0.0 | 0.223 ± 0.094 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 6 proteins (8980 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
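One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of those estimates. The sketch below follows that reading; the function names, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are all assumptions, not details confirmed by the source.

```python
import random
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping windows."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "ALLK", "GGLLLL"]
print(f"LL: {pair_frequency(toy, 'LL'):.3f} ± {bootstrap_error(toy, 'LL'):.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a single protein shows a large error relative to its frequency.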

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski