Amino acid dipepetide frequency for Le Dantec virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.693AlaAla: 2.693 ± 1.501
0.808AlaCys: 0.808 ± 0.508
3.232AlaAsp: 3.232 ± 1.585
1.347AlaGlu: 1.347 ± 0.843
1.616AlaPhe: 1.616 ± 0.297
2.693AlaGly: 2.693 ± 0.899
1.347AlaHis: 1.347 ± 0.789
1.885AlaIle: 1.885 ± 0.257
1.347AlaLys: 1.347 ± 0.493
5.386AlaLeu: 5.386 ± 0.729
0.808AlaMet: 0.808 ± 0.732
1.347AlaAsn: 1.347 ± 0.804
1.885AlaPro: 1.885 ± 1.203
2.155AlaGln: 2.155 ± 0.452
2.155AlaArg: 2.155 ± 0.552
2.424AlaSer: 2.424 ± 0.594
3.501AlaThr: 3.501 ± 0.937
1.077AlaVal: 1.077 ± 0.68
0.539AlaTrp: 0.539 ± 0.34
1.616AlaTyr: 1.616 ± 0.732
0.0AlaXaa: 0.0 ± 0.0
Cys
0.539CysAla: 0.539 ± 0.34
0.539CysCys: 0.539 ± 0.243
0.808CysAsp: 0.808 ± 0.515
1.347CysGlu: 1.347 ± 1.064
1.077CysPhe: 1.077 ± 0.374
1.077CysGly: 1.077 ± 0.569
0.808CysHis: 0.808 ± 0.673
2.155CysIle: 2.155 ± 0.604
1.616CysLys: 1.616 ± 0.472
1.077CysLeu: 1.077 ± 0.552
0.269CysMet: 0.269 ± 0.38
1.347CysAsn: 1.347 ± 0.491
0.539CysPro: 0.539 ± 0.243
0.0CysGln: 0.0 ± 0.0
0.539CysArg: 0.539 ± 0.564
1.885CysSer: 1.885 ± 0.787
0.808CysThr: 0.808 ± 0.331
1.616CysVal: 1.616 ± 0.393
0.539CysTrp: 0.539 ± 0.284
0.808CysTyr: 0.808 ± 0.282
0.0CysXaa: 0.0 ± 0.0
Asp
1.077AspAla: 1.077 ± 0.557
1.347AspCys: 1.347 ± 0.709
5.117AspAsp: 5.117 ± 1.187
5.656AspGlu: 5.656 ± 1.568
3.232AspPhe: 3.232 ± 1.182
2.424AspGly: 2.424 ± 0.633
1.885AspHis: 1.885 ± 0.647
3.232AspIle: 3.232 ± 0.654
4.579AspLys: 4.579 ± 1.046
6.733AspLeu: 6.733 ± 0.708
3.501AspMet: 3.501 ± 0.887
2.963AspAsn: 2.963 ± 0.728
2.693AspPro: 2.693 ± 0.41
2.155AspGln: 2.155 ± 1.495
4.309AspArg: 4.309 ± 1.024
4.04AspSer: 4.04 ± 1.903
1.347AspThr: 1.347 ± 0.468
3.232AspVal: 3.232 ± 1.591
0.808AspTrp: 0.808 ± 0.331
2.963AspTyr: 2.963 ± 1.043
0.0AspXaa: 0.0 ± 0.0
Glu
2.424GluAla: 2.424 ± 1.036
1.885GluCys: 1.885 ± 0.647
4.848GluAsp: 4.848 ± 0.359
7.81GluGlu: 7.81 ± 2.281
3.232GluPhe: 3.232 ± 0.951
3.501GluGly: 3.501 ± 0.796
2.424GluHis: 2.424 ± 0.515
6.194GluIle: 6.194 ± 1.03
4.579GluLys: 4.579 ± 0.886
5.925GluLeu: 5.925 ± 1.099
1.077GluMet: 1.077 ± 0.487
2.155GluAsn: 2.155 ± 1.125
2.424GluPro: 2.424 ± 0.882
0.808GluGln: 0.808 ± 0.365
2.963GluArg: 2.963 ± 0.943
4.309GluSer: 4.309 ± 1.346
3.232GluThr: 3.232 ± 0.889
2.693GluVal: 2.693 ± 0.849
0.808GluTrp: 0.808 ± 0.331
3.232GluTyr: 3.232 ± 0.507
0.0GluXaa: 0.0 ± 0.0
Phe
2.424PheAla: 2.424 ± 1.231
0.808PheCys: 0.808 ± 0.732
1.885PheAsp: 1.885 ± 0.485
2.155PheGlu: 2.155 ± 0.653
2.155PhePhe: 2.155 ± 0.794
2.424PheGly: 2.424 ± 0.961
0.539PheHis: 0.539 ± 0.564
3.501PheIle: 3.501 ± 0.961
5.117PheLys: 5.117 ± 0.72
5.656PheLeu: 5.656 ± 1.982
1.347PheMet: 1.347 ± 0.693
1.347PheAsn: 1.347 ± 0.493
2.693PhePro: 2.693 ± 0.41
2.155PheGln: 2.155 ± 0.58
2.963PheArg: 2.963 ± 0.472
4.309PheSer: 4.309 ± 0.893
1.885PheThr: 1.885 ± 0.748
2.963PheVal: 2.963 ± 0.863
0.808PheTrp: 0.808 ± 0.282
0.808PheTyr: 0.808 ± 1.028
0.0PheXaa: 0.0 ± 0.0
Gly
1.347GlyAla: 1.347 ± 0.472
1.347GlyCys: 1.347 ± 0.284
4.579GlyAsp: 4.579 ± 1.01
2.963GlyGlu: 2.963 ± 0.535
5.117GlyPhe: 5.117 ± 0.808
5.386GlyGly: 5.386 ± 1.658
1.347GlyHis: 1.347 ± 0.331
4.579GlyIle: 4.579 ± 1.454
3.771GlyLys: 3.771 ± 1.2
7.272GlyLeu: 7.272 ± 1.531
1.616GlyMet: 1.616 ± 0.564
2.424GlyAsn: 2.424 ± 0.615
2.693GlyPro: 2.693 ± 0.787
1.616GlyGln: 1.616 ± 0.393
1.885GlyArg: 1.885 ± 0.722
4.848GlySer: 4.848 ± 1.152
3.232GlyThr: 3.232 ± 0.82
3.501GlyVal: 3.501 ± 0.467
0.539GlyTrp: 0.539 ± 0.33
1.885GlyTyr: 1.885 ± 0.753
0.0GlyXaa: 0.0 ± 0.0
His
2.155HisAla: 2.155 ± 0.825
0.269HisCys: 0.269 ± 0.142
1.885HisAsp: 1.885 ± 0.412
2.424HisGlu: 2.424 ± 1.269
1.077HisPhe: 1.077 ± 0.424
1.347HisGly: 1.347 ± 0.507
1.077HisHis: 1.077 ± 0.487
2.693HisIle: 2.693 ± 0.81
1.347HisLys: 1.347 ± 0.507
2.693HisLeu: 2.693 ± 0.677
0.269HisMet: 0.269 ± 0.38
0.808HisAsn: 0.808 ± 0.427
2.155HisPro: 2.155 ± 0.452
0.808HisGln: 0.808 ± 0.515
2.424HisArg: 2.424 ± 0.595
0.808HisSer: 0.808 ± 0.508
1.077HisThr: 1.077 ± 0.69
2.155HisVal: 2.155 ± 0.79
0.539HisTrp: 0.539 ± 0.284
0.808HisTyr: 0.808 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
2.424IleAla: 2.424 ± 0.643
0.808IleCys: 0.808 ± 0.282
4.579IleAsp: 4.579 ± 1.142
4.309IleGlu: 4.309 ± 0.958
2.155IlePhe: 2.155 ± 0.452
3.501IleGly: 3.501 ± 0.767
1.616IleHis: 1.616 ± 0.618
4.04IleIle: 4.04 ± 0.894
7.81IleLys: 7.81 ± 1.413
5.925IleLeu: 5.925 ± 1.002
1.616IleMet: 1.616 ± 0.444
5.656IleAsn: 5.656 ± 0.854
2.693IlePro: 2.693 ± 1.138
1.885IleGln: 1.885 ± 0.809
4.309IleArg: 4.309 ± 1.155
5.656IleSer: 5.656 ± 1.17
4.309IleThr: 4.309 ± 0.856
2.155IleVal: 2.155 ± 0.818
1.077IleTrp: 1.077 ± 0.428
2.424IleTyr: 2.424 ± 1.311
0.0IleXaa: 0.0 ± 0.0
Lys
3.232LysAla: 3.232 ± 1.37
1.347LysCys: 1.347 ± 0.507
4.04LysAsp: 4.04 ± 2.223
7.272LysGlu: 7.272 ± 1.418
2.424LysPhe: 2.424 ± 1.005
4.848LysGly: 4.848 ± 0.46
1.077LysHis: 1.077 ± 0.324
4.309LysIle: 4.309 ± 0.75
7.272LysLys: 7.272 ± 1.242
5.386LysLeu: 5.386 ± 1.16
1.077LysMet: 1.077 ± 0.424
5.117LysAsn: 5.117 ± 0.859
3.232LysPro: 3.232 ± 0.745
2.693LysGln: 2.693 ± 1.449
4.309LysArg: 4.309 ± 0.525
5.386LysSer: 5.386 ± 1.099
3.232LysThr: 3.232 ± 1.501
3.501LysVal: 3.501 ± 0.5
1.616LysTrp: 1.616 ± 0.618
1.347LysTyr: 1.347 ± 0.427
0.0LysXaa: 0.0 ± 0.0
Leu
3.501LeuAla: 3.501 ± 0.722
2.424LeuCys: 2.424 ± 0.424
4.309LeuAsp: 4.309 ± 1.335
4.848LeuGlu: 4.848 ± 1.03
2.424LeuPhe: 2.424 ± 0.679
7.541LeuGly: 7.541 ± 0.714
2.963LeuHis: 2.963 ± 1.151
8.349LeuIle: 8.349 ± 2.092
5.925LeuLys: 5.925 ± 0.555
8.08LeuLeu: 8.08 ± 0.849
4.04LeuMet: 4.04 ± 1.067
6.194LeuAsn: 6.194 ± 1.67
2.693LeuPro: 2.693 ± 1.323
2.424LeuGln: 2.424 ± 0.861
5.656LeuArg: 5.656 ± 2.001
11.312LeuSer: 11.312 ± 0.877
5.656LeuThr: 5.656 ± 1.492
6.194LeuVal: 6.194 ± 1.147
1.077LeuTrp: 1.077 ± 0.552
2.693LeuTyr: 2.693 ± 0.888
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 0.663
0.269MetCys: 0.269 ± 0.142
1.885MetAsp: 1.885 ± 0.474
2.424MetGlu: 2.424 ± 0.778
1.885MetPhe: 1.885 ± 0.655
2.155MetGly: 2.155 ± 0.55
0.269MetHis: 0.269 ± 0.142
1.885MetIle: 1.885 ± 0.722
1.077MetLys: 1.077 ± 0.538
2.155MetLeu: 2.155 ± 0.649
0.539MetMet: 0.539 ± 0.284
1.347MetAsn: 1.347 ± 0.472
0.808MetPro: 0.808 ± 0.828
0.539MetGln: 0.539 ± 0.284
0.539MetArg: 0.539 ± 0.805
1.885MetSer: 1.885 ± 0.722
2.693MetThr: 2.693 ± 0.806
1.347MetVal: 1.347 ± 0.804
0.808MetTrp: 0.808 ± 0.508
1.347MetTyr: 1.347 ± 0.697
0.0MetXaa: 0.0 ± 0.0
Asn
1.616AsnAla: 1.616 ± 1.037
0.0AsnCys: 0.0 ± 0.0
2.155AsnAsp: 2.155 ± 0.531
3.501AsnGlu: 3.501 ± 1.205
2.693AsnPhe: 2.693 ± 1.176
2.155AsnGly: 2.155 ± 0.931
2.424AsnHis: 2.424 ± 1.28
3.232AsnIle: 3.232 ± 0.75
1.616AsnLys: 1.616 ± 0.432
6.194AsnLeu: 6.194 ± 0.939
1.885AsnMet: 1.885 ± 0.666
1.616AsnAsn: 1.616 ± 0.578
2.693AsnPro: 2.693 ± 0.334
2.424AsnGln: 2.424 ± 0.673
0.808AsnArg: 0.808 ± 0.336
4.579AsnSer: 4.579 ± 1.11
1.347AsnThr: 1.347 ± 0.711
2.424AsnVal: 2.424 ± 0.515
1.077AsnTrp: 1.077 ± 0.374
1.616AsnTyr: 1.616 ± 0.564
0.0AsnXaa: 0.0 ± 0.0
Pro
3.232ProAla: 3.232 ± 0.847
0.808ProCys: 0.808 ± 0.403
3.501ProAsp: 3.501 ± 0.892
2.155ProGlu: 2.155 ± 0.653
0.539ProPhe: 0.539 ± 0.243
2.424ProGly: 2.424 ± 0.505
1.347ProHis: 1.347 ± 0.472
3.501ProIle: 3.501 ± 1.417
3.771ProLys: 3.771 ± 1.311
3.771ProLeu: 3.771 ± 0.976
1.077ProMet: 1.077 ± 0.797
0.808ProAsn: 0.808 ± 0.331
2.963ProPro: 2.963 ± 1.552
0.808ProGln: 0.808 ± 0.427
0.808ProArg: 0.808 ± 0.331
4.04ProSer: 4.04 ± 0.855
2.963ProThr: 2.963 ± 0.698
1.616ProVal: 1.616 ± 0.607
0.808ProTrp: 0.808 ± 0.427
1.885ProTyr: 1.885 ± 0.648
0.0ProXaa: 0.0 ± 0.0
Gln
1.347GlnAla: 1.347 ± 0.388
1.077GlnCys: 1.077 ± 0.599
1.077GlnAsp: 1.077 ± 0.569
0.808GlnGlu: 0.808 ± 0.376
2.155GlnPhe: 2.155 ± 0.825
2.424GlnGly: 2.424 ± 0.528
0.269GlnHis: 0.269 ± 0.38
2.693GlnIle: 2.693 ± 0.711
1.885GlnLys: 1.885 ± 0.435
2.963GlnLeu: 2.963 ± 0.801
1.077GlnMet: 1.077 ± 0.68
1.077GlnAsn: 1.077 ± 0.374
1.077GlnPro: 1.077 ± 0.397
0.539GlnGln: 0.539 ± 0.284
1.885GlnArg: 1.885 ± 0.787
1.347GlnSer: 1.347 ± 0.711
0.539GlnThr: 0.539 ± 0.284
2.693GlnVal: 2.693 ± 1.502
0.269GlnTrp: 0.269 ± 0.282
0.808GlnTyr: 0.808 ± 0.365
0.0GlnXaa: 0.0 ± 0.0
Arg
2.424ArgAla: 2.424 ± 0.656
1.347ArgCys: 1.347 ± 0.507
2.424ArgAsp: 2.424 ± 0.594
4.04ArgGlu: 4.04 ± 1.542
4.04ArgPhe: 4.04 ± 0.814
3.232ArgGly: 3.232 ± 0.915
2.693ArgHis: 2.693 ± 0.62
2.693ArgIle: 2.693 ± 1.422
3.232ArgLys: 3.232 ± 0.799
3.232ArgLeu: 3.232 ± 0.883
0.808ArgMet: 0.808 ± 0.628
1.347ArgAsn: 1.347 ± 0.331
1.885ArgPro: 1.885 ± 0.412
1.077ArgGln: 1.077 ± 0.428
2.963ArgArg: 2.963 ± 1.109
4.04ArgSer: 4.04 ± 1.181
3.501ArgThr: 3.501 ± 0.626
3.501ArgVal: 3.501 ± 0.561
1.347ArgTrp: 1.347 ± 0.491
2.155ArgTyr: 2.155 ± 0.567
0.0ArgXaa: 0.0 ± 0.0
Ser
3.501SerAla: 3.501 ± 0.649
0.269SerCys: 0.269 ± 0.142
7.002SerAsp: 7.002 ± 1.537
5.386SerGlu: 5.386 ± 1.093
3.232SerPhe: 3.232 ± 1.195
5.117SerGly: 5.117 ± 1.142
1.885SerHis: 1.885 ± 0.793
5.117SerIle: 5.117 ± 1.565
5.656SerLys: 5.656 ± 1.466
7.272SerLeu: 7.272 ± 1.1
1.616SerMet: 1.616 ± 0.556
3.501SerAsn: 3.501 ± 0.773
3.501SerPro: 3.501 ± 0.561
2.155SerGln: 2.155 ± 0.649
4.579SerArg: 4.579 ± 0.731
7.272SerSer: 7.272 ± 1.847
4.579SerThr: 4.579 ± 1.038
4.848SerVal: 4.848 ± 1.158
0.808SerTrp: 0.808 ± 0.282
2.693SerTyr: 2.693 ± 0.921
0.0SerXaa: 0.0 ± 0.0
Thr
1.347ThrAla: 1.347 ± 1.12
1.885ThrCys: 1.885 ± 0.435
2.155ThrAsp: 2.155 ± 0.621
2.424ThrGlu: 2.424 ± 0.505
1.885ThrPhe: 1.885 ± 0.608
3.771ThrGly: 3.771 ± 0.659
1.616ThrHis: 1.616 ± 0.683
3.771ThrIle: 3.771 ± 0.851
2.963ThrLys: 2.963 ± 1.306
5.117ThrLeu: 5.117 ± 1.035
1.885ThrMet: 1.885 ± 0.821
2.424ThrAsn: 2.424 ± 0.528
0.808ThrPro: 0.808 ± 0.376
1.347ThrGln: 1.347 ± 0.939
4.04ThrArg: 4.04 ± 0.927
3.232ThrSer: 3.232 ± 0.871
3.501ThrThr: 3.501 ± 1.276
3.501ThrVal: 3.501 ± 0.548
1.885ThrTrp: 1.885 ± 0.412
1.347ThrTyr: 1.347 ± 0.491
0.0ThrXaa: 0.0 ± 0.0
Val
1.616ValAla: 1.616 ± 0.73
1.616ValCys: 1.616 ± 0.618
4.309ValAsp: 4.309 ± 0.965
2.693ValGlu: 2.693 ± 0.987
3.232ValPhe: 3.232 ± 0.677
3.232ValGly: 3.232 ± 0.989
1.885ValHis: 1.885 ± 0.657
3.232ValIle: 3.232 ± 1.008
3.771ValLys: 3.771 ± 1.823
7.81ValLeu: 7.81 ± 0.703
1.885ValMet: 1.885 ± 0.657
2.424ValAsn: 2.424 ± 0.682
2.693ValPro: 2.693 ± 0.908
1.885ValGln: 1.885 ± 0.657
2.424ValArg: 2.424 ± 0.604
4.579ValSer: 4.579 ± 0.614
1.885ValThr: 1.885 ± 1.193
2.424ValVal: 2.424 ± 0.965
0.269ValTrp: 0.269 ± 0.403
1.347ValTyr: 1.347 ± 0.794
0.0ValXaa: 0.0 ± 0.0
Trp
0.808TrpAla: 0.808 ± 0.365
0.0TrpCys: 0.0 ± 0.0
0.808TrpAsp: 0.808 ± 0.508
1.885TrpGlu: 1.885 ± 0.7
1.347TrpPhe: 1.347 ± 0.829
1.347TrpGly: 1.347 ± 0.711
0.269TrpHis: 0.269 ± 0.142
1.077TrpIle: 1.077 ± 0.428
1.616TrpLys: 1.616 ± 0.618
1.616TrpLeu: 1.616 ± 1.015
0.269TrpMet: 0.269 ± 0.142
0.539TrpAsn: 0.539 ± 0.284
0.808TrpPro: 0.808 ± 0.331
0.0TrpGln: 0.0 ± 0.0
0.539TrpArg: 0.539 ± 0.284
1.077TrpSer: 1.077 ± 0.397
0.269TrpThr: 0.269 ± 0.142
1.347TrpVal: 1.347 ± 0.444
0.269TrpTrp: 0.269 ± 0.142
0.539TrpTyr: 0.539 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.539TyrAla: 0.539 ± 0.284
0.539TyrCys: 0.539 ± 0.243
2.693TyrAsp: 2.693 ± 1.36
1.077TyrGlu: 1.077 ± 0.374
2.424TyrPhe: 2.424 ± 0.608
1.347TyrGly: 1.347 ± 0.468
1.077TyrHis: 1.077 ± 0.374
0.808TyrIle: 0.808 ± 1.217
3.771TyrLys: 3.771 ± 0.551
4.04TyrLeu: 4.04 ± 1.461
0.539TyrMet: 0.539 ± 0.805
1.616TyrAsn: 1.616 ± 1.096
2.155TyrPro: 2.155 ± 0.339
0.539TyrGln: 0.539 ± 0.284
1.885TyrArg: 1.885 ± 0.643
2.963TyrSer: 2.963 ± 0.466
1.347TyrThr: 1.347 ± 0.284
2.424TyrVal: 2.424 ± 0.882
0.539TyrTrp: 0.539 ± 0.243
1.077TyrTyr: 1.077 ± 0.68
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3714 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski