Amino acid dipepetide frequency for SIV-wrc Pbt-05GM-X02

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.117AlaAla: 3.117 ± 1.14
1.247AlaCys: 1.247 ± 0.785
1.87AlaAsp: 1.87 ± 0.618
6.546AlaGlu: 6.546 ± 2.574
1.559AlaPhe: 1.559 ± 0.457
3.117AlaGly: 3.117 ± 0.554
0.935AlaHis: 0.935 ± 0.426
4.364AlaIle: 4.364 ± 0.965
2.494AlaLys: 2.494 ± 0.567
5.299AlaLeu: 5.299 ± 1.42
1.87AlaMet: 1.87 ± 0.822
1.559AlaAsn: 1.559 ± 0.846
2.805AlaPro: 2.805 ± 1.332
2.182AlaGln: 2.182 ± 0.704
2.494AlaArg: 2.494 ± 0.994
3.429AlaSer: 3.429 ± 0.679
4.052AlaThr: 4.052 ± 0.909
4.052AlaVal: 4.052 ± 1.269
1.87AlaTrp: 1.87 ± 1.032
2.182AlaTyr: 2.182 ± 0.72
0.0AlaXaa: 0.0 ± 0.0
Cys
1.559CysAla: 1.559 ± 0.81
0.935CysCys: 0.935 ± 0.885
0.623CysAsp: 0.623 ± 0.401
1.247CysGlu: 1.247 ± 0.727
0.935CysPhe: 0.935 ± 0.784
1.247CysGly: 1.247 ± 0.727
1.247CysHis: 1.247 ± 0.779
1.559CysIle: 1.559 ± 0.672
2.805CysLys: 2.805 ± 0.876
1.87CysLeu: 1.87 ± 0.879
0.0CysMet: 0.0 ± 0.0
1.559CysAsn: 1.559 ± 1.039
0.623CysPro: 0.623 ± 0.514
1.87CysGln: 1.87 ± 0.542
1.559CysArg: 1.559 ± 0.718
0.312CysSer: 0.312 ± 0.413
2.494CysThr: 2.494 ± 1.407
1.559CysVal: 1.559 ± 0.715
1.247CysTrp: 1.247 ± 0.679
0.623CysTyr: 0.623 ± 0.464
0.0CysXaa: 0.0 ± 0.0
Asp
1.87AspAla: 1.87 ± 0.961
1.87AspCys: 1.87 ± 0.619
1.87AspAsp: 1.87 ± 0.661
0.935AspGlu: 0.935 ± 0.458
2.182AspPhe: 2.182 ± 0.351
2.182AspGly: 2.182 ± 0.593
0.935AspHis: 0.935 ± 0.257
3.117AspIle: 3.117 ± 0.953
2.494AspLys: 2.494 ± 0.799
3.117AspLeu: 3.117 ± 1.219
0.623AspMet: 0.623 ± 0.259
0.623AspAsn: 0.623 ± 0.259
1.247AspPro: 1.247 ± 0.764
1.247AspGln: 1.247 ± 0.594
2.182AspArg: 2.182 ± 1.062
2.805AspSer: 2.805 ± 1.66
2.494AspThr: 2.494 ± 0.477
2.182AspVal: 2.182 ± 0.602
1.247AspTrp: 1.247 ± 1.026
0.935AspTyr: 0.935 ± 0.601
0.0AspXaa: 0.0 ± 0.0
Glu
4.052GluAla: 4.052 ± 1.702
2.182GluCys: 2.182 ± 0.712
4.676GluAsp: 4.676 ± 1.316
8.416GluGlu: 8.416 ± 1.397
0.935GluPhe: 0.935 ± 0.426
5.923GluGly: 5.923 ± 1.859
0.935GluHis: 0.935 ± 0.42
6.234GluIle: 6.234 ± 1.535
8.416GluLys: 8.416 ± 0.675
6.234GluLeu: 6.234 ± 0.669
1.87GluMet: 1.87 ± 0.682
3.117GluAsn: 3.117 ± 1.502
2.182GluPro: 2.182 ± 0.8
4.052GluGln: 4.052 ± 1.52
3.117GluArg: 3.117 ± 1.644
4.988GluSer: 4.988 ± 1.288
3.429GluThr: 3.429 ± 0.447
3.429GluVal: 3.429 ± 0.75
2.494GluTrp: 2.494 ± 0.946
1.559GluTyr: 1.559 ± 0.638
0.0GluXaa: 0.0 ± 0.0
Phe
1.247PheAla: 1.247 ± 0.475
0.312PheCys: 0.312 ± 0.257
0.312PheAsp: 0.312 ± 0.386
1.559PheGlu: 1.559 ± 0.551
2.182PhePhe: 2.182 ± 0.549
2.182PheGly: 2.182 ± 0.841
0.935PheHis: 0.935 ± 0.598
1.87PheIle: 1.87 ± 0.694
3.429PheLys: 3.429 ± 1.106
2.182PheLeu: 2.182 ± 0.756
0.312PheMet: 0.312 ± 0.257
2.182PheAsn: 2.182 ± 1.195
0.935PhePro: 0.935 ± 0.904
1.559PheGln: 1.559 ± 0.495
1.87PheArg: 1.87 ± 0.805
1.247PheSer: 1.247 ± 0.973
2.805PheThr: 2.805 ± 0.743
0.935PheVal: 0.935 ± 0.536
0.623PheTrp: 0.623 ± 0.39
0.312PheTyr: 0.312 ± 0.413
0.0PheXaa: 0.0 ± 0.0
Gly
4.364GlyAla: 4.364 ± 1.042
1.87GlyCys: 1.87 ± 0.595
2.494GlyAsp: 2.494 ± 0.567
3.117GlyGlu: 3.117 ± 1.188
2.182GlyPhe: 2.182 ± 1.007
4.988GlyGly: 4.988 ± 2.046
1.87GlyHis: 1.87 ± 0.914
5.299GlyIle: 5.299 ± 1.435
6.234GlyLys: 6.234 ± 1.935
6.234GlyLeu: 6.234 ± 1.834
1.559GlyMet: 1.559 ± 0.783
3.741GlyAsn: 3.741 ± 0.747
3.741GlyPro: 3.741 ± 2.221
2.494GlyGln: 2.494 ± 0.894
3.117GlyArg: 3.117 ± 1.307
4.988GlySer: 4.988 ± 1.094
3.429GlyThr: 3.429 ± 1.338
4.676GlyVal: 4.676 ± 1.358
1.87GlyTrp: 1.87 ± 0.858
2.494GlyTyr: 2.494 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
1.247HisAla: 1.247 ± 0.487
0.935HisCys: 0.935 ± 0.458
0.623HisAsp: 0.623 ± 0.364
0.623HisGlu: 0.623 ± 0.709
0.935HisPhe: 0.935 ± 0.904
0.935HisGly: 0.935 ± 0.393
0.312HisHis: 0.312 ± 0.257
2.182HisIle: 2.182 ± 1.064
1.247HisLys: 1.247 ± 0.542
3.429HisLeu: 3.429 ± 0.642
0.935HisMet: 0.935 ± 0.557
0.312HisAsn: 0.312 ± 0.237
2.494HisPro: 2.494 ± 1.056
1.87HisGln: 1.87 ± 0.621
1.247HisArg: 1.247 ± 0.632
1.247HisSer: 1.247 ± 1.044
1.559HisThr: 1.559 ± 0.864
1.247HisVal: 1.247 ± 0.455
0.312HisTrp: 0.312 ± 0.485
0.312HisTyr: 0.312 ± 0.413
0.0HisXaa: 0.0 ± 0.0
Ile
4.052IleAla: 4.052 ± 0.944
1.559IleCys: 1.559 ± 0.425
2.805IleAsp: 2.805 ± 1.036
4.676IleGlu: 4.676 ± 0.848
2.494IlePhe: 2.494 ± 1.358
6.546IleGly: 6.546 ± 1.159
1.87IleHis: 1.87 ± 0.674
3.429IleIle: 3.429 ± 2.244
4.052IleLys: 4.052 ± 1.311
3.117IleLeu: 3.117 ± 0.719
0.312IleMet: 0.312 ± 0.294
2.182IleAsn: 2.182 ± 0.655
2.805IlePro: 2.805 ± 1.626
4.988IleGln: 4.988 ± 1.233
6.546IleArg: 6.546 ± 0.879
4.052IleSer: 4.052 ± 1.017
3.741IleThr: 3.741 ± 0.935
4.052IleVal: 4.052 ± 0.571
1.87IleTrp: 1.87 ± 0.501
2.805IleTyr: 2.805 ± 0.66
0.0IleXaa: 0.0 ± 0.0
Lys
1.87LysAla: 1.87 ± 0.321
0.312LysCys: 0.312 ± 0.473
3.429LysAsp: 3.429 ± 0.886
9.663LysGlu: 9.663 ± 1.808
1.87LysPhe: 1.87 ± 0.845
4.988LysGly: 4.988 ± 1.087
0.935LysHis: 0.935 ± 0.516
6.234LysIle: 6.234 ± 1.924
6.234LysLys: 6.234 ± 1.721
8.416LysLeu: 8.416 ± 1.132
1.247LysMet: 1.247 ± 0.346
3.429LysAsn: 3.429 ± 0.447
2.494LysPro: 2.494 ± 0.752
4.676LysGln: 4.676 ± 1.281
5.299LysArg: 5.299 ± 1.206
2.182LysSer: 2.182 ± 0.954
4.364LysThr: 4.364 ± 0.671
2.805LysVal: 2.805 ± 0.757
1.247LysTrp: 1.247 ± 0.455
1.559LysTyr: 1.559 ± 0.633
0.0LysXaa: 0.0 ± 0.0
Leu
5.923LeuAla: 5.923 ± 1.604
4.052LeuCys: 4.052 ± 1.601
1.559LeuAsp: 1.559 ± 0.668
7.17LeuGlu: 7.17 ± 2.124
2.805LeuPhe: 2.805 ± 0.862
6.234LeuGly: 6.234 ± 1.12
1.87LeuHis: 1.87 ± 0.959
5.299LeuIle: 5.299 ± 1.281
4.052LeuLys: 4.052 ± 1.168
8.728LeuLeu: 8.728 ± 1.567
1.87LeuMet: 1.87 ± 0.54
2.494LeuAsn: 2.494 ± 0.84
3.429LeuPro: 3.429 ± 0.814
6.858LeuGln: 6.858 ± 1.774
5.299LeuArg: 5.299 ± 1.227
4.364LeuSer: 4.364 ± 0.984
4.364LeuThr: 4.364 ± 1.038
5.611LeuVal: 5.611 ± 1.147
1.247LeuTrp: 1.247 ± 0.541
2.805LeuTyr: 2.805 ± 0.867
0.0LeuXaa: 0.0 ± 0.0
Met
1.559MetAla: 1.559 ± 0.653
0.312MetCys: 0.312 ± 0.257
1.247MetAsp: 1.247 ± 0.638
1.559MetGlu: 1.559 ± 0.356
0.935MetPhe: 0.935 ± 0.458
1.559MetGly: 1.559 ± 0.476
0.623MetHis: 0.623 ± 0.445
1.87MetIle: 1.87 ± 0.534
0.935MetLys: 0.935 ± 0.704
1.559MetLeu: 1.559 ± 0.763
0.312MetMet: 0.312 ± 0.257
0.623MetAsn: 0.623 ± 0.364
0.312MetPro: 0.312 ± 0.294
2.494MetGln: 2.494 ± 0.818
0.312MetArg: 0.312 ± 0.294
0.935MetSer: 0.935 ± 0.546
1.247MetThr: 1.247 ± 0.435
0.935MetVal: 0.935 ± 0.486
0.312MetTrp: 0.312 ± 0.237
0.312MetTyr: 0.312 ± 0.237
0.0MetXaa: 0.0 ± 0.0
Asn
1.559AsnAla: 1.559 ± 0.685
0.623AsnCys: 0.623 ± 0.588
0.312AsnAsp: 0.312 ± 0.237
3.117AsnGlu: 3.117 ± 1.08
2.182AsnPhe: 2.182 ± 0.574
1.87AsnGly: 1.87 ± 0.911
1.559AsnHis: 1.559 ± 1.039
3.741AsnIle: 3.741 ± 0.769
2.494AsnLys: 2.494 ± 0.612
4.052AsnLeu: 4.052 ± 1.143
1.559AsnMet: 1.559 ± 0.421
3.429AsnAsn: 3.429 ± 0.909
1.87AsnPro: 1.87 ± 0.515
0.623AsnGln: 0.623 ± 0.474
2.494AsnArg: 2.494 ± 0.735
3.117AsnSer: 3.117 ± 1.221
3.117AsnThr: 3.117 ± 0.682
1.559AsnVal: 1.559 ± 0.699
1.247AsnTrp: 1.247 ± 0.519
1.559AsnTyr: 1.559 ± 0.657
0.0AsnXaa: 0.0 ± 0.0
Pro
2.182ProAla: 2.182 ± 0.744
2.182ProCys: 2.182 ± 0.616
1.87ProAsp: 1.87 ± 0.568
4.052ProGlu: 4.052 ± 1.367
0.623ProPhe: 0.623 ± 0.285
3.741ProGly: 3.741 ± 0.772
0.623ProHis: 0.623 ± 0.259
2.182ProIle: 2.182 ± 1.034
2.494ProLys: 2.494 ± 0.748
4.052ProLeu: 4.052 ± 1.439
0.623ProMet: 0.623 ± 0.306
0.623ProAsn: 0.623 ± 0.709
4.052ProPro: 4.052 ± 1.421
3.117ProGln: 3.117 ± 0.614
2.805ProArg: 2.805 ± 1.032
1.87ProSer: 1.87 ± 0.619
3.429ProThr: 3.429 ± 1.496
3.429ProVal: 3.429 ± 1.317
0.935ProTrp: 0.935 ± 0.536
1.247ProTyr: 1.247 ± 0.519
0.0ProXaa: 0.0 ± 0.0
Gln
6.234GlnAla: 6.234 ± 2.142
0.623GlnCys: 0.623 ± 0.486
0.623GlnAsp: 0.623 ± 0.259
4.676GlnGlu: 4.676 ± 0.603
1.247GlnPhe: 1.247 ± 0.552
6.234GlnGly: 6.234 ± 1.618
1.87GlnHis: 1.87 ± 1.09
3.429GlnIle: 3.429 ± 0.925
5.611GlnLys: 5.611 ± 1.136
3.117GlnLeu: 3.117 ± 1.248
1.247GlnMet: 1.247 ± 0.445
1.559GlnAsn: 1.559 ± 0.852
0.935GlnPro: 0.935 ± 0.442
6.546GlnGln: 6.546 ± 1.253
4.364GlnArg: 4.364 ± 2.211
2.805GlnSer: 2.805 ± 0.442
2.494GlnThr: 2.494 ± 0.752
2.494GlnVal: 2.494 ± 0.566
3.117GlnTrp: 3.117 ± 1.035
2.182GlnTyr: 2.182 ± 0.655
0.0GlnXaa: 0.0 ± 0.0
Arg
3.741ArgAla: 3.741 ± 1.099
1.87ArgCys: 1.87 ± 0.861
1.559ArgAsp: 1.559 ± 0.84
4.676ArgGlu: 4.676 ± 1.533
1.559ArgPhe: 1.559 ± 0.93
5.299ArgGly: 5.299 ± 1.622
0.935ArgHis: 0.935 ± 0.701
4.364ArgIle: 4.364 ± 2.045
4.052ArgLys: 4.052 ± 0.808
3.429ArgLeu: 3.429 ± 1.457
1.559ArgMet: 1.559 ± 0.695
3.429ArgAsn: 3.429 ± 1.186
1.87ArgPro: 1.87 ± 0.952
2.494ArgGln: 2.494 ± 0.865
5.299ArgArg: 5.299 ± 3.133
2.494ArgSer: 2.494 ± 0.869
3.429ArgThr: 3.429 ± 0.976
2.805ArgVal: 2.805 ± 0.688
1.247ArgTrp: 1.247 ± 0.377
1.559ArgTyr: 1.559 ± 1.338
0.0ArgXaa: 0.0 ± 0.0
Ser
4.052SerAla: 4.052 ± 0.872
0.935SerCys: 0.935 ± 0.662
3.741SerAsp: 3.741 ± 0.773
2.494SerGlu: 2.494 ± 1.007
1.247SerPhe: 1.247 ± 0.817
3.117SerGly: 3.117 ± 0.794
1.559SerHis: 1.559 ± 0.54
2.805SerIle: 2.805 ± 1.01
4.052SerLys: 4.052 ± 0.832
5.923SerLeu: 5.923 ± 1.613
0.623SerMet: 0.623 ± 0.513
1.559SerAsn: 1.559 ± 0.77
3.117SerPro: 3.117 ± 1.229
4.052SerGln: 4.052 ± 0.727
2.182SerArg: 2.182 ± 0.602
5.923SerSer: 5.923 ± 1.145
2.182SerThr: 2.182 ± 0.558
2.494SerVal: 2.494 ± 0.852
1.247SerTrp: 1.247 ± 0.755
3.117SerTyr: 3.117 ± 1.323
0.0SerXaa: 0.0 ± 0.0
Thr
3.429ThrAla: 3.429 ± 0.462
1.559ThrCys: 1.559 ± 0.666
3.117ThrAsp: 3.117 ± 0.975
4.676ThrGlu: 4.676 ± 1.135
1.559ThrPhe: 1.559 ± 0.99
4.676ThrGly: 4.676 ± 0.566
1.559ThrHis: 1.559 ± 0.466
3.429ThrIle: 3.429 ± 0.612
2.805ThrLys: 2.805 ± 1.168
5.611ThrLeu: 5.611 ± 1.776
1.559ThrMet: 1.559 ± 0.653
2.805ThrAsn: 2.805 ± 1.223
4.052ThrPro: 4.052 ± 1.331
3.117ThrGln: 3.117 ± 0.85
3.117ThrArg: 3.117 ± 1.326
2.494ThrSer: 2.494 ± 0.994
2.182ThrThr: 2.182 ± 1.15
3.429ThrVal: 3.429 ± 1.133
2.182ThrTrp: 2.182 ± 0.963
0.312ThrTyr: 0.312 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
2.182ValAla: 2.182 ± 0.729
1.247ValCys: 1.247 ± 0.616
2.182ValAsp: 2.182 ± 0.555
4.988ValGlu: 4.988 ± 0.918
0.623ValPhe: 0.623 ± 0.586
2.182ValGly: 2.182 ± 0.45
0.935ValHis: 0.935 ± 0.673
2.182ValIle: 2.182 ± 0.95
4.988ValLys: 4.988 ± 1.743
5.611ValLeu: 5.611 ± 0.925
0.312ValMet: 0.312 ± 0.237
2.805ValAsn: 2.805 ± 0.926
4.676ValPro: 4.676 ± 1.246
4.364ValGln: 4.364 ± 0.932
2.805ValArg: 2.805 ± 0.99
4.364ValSer: 4.364 ± 1.498
2.494ValThr: 2.494 ± 1.413
2.182ValVal: 2.182 ± 0.667
1.247ValTrp: 1.247 ± 0.679
1.247ValTyr: 1.247 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
1.559TrpAla: 1.559 ± 0.664
0.312TrpCys: 0.312 ± 0.386
0.935TrpAsp: 0.935 ± 0.42
2.494TrpGlu: 2.494 ± 1.252
0.312TrpPhe: 0.312 ± 0.257
2.805TrpGly: 2.805 ± 0.754
0.935TrpHis: 0.935 ± 0.572
1.87TrpIle: 1.87 ± 1.306
1.87TrpLys: 1.87 ± 1.202
1.559TrpLeu: 1.559 ± 0.584
0.623TrpMet: 0.623 ± 0.274
2.182TrpAsn: 2.182 ± 1.083
0.935TrpPro: 0.935 ± 0.711
2.494TrpGln: 2.494 ± 0.821
0.312TrpArg: 0.312 ± 0.237
1.559TrpSer: 1.559 ± 0.646
1.559TrpThr: 1.559 ± 0.457
1.87TrpVal: 1.87 ± 0.691
0.935TrpTrp: 0.935 ± 0.426
0.623TrpTyr: 0.623 ± 0.259
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.247TyrAla: 1.247 ± 0.685
0.935TyrCys: 0.935 ± 0.546
0.312TyrAsp: 0.312 ± 0.257
1.559TyrGlu: 1.559 ± 0.792
0.623TyrPhe: 0.623 ± 0.445
0.935TyrGly: 0.935 ± 0.458
1.559TyrHis: 1.559 ± 0.839
2.494TyrIle: 2.494 ± 0.858
2.494TyrLys: 2.494 ± 1.004
2.805TyrLeu: 2.805 ± 0.523
0.623TyrMet: 0.623 ± 0.259
1.87TyrAsn: 1.87 ± 0.534
1.559TyrPro: 1.559 ± 0.742
0.312TyrGln: 0.312 ± 0.237
1.247TyrArg: 1.247 ± 0.648
1.247TyrSer: 1.247 ± 0.817
2.805TyrThr: 2.805 ± 0.715
1.87TyrVal: 1.87 ± 0.555
1.247TyrTrp: 1.247 ± 0.958
1.247TyrTyr: 1.247 ± 0.519
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3209 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski