Amino acid dipeptide frequency for Oak-Vale virus

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
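The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein only are assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


toy_proteins = ["MKLLSA", "ALLK"]  # made-up sequences, not Oak-Vale virus data
freqs = dipeptide_per_mille(toy_proteins)
print(freqs["LL"], classify(freqs["LL"]))
print(freqs["WW"], classify(freqs["WW"]))
```

Whether the table's red and blue marking uses exactly this uniform threshold is not stated on the page, so `classify` should be read as one plausible interpretation.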

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
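As a small worked example of the unit conversion, the snippet below converts the AlaAla entry of the table (2.676‰) to a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


uniform_fraction = 1 / 400                      # 0.0025, i.e. 0.25 % or 2.5 per mille
alaala_fraction = per_mille_to_fraction(2.676)  # AlaAla value taken from the table

print(f"AlaAla: {alaala_fraction:.6f} (uniform expectation: {uniform_fraction:.6f})")
print("above expectation" if alaala_fraction > uniform_fraction else "not above expectation")
```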

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.676 ± 0.746 | 1.606 ± 0.567 | 2.676 ± 0.74 | 1.873 ± 0.711 | 1.07 ± 0.415 | 3.211 ± 1.093 | 0.535 ± 0.302 | 4.282 ± 1.632 | 2.676 ± 0.981 | 4.817 ± 1.799 | 0.803 ± 0.703 | 2.944 ± 1.056 | 2.944 ± 0.765 | 2.141 ± 0.607 | 2.944 ± 0.993 | 3.211 ± 1.976 | 2.408 ± 1.218 | 2.676 ± 1.336 | 0.535 ± 0.481 | 1.338 ± 0.628 | 0.0 ± 0.0
Cys | 0.803 ± 0.58 | 0.268 ± 0.151 | 0.268 ± 0.151 | 1.07 ± 0.387 | 0.803 ± 0.315 | 1.338 ± 0.602 | 1.338 ± 0.628 | 1.873 ± 0.691 | 1.338 ± 0.497 | 2.141 ± 1.207 | 0.268 ± 0.151 | 0.535 ± 0.302 | 0.803 ± 1.241 | 1.338 ± 0.651 | 1.873 ± 0.719 | 1.873 ± 0.698 | 0.535 ± 0.281 | 1.07 ± 0.387 | 0.803 ± 0.321 | 0.268 ± 0.429 | 0.0 ± 0.0
Asp | 2.944 ± 0.695 | 0.803 ± 0.452 | 2.676 ± 0.442 | 2.944 ± 1.381 | 3.746 ± 0.649 | 2.676 ± 1.072 | 1.606 ± 0.791 | 3.211 ± 1.001 | 2.408 ± 1.112 | 4.817 ± 1.367 | 0.803 ± 0.81 | 1.606 ± 0.624 | 3.211 ± 0.802 | 1.07 ± 0.976 | 2.676 ± 1.676 | 3.211 ± 1.076 | 2.141 ± 0.517 | 4.817 ± 1.598 | 0.535 ± 0.306 | 2.408 ± 0.439 | 0.0 ± 0.0
Glu | 3.211 ± 1.451 | 1.07 ± 0.34 | 4.014 ± 1.026 | 5.887 ± 1.159 | 2.141 ± 0.603 | 4.282 ± 0.961 | 1.07 ± 0.364 | 6.422 ± 1.341 | 4.817 ± 1.088 | 3.746 ± 0.703 | 1.338 ± 0.557 | 2.408 ± 0.878 | 1.606 ± 0.475 | 1.873 ± 0.698 | 5.084 ± 0.906 | 4.282 ± 0.964 | 3.211 ± 0.697 | 3.211 ± 0.811 | 1.873 ± 1.064 | 1.07 ± 0.645 | 0.0 ± 0.0
Phe | 2.141 ± 0.653 | 1.07 ± 0.615 | 2.944 ± 1.083 | 0.803 ± 0.452 | 1.873 ± 0.879 | 1.873 ± 0.726 | 0.803 ± 0.393 | 2.408 ± 0.944 | 4.549 ± 1.106 | 5.887 ± 1.263 | 0.803 ± 0.393 | 2.944 ± 0.776 | 3.479 ± 0.962 | 1.606 ± 0.867 | 1.606 ± 0.677 | 2.676 ± 0.646 | 1.606 ± 0.444 | 2.141 ± 0.676 | 1.07 ± 0.73 | 1.07 ± 0.34 | 0.0 ± 0.0
Gly | 1.873 ± 1.059 | 1.606 ± 0.697 | 4.014 ± 0.596 | 3.211 ± 1.017 | 3.479 ± 0.766 | 6.69 ± 1.313 | 1.606 ± 0.418 | 6.422 ± 0.896 | 4.549 ± 0.834 | 7.76 ± 0.995 | 2.676 ± 0.777 | 1.873 ± 0.695 | 3.211 ± 1.074 | 3.479 ± 0.791 | 4.014 ± 1.141 | 4.817 ± 0.974 | 2.944 ± 1.281 | 4.014 ± 1.076 | 1.338 ± 0.441 | 1.873 ± 0.69 | 0.0 ± 0.0
His | 0.803 ± 0.922 | 0.268 ± 0.365 | 1.338 ± 0.873 | 1.07 ± 0.387 | 0.803 ± 0.315 | 1.338 ± 0.497 | 0.535 ± 0.306 | 1.606 ± 0.409 | 1.873 ± 0.64 | 1.873 ± 0.704 | 0.803 ± 0.452 | 0.803 ± 0.452 | 2.141 ± 1.183 | 1.873 ± 1.309 | 1.606 ± 0.699 | 1.606 ± 0.403 | 0.803 ± 0.398 | 1.338 ± 0.628 | 0.535 ± 0.306 | 0.268 ± 0.151 | 0.0 ± 0.0
Ile | 3.746 ± 1.08 | 1.873 ± 0.8 | 2.408 ± 0.675 | 4.282 ± 1.226 | 3.746 ± 0.67 | 4.282 ± 1.144 | 1.606 ± 0.225 | 4.282 ± 0.842 | 6.155 ± 1.375 | 5.084 ± 0.91 | 2.141 ± 0.689 | 4.282 ± 1.033 | 4.817 ± 1.267 | 1.606 ± 0.582 | 5.619 ± 1.139 | 5.619 ± 1.017 | 4.282 ± 1.25 | 5.619 ± 1.753 | 0.803 ± 0.452 | 3.746 ± 0.639 | 0.0 ± 0.0
Lys | 1.606 ± 0.629 | 0.803 ± 0.315 | 3.746 ± 0.828 | 5.352 ± 1.644 | 2.676 ± 0.64 | 3.746 ± 1.025 | 0.268 ± 0.151 | 6.957 ± 1.682 | 4.282 ± 0.903 | 6.155 ± 1.71 | 1.338 ± 0.536 | 2.408 ± 0.808 | 1.07 ± 0.751 | 2.141 ± 0.539 | 3.746 ± 1.07 | 4.282 ± 1.785 | 4.549 ± 1.458 | 4.282 ± 0.821 | 0.803 ± 0.489 | 1.606 ± 0.911 | 0.0 ± 0.0
Leu | 6.422 ± 1.728 | 2.141 ± 0.603 | 4.282 ± 1.273 | 6.422 ± 1.402 | 3.746 ± 0.721 | 6.155 ± 1.452 | 1.873 ± 0.901 | 6.957 ± 1.098 | 6.69 ± 1.853 | 9.098 ± 2.571 | 2.676 ± 0.939 | 5.084 ± 1.158 | 2.944 ± 1.347 | 2.944 ± 0.61 | 4.817 ± 1.159 | 9.366 ± 2.1 | 7.225 ± 2.443 | 5.619 ± 1.28 | 0.535 ± 0.302 | 0.268 ± 0.151 | 0.0 ± 0.0
Met | 1.873 ± 0.69 | 0.268 ± 0.151 | 2.141 ± 0.753 | 2.141 ± 0.316 | 2.141 ± 0.716 | 1.606 ± 0.843 | 0.268 ± 0.365 | 2.944 ± 1.39 | 1.873 ± 1.056 | 1.606 ± 1.197 | 0.803 ± 0.552 | 1.07 ± 0.559 | 0.535 ± 0.281 | 0.535 ± 0.281 | 1.606 ± 0.543 | 1.873 ± 0.404 | 1.07 ± 0.628 | 1.07 ± 0.603 | 0.268 ± 0.365 | 0.535 ± 0.396 | 0.0 ± 0.0
Asn | 1.07 ± 0.465 | 0.268 ± 0.151 | 1.07 ± 0.415 | 0.535 ± 0.306 | 2.676 ± 0.726 | 3.746 ± 1.401 | 0.803 ± 0.452 | 2.944 ± 0.806 | 2.408 ± 0.664 | 5.084 ± 0.669 | 1.07 ± 0.5 | 2.408 ± 1.12 | 5.887 ± 1.333 | 1.606 ± 0.905 | 2.944 ± 1.141 | 3.211 ± 1.181 | 1.606 ± 0.629 | 3.479 ± 0.95 | 0.0 ± 0.0 | 0.803 ± 0.398 | 0.0 ± 0.0
Pro | 2.141 ± 0.887 | 2.141 ± 0.799 | 2.676 ± 0.569 | 4.014 ± 1.835 | 1.873 ± 0.743 | 4.282 ± 2.417 | 0.535 ± 0.302 | 2.676 ± 0.945 | 2.408 ± 0.827 | 5.084 ± 1.905 | 0.535 ± 0.515 | 2.141 ± 0.656 | 5.352 ± 5.206 | 1.338 ± 0.871 | 1.338 ± 1.014 | 5.084 ± 1.6 | 4.549 ± 1.188 | 2.676 ± 1.077 | 0.803 ± 0.315 | 1.338 ± 0.584 | 0.0 ± 0.0
Gln | 1.873 ± 0.695 | 0.268 ± 0.365 | 1.606 ± 0.785 | 2.944 ± 0.653 | 0.535 ± 0.281 | 2.141 ± 0.316 | 0.268 ± 0.41 | 2.676 ± 0.894 | 1.07 ± 0.611 | 3.479 ± 0.889 | 0.803 ± 0.568 | 2.944 ± 1.015 | 0.268 ± 0.41 | 1.338 ± 1.027 | 1.873 ± 0.688 | 2.944 ± 1.037 | 2.944 ± 1.149 | 1.873 ± 2.443 | 0.535 ± 0.302 | 0.268 ± 0.151 | 0.0 ± 0.0
Arg | 1.873 ± 0.807 | 0.803 ± 0.467 | 2.676 ± 0.651 | 5.084 ± 1.112 | 1.338 ± 0.511 | 5.084 ± 1.658 | 1.873 ± 1.324 | 2.944 ± 1.44 | 1.606 ± 0.629 | 4.014 ± 0.848 | 1.873 ± 0.522 | 3.211 ± 1.218 | 2.141 ± 0.817 | 1.873 ± 0.551 | 2.944 ± 1.349 | 4.817 ± 1.303 | 4.549 ± 1.309 | 4.549 ± 2.085 | 1.07 ± 0.574 | 3.211 ± 0.586 | 0.0 ± 0.0
Ser | 4.549 ± 1.051 | 1.338 ± 0.894 | 4.549 ± 1.29 | 5.352 ± 0.55 | 3.211 ± 0.727 | 6.957 ± 1.463 | 2.944 ± 0.734 | 5.084 ± 1.031 | 4.014 ± 1.16 | 10.169 ± 1.586 | 1.873 ± 0.906 | 1.873 ± 0.404 | 3.479 ± 2.185 | 1.338 ± 0.608 | 4.549 ± 0.915 | 5.352 ± 1.636 | 3.746 ± 0.694 | 2.141 ± 1.003 | 1.338 ± 0.56 | 1.873 ± 1.182 | 0.0 ± 0.0
Thr | 2.141 ± 0.967 | 0.268 ± 0.151 | 1.338 ± 0.693 | 3.746 ± 0.829 | 3.479 ± 0.871 | 5.887 ± 1.055 | 2.676 ± 1.07 | 3.211 ± 0.827 | 2.408 ± 0.66 | 5.619 ± 0.817 | 1.873 ± 0.759 | 1.873 ± 0.726 | 4.817 ± 1.733 | 1.873 ± 0.485 | 2.676 ± 1.256 | 5.352 ± 1.269 | 4.014 ± 0.754 | 2.676 ± 1.084 | 0.535 ± 0.302 | 1.606 ± 0.418 | 0.0 ± 0.0
Val | 2.944 ± 1.16 | 2.676 ± 0.856 | 2.944 ± 1.154 | 3.211 ± 1.116 | 2.408 ± 1.18 | 2.944 ± 0.813 | 1.873 ± 0.985 | 5.352 ± 1.253 | 3.479 ± 0.994 | 4.549 ± 1.421 | 1.606 ± 0.656 | 1.338 ± 0.602 | 2.676 ± 0.526 | 2.408 ± 0.835 | 2.676 ± 1.154 | 4.549 ± 1.445 | 4.282 ± 0.85 | 1.873 ± 0.522 | 0.535 ± 0.281 | 3.479 ± 1.007 | 0.0 ± 0.0
Trp | 0.535 ± 0.488 | 0.268 ± 0.365 | 0.535 ± 0.821 | 1.07 ± 0.441 | 0.268 ± 0.151 | 0.535 ± 0.302 | 0.268 ± 0.317 | 2.141 ± 1.092 | 1.338 ± 0.542 | 0.803 ± 0.398 | 0.535 ± 0.306 | 1.07 ± 0.73 | 0.268 ± 0.151 | 0.0 ± 0.0 | 1.338 ± 0.754 | 0.0 ± 0.0 | 1.07 ± 0.415 | 1.606 ± 0.951 | 0.0 ± 0.0 | 0.535 ± 0.281 | 0.0 ± 0.0
Tyr | 2.141 ± 0.607 | 0.803 ± 0.321 | 2.676 ± 0.442 | 1.606 ± 0.629 | 1.338 ± 0.497 | 2.676 ± 1.204 | 0.535 ± 0.306 | 1.606 ± 1.158 | 1.873 ± 0.577 | 3.479 ± 0.426 | 1.338 ± 0.367 | 0.803 ± 0.315 | 1.338 ± 0.497 | 0.268 ± 0.41 | 1.606 ± 0.867 | 1.606 ± 0.471 | 0.535 ± 0.281 | 1.07 ± 0.415 | 0.268 ± 0.593 | 1.606 ± 1.312 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 7 proteins (3738 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
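Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are choices made for this sketch and are not confirmed by the page.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(resample, dipeptide))
    # Assumption: the spread is summarised by the sample standard deviation.
    return statistics.stdev(estimates)


toy_proteins = ["MKLLSA", "ALLK", "GGSLLT"]  # made-up sequences for illustration
point_estimate = dipeptide_frequency(toy_proteins, "LL")
error = bootstrap_error(toy_proteins, "LL")
print(f"LeuLeu: {point_estimate:.3f} ± {error:.3f} per mille")
```

With only a handful of proteins, as here and in a 7-protein proteome, each resample can differ substantially from the original set, which is consistent with some errors in the table being comparable in size to the values themselves.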

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski