Amino acid dipeptide frequency for Caprine arthritis encephalitis virus Ov496

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
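The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the counting convention (overlapping adjacent pairs counted within each protein, non-standard letters mapped to X for Xaa, values exactly equal to the expectation left unmarked) are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not stated on the page): pairs are overlapping, counted
    within each protein only, and any non-standard letter becomes 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Mimic the table colouring: red if above expectation, blue if below."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "neutral"  # hypothetical label for an exact tie


# Toy input, not real viral proteins.
freqs = dipeptide_per_mille(["AAAA", "ACA"])
print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(5.437)` (AlaAla) would fall in the red group and `classify(0.68)` (AlaAsp) in the blue group under this uniform expectation.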

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
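As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and expresses it relative to the uniform expectation of 2.5‰. The helper names are hypothetical, and the input value (11.553‰ for GluGlu) is taken from the table.

```python
UNIFORM_PER_MILLE = 2.5  # 1/400 expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_over_uniform(value, expected=UNIFORM_PER_MILLE):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / expected


glu_glu = 11.553  # GluGlu frequency from the table, in per mille
print(per_mille_to_fraction(glu_glu))  # roughly 0.0116
print(per_mille_to_percent(glu_glu))   # roughly 1.16 (%)
print(fold_over_uniform(glu_glu))      # roughly 4.6 times the uniform expectation
```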

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.437AlaAla: 5.437 ± 0.976
1.019AlaCys: 1.019 ± 0.675
0.68AlaAsp: 0.68 ± 0.239
4.417AlaGlu: 4.417 ± 0.836
1.019AlaPhe: 1.019 ± 0.591
4.077AlaGly: 4.077 ± 0.64
1.019AlaHis: 1.019 ± 0.526
3.738AlaIle: 3.738 ± 1.036
4.417AlaLys: 4.417 ± 1.233
4.417AlaLeu: 4.417 ± 0.593
2.379AlaMet: 2.379 ± 1.207
2.039AlaAsn: 2.039 ± 0.535
2.379AlaPro: 2.379 ± 0.517
5.097AlaGln: 5.097 ± 0.783
4.077AlaArg: 4.077 ± 1.169
1.359AlaSer: 1.359 ± 0.736
2.718AlaThr: 2.718 ± 0.86
2.379AlaVal: 2.379 ± 0.62
2.718AlaTrp: 2.718 ± 1.525
2.039AlaTyr: 2.039 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
0.68CysAla: 0.68 ± 0.531
0.34CysCys: 0.34 ± 0.36
0.0CysAsp: 0.0 ± 0.0
1.019CysGlu: 1.019 ± 0.452
0.34CysPhe: 0.34 ± 0.242
1.699CysGly: 1.699 ± 0.752
0.34CysHis: 0.34 ± 0.36
1.019CysIle: 1.019 ± 0.274
1.359CysLys: 1.359 ± 0.432
1.359CysLeu: 1.359 ± 1.018
0.68CysMet: 0.68 ± 0.239
1.359CysAsn: 1.359 ± 0.743
0.34CysPro: 0.34 ± 0.242
2.039CysGln: 2.039 ± 0.87
2.718CysArg: 2.718 ± 1.434
2.039CysSer: 2.039 ± 1.321
1.359CysThr: 1.359 ± 1.09
2.039CysVal: 2.039 ± 1.375
1.019CysTrp: 1.019 ± 0.652
1.019CysTyr: 1.019 ± 0.526
0.0CysXaa: 0.0 ± 0.0
Asp
2.379AspAla: 2.379 ± 1.243
1.699AspCys: 1.699 ± 0.802
0.68AspAsp: 0.68 ± 0.239
1.699AspGlu: 1.699 ± 0.683
2.039AspPhe: 2.039 ± 0.987
2.039AspGly: 2.039 ± 0.535
0.34AspHis: 0.34 ± 0.242
3.398AspIle: 3.398 ± 1.336
2.039AspLys: 2.039 ± 1.029
1.699AspLeu: 1.699 ± 0.578
2.039AspMet: 2.039 ± 1.041
1.699AspAsn: 1.699 ± 0.588
1.019AspPro: 1.019 ± 0.427
0.68AspGln: 0.68 ± 0.72
3.738AspArg: 3.738 ± 0.696
2.039AspSer: 2.039 ± 0.502
1.699AspThr: 1.699 ± 1.226
1.359AspVal: 1.359 ± 0.77
2.379AspTrp: 2.379 ± 0.653
2.039AspTyr: 2.039 ± 0.618
0.0AspXaa: 0.0 ± 0.0
Glu
3.398GluAla: 3.398 ± 0.574
0.68GluCys: 0.68 ± 0.239
4.417GluAsp: 4.417 ± 0.795
11.553GluGlu: 11.553 ± 2.235
1.359GluPhe: 1.359 ± 0.752
6.116GluGly: 6.116 ± 2.514
1.699GluHis: 1.699 ± 0.671
4.077GluIle: 4.077 ± 1.162
7.475GluLys: 7.475 ± 2.483
6.796GluLeu: 6.796 ± 1.55
1.359GluMet: 1.359 ± 0.836
3.058GluAsn: 3.058 ± 1.131
3.398GluPro: 3.398 ± 1.075
2.718GluGln: 2.718 ± 1.283
5.776GluArg: 5.776 ± 1.525
4.417GluSer: 4.417 ± 0.868
3.398GluThr: 3.398 ± 0.75
4.417GluVal: 4.417 ± 0.557
1.019GluTrp: 1.019 ± 0.427
2.379GluTyr: 2.379 ± 1.393
0.0GluXaa: 0.0 ± 0.0
Phe
2.039PheAla: 2.039 ± 0.535
1.019PheCys: 1.019 ± 0.396
1.019PheAsp: 1.019 ± 0.427
1.699PheGlu: 1.699 ± 0.605
0.0PhePhe: 0.0 ± 0.0
1.359PheGly: 1.359 ± 0.611
0.0PheHis: 0.0 ± 0.0
1.359PheIle: 1.359 ± 0.966
1.019PheLys: 1.019 ± 0.274
1.699PheLeu: 1.699 ± 0.903
0.68PheMet: 0.68 ± 0.368
1.019PheAsn: 1.019 ± 0.591
1.019PhePro: 1.019 ± 0.574
1.359PheGln: 1.359 ± 0.432
1.019PheArg: 1.019 ± 0.597
0.68PheSer: 0.68 ± 0.239
2.039PheThr: 2.039 ± 1.075
0.68PheVal: 0.68 ± 0.513
1.359PheTrp: 1.359 ± 0.756
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.417GlyAla: 4.417 ± 1.233
2.718GlyCys: 2.718 ± 1.538
2.039GlyAsp: 2.039 ± 0.501
5.776GlyGlu: 5.776 ± 0.886
3.398GlyPhe: 3.398 ± 0.403
5.097GlyGly: 5.097 ± 1.198
2.379GlyHis: 2.379 ± 0.596
8.835GlyIle: 8.835 ± 1.935
7.475GlyLys: 7.475 ± 0.878
4.757GlyLeu: 4.757 ± 1.526
2.379GlyMet: 2.379 ± 0.638
6.116GlyAsn: 6.116 ± 2.368
3.058GlyPro: 3.058 ± 1.545
2.718GlyGln: 2.718 ± 1.051
3.738GlyArg: 3.738 ± 1.513
3.398GlySer: 3.398 ± 0.921
3.738GlyThr: 3.738 ± 0.941
2.039GlyVal: 2.039 ± 0.578
1.019GlyTrp: 1.019 ± 0.514
1.359GlyTyr: 1.359 ± 0.611
0.0GlyXaa: 0.0 ± 0.0
His
0.34HisAla: 0.34 ± 0.242
0.0HisCys: 0.0 ± 0.0
1.019HisAsp: 1.019 ± 0.396
0.68HisGlu: 0.68 ± 0.483
0.34HisPhe: 0.34 ± 0.535
0.68HisGly: 0.68 ± 0.38
0.34HisHis: 0.34 ± 0.272
1.359HisIle: 1.359 ± 0.5
1.359HisLys: 1.359 ± 0.363
1.699HisLeu: 1.699 ± 0.367
2.039HisMet: 2.039 ± 0.678
1.019HisAsn: 1.019 ± 0.538
2.039HisPro: 2.039 ± 1.246
1.359HisGln: 1.359 ± 0.478
2.039HisArg: 2.039 ± 0.618
0.0HisSer: 0.0 ± 0.0
1.699HisThr: 1.699 ± 1.226
1.359HisVal: 1.359 ± 0.611
2.039HisTrp: 2.039 ± 0.501
1.359HisTyr: 1.359 ± 0.478
0.0HisXaa: 0.0 ± 0.0
Ile
3.058IleAla: 3.058 ± 1.428
1.699IleCys: 1.699 ± 0.802
3.398IleAsp: 3.398 ± 0.871
2.718IleGlu: 2.718 ± 1.199
1.019IlePhe: 1.019 ± 0.396
5.437IleGly: 5.437 ± 0.96
1.699IleHis: 1.699 ± 0.608
4.417IleIle: 4.417 ± 1.421
5.097IleLys: 5.097 ± 1.348
5.437IleLeu: 5.437 ± 1.489
2.718IleMet: 2.718 ± 1.017
4.077IleAsn: 4.077 ± 1.456
6.116IlePro: 6.116 ± 2.456
3.738IleGln: 3.738 ± 0.494
4.417IleArg: 4.417 ± 1.189
1.699IleSer: 1.699 ± 0.617
4.077IleThr: 4.077 ± 1.44
4.417IleVal: 4.417 ± 1.3
1.019IleTrp: 1.019 ± 0.649
2.039IleTyr: 2.039 ± 1.45
0.0IleXaa: 0.0 ± 0.0
Lys
4.077LysAla: 4.077 ± 0.67
0.68LysCys: 0.68 ± 0.368
4.417LysAsp: 4.417 ± 0.888
7.475LysGlu: 7.475 ± 1.305
3.058LysPhe: 3.058 ± 1.107
5.437LysGly: 5.437 ± 2.076
3.058LysHis: 3.058 ± 1.047
4.077LysIle: 4.077 ± 0.917
7.136LysLys: 7.136 ± 1.273
6.116LysLeu: 6.116 ± 1.563
2.039LysMet: 2.039 ± 1.242
3.398LysAsn: 3.398 ± 1.195
2.039LysPro: 2.039 ± 0.722
2.379LysGln: 2.379 ± 0.914
6.456LysArg: 6.456 ± 1.028
4.077LysSer: 4.077 ± 2.281
1.359LysThr: 1.359 ± 0.594
4.757LysVal: 4.757 ± 1.12
4.417LysTrp: 4.417 ± 1.151
3.398LysTyr: 3.398 ± 0.975
0.0LysXaa: 0.0 ± 0.0
Leu
5.097LeuAla: 5.097 ± 1.17
1.019LeuCys: 1.019 ± 0.773
3.058LeuAsp: 3.058 ± 0.998
6.456LeuGlu: 6.456 ± 1.244
0.68LeuPhe: 0.68 ± 0.483
6.116LeuGly: 6.116 ± 1.254
1.699LeuHis: 1.699 ± 0.746
2.718LeuIle: 2.718 ± 0.73
5.437LeuLys: 5.437 ± 1.141
8.495LeuLeu: 8.495 ± 1.294
2.039LeuMet: 2.039 ± 1.073
2.039LeuAsn: 2.039 ± 0.535
4.077LeuPro: 4.077 ± 1.21
6.796LeuGln: 6.796 ± 1.617
4.417LeuArg: 4.417 ± 0.947
1.699LeuSer: 1.699 ± 0.552
4.757LeuThr: 4.757 ± 0.977
4.077LeuVal: 4.077 ± 0.728
3.398LeuTrp: 3.398 ± 1.016
1.359LeuTyr: 1.359 ± 0.707
0.0LeuXaa: 0.0 ± 0.0
Met
2.039MetAla: 2.039 ± 0.927
0.34MetCys: 0.34 ± 0.36
2.379MetAsp: 2.379 ± 1.038
4.077MetGlu: 4.077 ± 1.246
0.68MetPhe: 0.68 ± 0.72
3.398MetGly: 3.398 ± 1.113
0.34MetHis: 0.34 ± 0.272
1.019MetIle: 1.019 ± 0.452
1.699MetLys: 1.699 ± 0.671
2.039MetLeu: 2.039 ± 0.738
1.019MetMet: 1.019 ± 0.538
1.019MetAsn: 1.019 ± 0.514
1.359MetPro: 1.359 ± 0.392
4.417MetGln: 4.417 ± 1.608
0.0MetArg: 0.0 ± 0.0
1.019MetSer: 1.019 ± 0.649
1.019MetThr: 1.019 ± 0.452
1.359MetVal: 1.359 ± 0.392
1.019MetTrp: 1.019 ± 0.773
0.34MetTyr: 0.34 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
2.379AsnAla: 2.379 ± 1.546
3.398AsnCys: 3.398 ± 1.835
0.0AsnAsp: 0.0 ± 0.0
3.058AsnGlu: 3.058 ± 1.271
0.34AsnPhe: 0.34 ± 0.272
3.738AsnGly: 3.738 ± 1.746
0.68AsnHis: 0.68 ± 0.239
4.077AsnIle: 4.077 ± 1.44
5.776AsnLys: 5.776 ± 1.786
3.738AsnLeu: 3.738 ± 1.01
1.359AsnMet: 1.359 ± 0.832
1.699AsnAsn: 1.699 ± 0.988
2.379AsnPro: 2.379 ± 0.875
1.699AsnGln: 1.699 ± 0.988
1.699AsnArg: 1.699 ± 0.804
2.718AsnSer: 2.718 ± 0.894
1.699AsnThr: 1.699 ± 0.617
1.699AsnVal: 1.699 ± 0.67
1.699AsnTrp: 1.699 ± 1.13
0.34AsnTyr: 0.34 ± 0.242
0.0AsnXaa: 0.0 ± 0.0
Pro
2.039ProAla: 2.039 ± 0.704
0.68ProCys: 0.68 ± 0.538
0.68ProAsp: 0.68 ± 0.239
4.077ProGlu: 4.077 ± 0.873
0.34ProPhe: 0.34 ± 0.272
3.058ProGly: 3.058 ± 1.066
1.699ProHis: 1.699 ± 0.608
3.738ProIle: 3.738 ± 1.0
2.039ProLys: 2.039 ± 1.523
4.757ProLeu: 4.757 ± 1.709
1.359ProMet: 1.359 ± 0.761
2.039ProAsn: 2.039 ± 0.778
3.738ProPro: 3.738 ± 1.374
4.077ProGln: 4.077 ± 1.08
1.699ProArg: 1.699 ± 0.438
2.379ProSer: 2.379 ± 0.493
2.039ProThr: 2.039 ± 1.075
1.699ProVal: 1.699 ± 0.584
2.718ProTrp: 2.718 ± 0.736
2.039ProTyr: 2.039 ± 0.535
0.0ProXaa: 0.0 ± 0.0
Gln
5.097GlnAla: 5.097 ± 3.458
1.019GlnCys: 1.019 ± 0.591
2.039GlnAsp: 2.039 ± 0.721
4.417GlnGlu: 4.417 ± 0.867
1.699GlnPhe: 1.699 ± 0.608
4.417GlnGly: 4.417 ± 0.606
1.699GlnHis: 1.699 ± 0.552
2.039GlnIle: 2.039 ± 0.548
6.116GlnLys: 6.116 ± 1.342
4.417GlnLeu: 4.417 ± 1.364
0.68GlnMet: 0.68 ± 0.368
1.019GlnAsn: 1.019 ± 0.427
1.699GlnPro: 1.699 ± 0.912
5.437GlnGln: 5.437 ± 1.179
3.058GlnArg: 3.058 ± 1.597
2.718GlnSer: 2.718 ± 1.235
2.379GlnThr: 2.379 ± 0.861
4.757GlnVal: 4.757 ± 0.928
3.058GlnTrp: 3.058 ± 0.461
2.379GlnTyr: 2.379 ± 1.136
0.0GlnXaa: 0.0 ± 0.0
Arg
6.116ArgAla: 6.116 ± 1.483
0.68ArgCys: 0.68 ± 0.368
3.058ArgAsp: 3.058 ± 1.738
5.776ArgGlu: 5.776 ± 2.138
0.68ArgPhe: 0.68 ± 0.545
6.796ArgGly: 6.796 ± 1.72
2.039ArgHis: 2.039 ± 1.25
5.097ArgIle: 5.097 ± 1.392
4.757ArgLys: 4.757 ± 1.044
1.699ArgLeu: 1.699 ± 1.242
0.68ArgMet: 0.68 ± 0.239
3.058ArgAsn: 3.058 ± 0.625
2.039ArgPro: 2.039 ± 1.006
4.077ArgGln: 4.077 ± 1.494
4.757ArgArg: 4.757 ± 1.716
1.359ArgSer: 1.359 ± 0.392
3.058ArgThr: 3.058 ± 0.825
4.417ArgVal: 4.417 ± 1.555
2.379ArgTrp: 2.379 ± 0.548
1.699ArgTyr: 1.699 ± 0.608
0.0ArgXaa: 0.0 ± 0.0
Ser
2.379SerAla: 2.379 ± 0.62
1.699SerCys: 1.699 ± 0.932
2.379SerAsp: 2.379 ± 0.493
2.718SerGlu: 2.718 ± 0.786
0.68SerPhe: 0.68 ± 0.513
4.417SerGly: 4.417 ± 1.064
0.34SerHis: 0.34 ± 0.272
3.058SerIle: 3.058 ± 0.982
1.699SerLys: 1.699 ± 0.432
3.058SerLeu: 3.058 ± 0.372
1.699SerMet: 1.699 ± 0.742
2.718SerAsn: 2.718 ± 0.581
2.379SerPro: 2.379 ± 1.313
1.019SerGln: 1.019 ± 0.396
1.359SerArg: 1.359 ± 0.554
2.039SerSer: 2.039 ± 0.87
2.039SerThr: 2.039 ± 1.003
2.039SerVal: 2.039 ± 1.051
1.359SerTrp: 1.359 ± 0.636
1.359SerTyr: 1.359 ± 0.693
0.0SerXaa: 0.0 ± 0.0
Thr
1.019ThrAla: 1.019 ± 0.396
2.039ThrCys: 2.039 ± 0.717
1.019ThrAsp: 1.019 ± 0.396
3.058ThrGlu: 3.058 ± 1.107
0.34ThrPhe: 0.34 ± 0.535
4.757ThrGly: 4.757 ± 0.967
0.68ThrHis: 0.68 ± 0.513
3.738ThrIle: 3.738 ± 0.725
3.398ThrLys: 3.398 ± 0.538
5.776ThrLeu: 5.776 ± 0.729
2.718ThrMet: 2.718 ± 0.8
3.058ThrAsn: 3.058 ± 0.72
1.359ThrPro: 1.359 ± 0.432
3.058ThrGln: 3.058 ± 1.107
4.417ThrArg: 4.417 ± 1.289
2.039ThrSer: 2.039 ± 0.496
2.718ThrThr: 2.718 ± 1.041
2.379ThrVal: 2.379 ± 0.96
3.398ThrTrp: 3.398 ± 0.719
1.699ThrTyr: 1.699 ± 0.971
0.0ThrXaa: 0.0 ± 0.0
Val
2.718ValAla: 2.718 ± 1.017
0.34ValCys: 0.34 ± 0.272
2.379ValAsp: 2.379 ± 0.985
3.738ValGlu: 3.738 ± 0.817
1.699ValPhe: 1.699 ± 0.367
3.058ValGly: 3.058 ± 0.598
0.68ValHis: 0.68 ± 0.483
5.097ValIle: 5.097 ± 0.712
5.097ValLys: 5.097 ± 0.407
3.058ValLeu: 3.058 ± 0.936
1.019ValMet: 1.019 ± 0.615
0.68ValAsn: 0.68 ± 0.513
2.379ValPro: 2.379 ± 0.517
2.379ValGln: 2.379 ± 0.679
3.738ValArg: 3.738 ± 0.87
2.718ValSer: 2.718 ± 1.077
4.417ValThr: 4.417 ± 1.621
2.379ValVal: 2.379 ± 1.243
1.699ValTrp: 1.699 ± 0.438
1.699ValTyr: 1.699 ± 0.58
0.0ValXaa: 0.0 ± 0.0
Trp
0.68TrpAla: 0.68 ± 0.513
1.359TrpCys: 1.359 ± 0.636
1.359TrpAsp: 1.359 ± 0.447
3.738TrpGlu: 3.738 ± 1.445
1.019TrpPhe: 1.019 ± 0.427
2.379TrpGly: 2.379 ± 0.937
1.019TrpHis: 1.019 ± 0.396
2.718TrpIle: 2.718 ± 0.728
4.757TrpLys: 4.757 ± 1.932
2.039TrpLeu: 2.039 ± 1.039
0.68TrpMet: 0.68 ± 0.538
1.359TrpAsn: 1.359 ± 0.478
1.359TrpPro: 1.359 ± 0.707
2.379TrpGln: 2.379 ± 0.775
3.738TrpArg: 3.738 ± 1.31
1.359TrpSer: 1.359 ± 0.5
3.738TrpThr: 3.738 ± 1.291
2.379TrpVal: 2.379 ± 0.387
0.34TrpTrp: 0.34 ± 0.535
0.68TrpTyr: 0.68 ± 0.531
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.039TyrAla: 2.039 ± 0.482
0.34TyrCys: 0.34 ± 0.272
0.68TyrAsp: 0.68 ± 0.239
1.359TyrGlu: 1.359 ± 0.621
0.34TyrPhe: 0.34 ± 0.242
2.379TyrGly: 2.379 ± 0.562
1.019TyrHis: 1.019 ± 0.817
2.718TyrIle: 2.718 ± 0.956
2.039TyrLys: 2.039 ± 0.855
2.379TyrLeu: 2.379 ± 0.317
0.68TyrMet: 0.68 ± 0.38
1.699TyrAsn: 1.699 ± 0.552
2.718TyrPro: 2.718 ± 1.106
2.718TyrGln: 2.718 ± 0.956
1.359TyrArg: 1.359 ± 0.478
0.68TyrSer: 0.68 ± 0.566
2.718TyrThr: 2.718 ± 0.736
0.34TyrVal: 0.34 ± 0.535
1.019TyrTrp: 1.019 ± 0.396
1.359TyrTyr: 1.359 ± 0.725
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2944 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
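A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions rather than the database's procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the reported error is assumed to be the standard deviation across replicates. The function names, the seed parameter, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement; the page does not say which spread measure it reports.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the six proteins of this proteome.
toy = ["AAAACA", "ACACAC", "CCAACC", "AACCAA"]
print(pair_per_mille(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

A dipeptide that is absent from every protein gives 0.0 in every replicate, which is consistent with the `0.0 ± 0.0` entries in the table.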

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski