Amino acid dipepetide frequency for Veterinary Pathology Zurich virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.321AlaAla: 1.321 ± 0.697
0.33AlaCys: 0.33 ± 0.162
2.313AlaAsp: 2.313 ± 0.231
3.304AlaGlu: 3.304 ± 0.255
1.321AlaPhe: 1.321 ± 0.806
1.982AlaGly: 1.982 ± 0.628
0.991AlaHis: 0.991 ± 0.486
3.304AlaIle: 3.304 ± 0.917
4.625AlaLys: 4.625 ± 1.972
3.634AlaLeu: 3.634 ± 2.091
0.661AlaMet: 0.661 ± 0.324
2.643AlaAsn: 2.643 ± 1.109
0.661AlaPro: 0.661 ± 0.349
1.321AlaGln: 1.321 ± 0.697
0.33AlaArg: 0.33 ± 0.162
2.973AlaSer: 2.973 ± 0.942
2.313AlaThr: 2.313 ± 0.508
1.652AlaVal: 1.652 ± 0.629
0.33AlaTrp: 0.33 ± 0.162
1.652AlaTyr: 1.652 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
1.652CysAla: 1.652 ± 0.643
0.0CysCys: 0.0 ± 0.0
0.661CysAsp: 0.661 ± 0.596
0.991CysGlu: 0.991 ± 0.486
1.321CysPhe: 1.321 ± 0.648
0.33CysGly: 0.33 ± 0.475
0.0CysHis: 0.0 ± 0.0
0.991CysIle: 0.991 ± 0.389
3.304CysLys: 3.304 ± 1.259
2.313CysLeu: 2.313 ± 0.795
0.661CysMet: 0.661 ± 0.324
1.321CysAsn: 1.321 ± 0.277
0.33CysPro: 0.33 ± 0.475
0.991CysGln: 0.991 ± 0.389
0.661CysArg: 0.661 ± 0.951
2.973CysSer: 2.973 ± 1.454
0.661CysThr: 0.661 ± 0.596
1.321CysVal: 1.321 ± 0.358
0.661CysTrp: 0.661 ± 0.349
0.33CysTyr: 0.33 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
2.643AspAla: 2.643 ± 0.716
0.661AspCys: 0.661 ± 0.596
3.304AspAsp: 3.304 ± 0.961
1.982AspGlu: 1.982 ± 0.587
1.652AspPhe: 1.652 ± 0.775
1.982AspGly: 1.982 ± 0.092
0.991AspHis: 0.991 ± 0.486
4.955AspIle: 4.955 ± 1.245
2.973AspLys: 2.973 ± 1.032
5.286AspLeu: 5.286 ± 1.039
0.991AspMet: 0.991 ± 0.453
3.304AspAsn: 3.304 ± 0.255
2.313AspPro: 2.313 ± 0.986
3.304AspGln: 3.304 ± 0.529
1.652AspArg: 1.652 ± 0.128
3.964AspSer: 3.964 ± 0.622
1.982AspThr: 1.982 ± 0.658
2.973AspVal: 2.973 ± 1.41
0.991AspTrp: 0.991 ± 0.389
2.643AspTyr: 2.643 ± 0.95
0.0AspXaa: 0.0 ± 0.0
Glu
3.304GluAla: 3.304 ± 0.707
1.652GluCys: 1.652 ± 0.535
5.286GluAsp: 5.286 ± 0.904
4.295GluGlu: 4.295 ± 1.193
3.634GluPhe: 3.634 ± 0.627
3.964GluGly: 3.964 ± 1.256
0.991GluHis: 0.991 ± 0.435
7.268GluIle: 7.268 ± 2.083
8.92GluLys: 8.92 ± 1.643
9.58GluLeu: 9.58 ± 1.418
0.991GluMet: 0.991 ± 0.486
5.946GluAsn: 5.946 ± 0.633
2.313GluPro: 2.313 ± 0.729
1.652GluGln: 1.652 ± 0.459
3.304GluArg: 3.304 ± 1.187
4.955GluSer: 4.955 ± 0.21
3.964GluThr: 3.964 ± 0.622
3.964GluVal: 3.964 ± 0.604
0.661GluTrp: 0.661 ± 0.324
1.982GluTyr: 1.982 ± 0.777
0.0GluXaa: 0.0 ± 0.0
Phe
1.652PheAla: 1.652 ± 0.81
0.661PheCys: 0.661 ± 0.596
2.313PheAsp: 2.313 ± 0.231
2.643PheGlu: 2.643 ± 0.94
2.313PhePhe: 2.313 ± 0.711
1.652PheGly: 1.652 ± 0.535
1.321PheHis: 1.321 ± 0.781
3.634PheIle: 3.634 ± 1.959
3.304PheLys: 3.304 ± 0.333
5.286PheLeu: 5.286 ± 0.637
0.661PheMet: 0.661 ± 0.324
4.295PheAsn: 4.295 ± 1.712
1.321PhePro: 1.321 ± 0.648
3.634PheGln: 3.634 ± 0.46
0.991PheArg: 0.991 ± 0.781
1.982PheSer: 1.982 ± 0.628
5.616PheThr: 5.616 ± 0.718
2.643PheVal: 2.643 ± 0.462
0.661PheTrp: 0.661 ± 0.403
2.643PheTyr: 2.643 ± 0.388
0.0PheXaa: 0.0 ± 0.0
Gly
0.991GlyAla: 0.991 ± 0.866
0.991GlyCys: 0.991 ± 0.435
1.652GlyAsp: 1.652 ± 1.126
1.652GlyGlu: 1.652 ± 0.775
2.643GlyPhe: 2.643 ± 0.26
1.652GlyGly: 1.652 ± 0.629
0.661GlyHis: 0.661 ± 0.403
2.643GlyIle: 2.643 ± 0.26
4.295GlyLys: 4.295 ± 1.193
3.964GlyLeu: 3.964 ± 0.83
0.661GlyMet: 0.661 ± 0.349
1.321GlyAsn: 1.321 ± 0.358
1.982GlyPro: 1.982 ± 0.092
1.321GlyGln: 1.321 ± 0.648
1.321GlyArg: 1.321 ± 0.781
1.652GlySer: 1.652 ± 0.643
3.634GlyThr: 3.634 ± 1.261
1.321GlyVal: 1.321 ± 0.439
1.652GlyTrp: 1.652 ± 0.128
1.652GlyTyr: 1.652 ± 0.128
0.0GlyXaa: 0.0 ± 0.0
His
0.33HisAla: 0.33 ± 0.162
1.321HisCys: 1.321 ± 0.842
1.321HisAsp: 1.321 ± 0.842
0.661HisGlu: 0.661 ± 0.951
1.321HisPhe: 1.321 ± 0.358
1.321HisGly: 1.321 ± 0.842
0.33HisHis: 0.33 ± 0.475
0.0HisIle: 0.0 ± 0.0
1.321HisLys: 1.321 ± 0.697
3.304HisLeu: 3.304 ± 1.196
0.661HisMet: 0.661 ± 0.403
0.991HisAsn: 0.991 ± 0.435
0.991HisPro: 0.991 ± 0.486
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.313HisSer: 2.313 ± 1.472
0.33HisThr: 0.33 ± 0.475
0.991HisVal: 0.991 ± 0.486
0.0HisTrp: 0.0 ± 0.0
1.652HisTyr: 1.652 ± 1.264
0.0HisXaa: 0.0 ± 0.0
Ile
2.973IleAla: 2.973 ± 0.47
0.991IleCys: 0.991 ± 0.314
2.643IleAsp: 2.643 ± 0.554
6.938IleGlu: 6.938 ± 0.506
3.634IlePhe: 3.634 ± 1.111
2.313IleGly: 2.313 ± 0.353
1.321IleHis: 1.321 ± 0.697
5.946IleIle: 5.946 ± 0.836
7.929IleLys: 7.929 ± 2.178
9.58IleLeu: 9.58 ± 0.563
0.661IleMet: 0.661 ± 0.324
4.625IleAsn: 4.625 ± 1.262
2.313IlePro: 2.313 ± 1.134
2.643IleGln: 2.643 ± 1.296
4.295IleArg: 4.295 ± 1.193
4.295IleSer: 4.295 ± 0.262
4.295IleThr: 4.295 ± 0.675
4.295IleVal: 4.295 ± 1.236
0.991IleTrp: 0.991 ± 0.486
1.982IleTyr: 1.982 ± 0.658
0.0IleXaa: 0.0 ± 0.0
Lys
4.625LysAla: 4.625 ± 1.573
1.982LysCys: 1.982 ± 1.208
4.955LysAsp: 4.955 ± 0.874
9.58LysGlu: 9.58 ± 2.588
3.304LysPhe: 3.304 ± 0.917
3.964LysGly: 3.964 ± 0.622
2.313LysHis: 2.313 ± 0.508
5.946LysIle: 5.946 ± 1.628
8.92LysLys: 8.92 ± 0.753
14.536LysLeu: 14.536 ± 1.681
1.982LysMet: 1.982 ± 0.678
5.286LysAsn: 5.286 ± 1.226
1.652LysPro: 1.652 ± 0.459
3.634LysGln: 3.634 ± 1.261
3.304LysArg: 3.304 ± 1.196
7.268LysSer: 7.268 ± 1.5
6.277LysThr: 6.277 ± 0.567
2.643LysVal: 2.643 ± 0.796
1.982LysTrp: 1.982 ± 0.658
3.634LysTyr: 3.634 ± 0.72
0.0LysXaa: 0.0 ± 0.0
Leu
2.643LeuAla: 2.643 ± 0.26
3.964LeuCys: 3.964 ± 0.184
6.277LeuAsp: 6.277 ± 0.967
7.929LeuGlu: 7.929 ± 1.774
3.964LeuPhe: 3.964 ± 0.561
1.982LeuGly: 1.982 ± 0.658
1.982LeuHis: 1.982 ± 0.628
7.929LeuIle: 7.929 ± 0.674
13.214LeuLys: 13.214 ± 3.037
8.92LeuLeu: 8.92 ± 1.566
2.973LeuMet: 2.973 ± 0.47
6.938LeuAsn: 6.938 ± 1.109
2.643LeuPro: 2.643 ± 1.563
3.634LeuGln: 3.634 ± 0.46
6.277LeuArg: 6.277 ± 2.265
8.259LeuSer: 8.259 ± 0.676
7.929LeuThr: 7.929 ± 3.482
4.955LeuVal: 4.955 ± 2.115
0.991LeuTrp: 0.991 ± 0.389
1.982LeuTyr: 1.982 ± 0.587
0.0LeuXaa: 0.0 ± 0.0
Met
0.991MetAla: 0.991 ± 0.486
0.661MetCys: 0.661 ± 0.403
1.982MetAsp: 1.982 ± 0.587
1.321MetGlu: 1.321 ± 0.439
0.33MetPhe: 0.33 ± 0.162
0.991MetGly: 0.991 ± 0.389
0.33MetHis: 0.33 ± 0.475
0.661MetIle: 0.661 ± 0.403
2.313MetLys: 2.313 ± 0.729
1.321MetLeu: 1.321 ± 0.648
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.991MetPro: 0.991 ± 0.314
0.33MetGln: 0.33 ± 0.162
0.33MetArg: 0.33 ± 0.162
1.982MetSer: 1.982 ± 0.092
3.304MetThr: 3.304 ± 0.608
1.321MetVal: 1.321 ± 0.358
0.33MetTrp: 0.33 ± 0.444
0.33MetTyr: 0.33 ± 0.444
0.0MetXaa: 0.0 ± 0.0
Asn
3.304AsnAla: 3.304 ± 2.118
2.313AsnCys: 2.313 ± 0.813
3.634AsnAsp: 3.634 ± 0.87
5.286AsnGlu: 5.286 ± 1.107
2.643AsnPhe: 2.643 ± 0.877
2.643AsnGly: 2.643 ± 0.26
0.661AsnHis: 0.661 ± 0.403
4.625AsnIle: 4.625 ± 0.968
5.616AsnLys: 5.616 ± 0.935
8.92AsnLeu: 8.92 ± 1.241
1.321AsnMet: 1.321 ± 0.358
5.946AsnAsn: 5.946 ± 0.276
1.321AsnPro: 1.321 ± 0.358
3.964AsnGln: 3.964 ± 1.741
1.652AsnArg: 1.652 ± 0.629
2.973AsnSer: 2.973 ± 1.166
2.973AsnThr: 2.973 ± 1.032
1.982AsnVal: 1.982 ± 1.053
0.661AsnTrp: 0.661 ± 0.349
2.973AsnTyr: 2.973 ± 1.306
0.0AsnXaa: 0.0 ± 0.0
Pro
1.321ProAla: 1.321 ± 0.648
0.0ProCys: 0.0 ± 0.0
2.643ProAsp: 2.643 ± 0.796
1.982ProGlu: 1.982 ± 0.587
3.304ProPhe: 3.304 ± 1.146
0.991ProGly: 0.991 ± 0.389
1.321ProHis: 1.321 ± 0.277
1.982ProIle: 1.982 ± 0.587
1.321ProLys: 1.321 ± 0.781
1.982ProLeu: 1.982 ± 0.092
0.33ProMet: 0.33 ± 0.162
1.321ProAsn: 1.321 ± 1.222
0.33ProPro: 0.33 ± 0.162
0.991ProGln: 0.991 ± 0.486
0.33ProArg: 0.33 ± 0.162
2.313ProSer: 2.313 ± 1.134
1.982ProThr: 1.982 ± 0.628
1.652ProVal: 1.652 ± 0.459
0.0ProTrp: 0.0 ± 0.0
2.313ProTyr: 2.313 ± 0.795
0.0ProXaa: 0.0 ± 0.0
Gln
1.652GlnAla: 1.652 ± 0.128
0.991GlnCys: 0.991 ± 0.486
0.991GlnAsp: 0.991 ± 0.314
2.973GlnGlu: 2.973 ± 0.69
0.661GlnPhe: 0.661 ± 0.349
1.652GlnGly: 1.652 ± 0.128
0.991GlnHis: 0.991 ± 0.389
2.973GlnIle: 2.973 ± 1.032
3.964GlnLys: 3.964 ± 1.032
4.295GlnLeu: 4.295 ± 1.161
1.321GlnMet: 1.321 ± 0.277
3.964GlnAsn: 3.964 ± 1.194
0.661GlnPro: 0.661 ± 0.349
0.991GlnGln: 0.991 ± 0.486
2.313GlnArg: 2.313 ± 0.986
3.964GlnSer: 3.964 ± 1.194
1.982GlnThr: 1.982 ± 0.092
2.973GlnVal: 2.973 ± 0.251
0.0GlnTrp: 0.0 ± 0.0
0.991GlnTyr: 0.991 ± 0.314
0.0GlnXaa: 0.0 ± 0.0
Arg
0.661ArgAla: 0.661 ± 0.403
0.991ArgCys: 0.991 ± 0.781
0.991ArgAsp: 0.991 ± 0.781
4.295ArgGlu: 4.295 ± 1.272
2.643ArgPhe: 2.643 ± 0.716
1.982ArgGly: 1.982 ± 0.092
0.661ArgHis: 0.661 ± 0.951
2.313ArgIle: 2.313 ± 0.729
4.295ArgLys: 4.295 ± 1.4
2.643ArgLeu: 2.643 ± 0.26
1.982ArgMet: 1.982 ± 0.628
4.295ArgAsn: 4.295 ± 2.241
0.0ArgPro: 0.0 ± 0.0
0.991ArgGln: 0.991 ± 0.314
0.991ArgArg: 0.991 ± 0.435
1.321ArgSer: 1.321 ± 0.648
2.313ArgThr: 2.313 ± 0.711
1.321ArgVal: 1.321 ± 0.439
0.991ArgTrp: 0.991 ± 0.866
0.661ArgTyr: 0.661 ± 0.324
0.0ArgXaa: 0.0 ± 0.0
Ser
1.652SerAla: 1.652 ± 1.126
1.982SerCys: 1.982 ± 0.092
3.304SerAsp: 3.304 ± 1.374
6.277SerGlu: 6.277 ± 1.333
4.295SerPhe: 4.295 ± 0.88
2.313SerGly: 2.313 ± 0.353
1.321SerHis: 1.321 ± 0.439
8.92SerIle: 8.92 ± 1.184
6.938SerLys: 6.938 ± 2.198
6.607SerLeu: 6.607 ± 1.589
0.991SerMet: 0.991 ± 0.314
3.634SerAsn: 3.634 ± 0.627
0.661SerPro: 0.661 ± 0.349
3.634SerGln: 3.634 ± 0.988
3.304SerArg: 3.304 ± 0.917
8.259SerSer: 8.259 ± 1.226
3.304SerThr: 3.304 ± 1.423
4.295SerVal: 4.295 ± 0.88
0.33SerTrp: 0.33 ± 0.162
2.643SerTyr: 2.643 ± 0.94
0.0SerXaa: 0.0 ± 0.0
Thr
1.982ThrAla: 1.982 ± 0.092
0.33ThrCys: 0.33 ± 0.162
1.321ThrAsp: 1.321 ± 0.358
5.616ThrGlu: 5.616 ± 0.935
3.634ThrPhe: 3.634 ± 1.443
1.982ThrGly: 1.982 ± 1.623
0.991ThrHis: 0.991 ± 0.982
5.946ThrIle: 5.946 ± 1.407
7.268ThrLys: 7.268 ± 1.688
3.634ThrLeu: 3.634 ± 0.72
1.652ThrMet: 1.652 ± 0.629
3.304ThrAsn: 3.304 ± 1.259
2.313ThrPro: 2.313 ± 0.353
3.634ThrGln: 3.634 ± 1.111
1.321ThrArg: 1.321 ± 0.439
6.277ThrSer: 6.277 ± 0.967
6.938ThrThr: 6.938 ± 1.555
3.634ThrVal: 3.634 ± 0.72
0.33ThrTrp: 0.33 ± 0.475
3.304ThrTyr: 3.304 ± 0.707
0.0ThrXaa: 0.0 ± 0.0
Val
2.643ValAla: 2.643 ± 1.685
0.661ValCys: 0.661 ± 0.324
2.313ValAsp: 2.313 ± 0.231
6.277ValGlu: 6.277 ± 1.705
2.973ValPhe: 2.973 ± 0.548
1.652ValGly: 1.652 ± 0.459
0.661ValHis: 0.661 ± 0.403
1.652ValIle: 1.652 ± 0.629
2.643ValLys: 2.643 ± 0.796
3.964ValLeu: 3.964 ± 1.194
0.661ValMet: 0.661 ± 0.403
3.304ValAsn: 3.304 ± 1.374
2.313ValPro: 2.313 ± 0.231
1.652ValGln: 1.652 ± 0.712
2.973ValArg: 2.973 ± 0.69
3.634ValSer: 3.634 ± 0.562
3.304ValThr: 3.304 ± 0.862
2.313ValVal: 2.313 ± 0.711
0.991ValTrp: 0.991 ± 0.486
1.321ValTyr: 1.321 ± 0.358
0.0ValXaa: 0.0 ± 0.0
Trp
0.661TrpAla: 0.661 ± 0.324
0.0TrpCys: 0.0 ± 0.0
0.991TrpAsp: 0.991 ± 0.314
2.313TrpGlu: 2.313 ± 0.231
0.661TrpPhe: 0.661 ± 0.403
0.661TrpGly: 0.661 ± 0.324
0.0TrpHis: 0.0 ± 0.0
0.33TrpIle: 0.33 ± 0.162
1.982TrpLys: 1.982 ± 1.732
1.321TrpLeu: 1.321 ± 0.358
0.33TrpMet: 0.33 ± 0.162
0.991TrpAsn: 0.991 ± 0.314
0.33TrpPro: 0.33 ± 0.162
0.33TrpGln: 0.33 ± 0.162
0.33TrpArg: 0.33 ± 0.475
0.661TrpSer: 0.661 ± 0.324
0.33TrpThr: 0.33 ± 0.162
0.991TrpVal: 0.991 ± 0.389
0.33TrpTrp: 0.33 ± 0.475
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.661TyrAla: 0.661 ± 0.324
0.991TyrCys: 0.991 ± 0.937
1.652TyrAsp: 1.652 ± 0.459
2.313TyrGlu: 2.313 ± 0.353
2.973TyrPhe: 2.973 ± 0.908
1.652TyrGly: 1.652 ± 0.643
1.321TyrHis: 1.321 ± 0.842
2.643TyrIle: 2.643 ± 0.879
2.643TyrLys: 2.643 ± 0.94
4.295TyrLeu: 4.295 ± 0.901
0.0TyrMet: 0.0 ± 0.369
1.982TyrAsn: 1.982 ± 0.484
2.973TyrPro: 2.973 ± 1.166
1.652TyrGln: 1.652 ± 0.81
0.661TyrArg: 0.661 ± 0.324
2.643TyrSer: 2.643 ± 0.462
1.982TyrThr: 1.982 ± 1.208
0.661TyrVal: 0.661 ± 0.324
0.661TyrTrp: 0.661 ± 0.324
1.652TyrTyr: 1.652 ± 0.629
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3028 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski