Amino acid dipeptide frequency for Xinzhou nematode virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
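The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the toy sequences, and the choices to count overlapping pairs and to never form a pair across two proteins are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): pairs overlap, pairs never
    span two proteins, and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"  # 21 symbols, hence 441 possible pairs
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Hypothetical toy input, not the Xinzhou nematode virus 1 proteome.
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400
over_represented = sorted(d for d, f in freqs.items() if f > expected)
```

Comparing each value with `expected` (2.5‰) reproduces the over/under-represented split that the red and blue marking conveys.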

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as dipeptide: frequency ± error and grouped by the first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.197 ± 3.244
AlaCys: 0.266 ± 0.136
AlaAsp: 3.996 ± 0.899
AlaGlu: 2.397 ± 0.871
AlaPhe: 3.463 ± 0.265
AlaGly: 1.332 ± 1.017
AlaHis: 2.397 ± 0.898
AlaIle: 2.93 ± 0.694
AlaLys: 2.131 ± 0.644
AlaLeu: 6.393 ± 1.268
AlaMet: 1.066 ± 0.486
AlaAsn: 1.598 ± 0.305
AlaPro: 2.664 ± 1.12
AlaGln: 1.066 ± 0.486
AlaArg: 2.131 ± 0.837
AlaSer: 3.729 ± 1.227
AlaThr: 2.397 ± 1.436
AlaVal: 3.197 ± 1.967
AlaTrp: 0.266 ± 0.603
AlaTyr: 2.397 ± 0.716
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.598 ± 0.814
CysCys: 0.533 ± 0.271
CysAsp: 1.066 ± 0.543
CysGlu: 0.799 ± 0.239
CysPhe: 0.533 ± 0.271
CysGly: 0.533 ± 0.271
CysHis: 0.266 ± 0.136
CysIle: 0.266 ± 0.136
CysLys: 1.066 ± 0.234
CysLeu: 1.066 ± 0.543
CysMet: 0.533 ± 0.309
CysAsn: 0.533 ± 0.828
CysPro: 1.598 ± 0.402
CysGln: 0.533 ± 0.271
CysArg: 0.799 ± 0.407
CysSer: 0.799 ± 0.239
CysThr: 0.533 ± 0.271
CysVal: 1.332 ± 0.3
CysTrp: 0.266 ± 0.414
CysTyr: 1.066 ± 0.234
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.131 ± 0.469
AspCys: 0.799 ± 0.407
AspAsp: 3.996 ± 1.163
AspGlu: 2.397 ± 0.871
AspPhe: 2.93 ± 1.457
AspGly: 2.397 ± 0.871
AspHis: 2.131 ± 0.278
AspIle: 6.393 ± 0.866
AspLys: 2.397 ± 1.221
AspLeu: 7.459 ± 2.078
AspMet: 1.332 ± 0.3
AspAsn: 2.664 ± 0.618
AspPro: 3.197 ± 0.861
AspGln: 0.533 ± 0.309
AspArg: 4.262 ± 0.557
AspSer: 6.66 ± 0.81
AspThr: 3.463 ± 1.304
AspVal: 6.127 ± 3.12
AspTrp: 0.266 ± 0.414
AspTyr: 4.529 ± 1.392
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.332 ± 0.517
GluCys: 0.266 ± 0.136
GluAsp: 3.463 ± 1.242
GluGlu: 2.131 ± 1.085
GluPhe: 2.664 ± 0.685
GluGly: 2.131 ± 1.085
GluHis: 0.266 ± 0.136
GluIle: 5.061 ± 1.619
GluLys: 3.729 ± 0.439
GluLeu: 3.463 ± 1.476
GluMet: 0.266 ± 0.136
GluAsn: 4.795 ± 0.536
GluPro: 1.332 ± 0.678
GluGln: 1.598 ± 0.647
GluArg: 1.332 ± 0.678
GluSer: 3.463 ± 1.301
GluThr: 2.397 ± 0.353
GluVal: 1.332 ± 0.678
GluTrp: 0.266 ± 0.136
GluTyr: 2.397 ± 0.242
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.729 ± 2.263
PheCys: 0.799 ± 0.407
PheAsp: 2.93 ± 1.951
PheGlu: 3.463 ± 0.921
PhePhe: 3.729 ± 2.544
PheGly: 2.93 ± 0.491
PheHis: 0.799 ± 0.407
PheIle: 3.729 ± 1.728
PheLys: 5.061 ± 2.649
PheLeu: 8.524 ± 5.95
PheMet: 1.066 ± 0.506
PheAsn: 2.93 ± 1.43
PhePro: 1.598 ± 0.402
PheGln: 1.598 ± 0.477
PheArg: 2.131 ± 0.469
PheSer: 7.192 ± 1.688
PheThr: 2.664 ± 1.071
PheVal: 3.463 ± 0.735
PheTrp: 0.533 ± 0.828
PheTyr: 3.197 ± 1.519
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.598 ± 0.578
GlyCys: 1.066 ± 0.619
GlyAsp: 2.93 ± 0.694
GlyGlu: 1.598 ± 0.814
GlyPhe: 1.865 ± 0.259
GlyGly: 2.397 ± 0.994
GlyHis: 0.266 ± 0.136
GlyIle: 3.729 ± 1.142
GlyLys: 3.996 ± 0.899
GlyLeu: 2.664 ± 0.109
GlyMet: 0.799 ± 0.239
GlyAsn: 1.332 ± 0.3
GlyPro: 1.332 ± 1.662
GlyGln: 0.533 ± 0.534
GlyArg: 1.598 ± 0.402
GlySer: 2.131 ± 1.085
GlyThr: 1.865 ± 0.663
GlyVal: 1.332 ± 0.395
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.397 ± 0.783
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.066 ± 1.067
HisCys: 0.799 ± 0.407
HisAsp: 0.799 ± 0.239
HisGlu: 1.066 ± 0.234
HisPhe: 1.066 ± 0.234
HisGly: 0.799 ± 0.718
HisHis: 1.332 ± 1.543
HisIle: 1.865 ± 1.012
HisLys: 1.598 ± 0.984
HisLeu: 3.197 ± 0.703
HisMet: 0.533 ± 0.271
HisAsn: 0.799 ± 0.239
HisPro: 1.066 ± 0.619
HisGln: 1.332 ± 0.3
HisArg: 0.533 ± 0.271
HisSer: 2.397 ± 0.521
HisThr: 1.066 ± 0.543
HisVal: 1.865 ± 0.453
HisTrp: 0.0 ± 0.0
HisTyr: 1.865 ± 0.663
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.664 ± 0.457
IleCys: 1.332 ± 0.535
IleAsp: 3.996 ± 1.163
IleGlu: 2.93 ± 1.088
IlePhe: 4.262 ± 2.474
IleGly: 1.865 ± 0.519
IleHis: 2.397 ± 0.871
IleIle: 6.127 ± 0.843
IleLys: 3.463 ± 0.75
IleLeu: 7.459 ± 5.192
IleMet: 2.664 ± 1.312
IleAsn: 3.463 ± 0.75
IlePro: 3.463 ± 0.75
IleGln: 3.197 ± 0.611
IleArg: 2.93 ± 0.694
IleSer: 4.529 ± 0.981
IleThr: 3.996 ± 2.494
IleVal: 2.93 ± 0.491
IleTrp: 0.266 ± 0.136
IleTyr: 4.795 ± 0.798
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.397 ± 0.994
LysCys: 0.533 ± 0.309
LysAsp: 4.529 ± 0.621
LysGlu: 3.463 ± 0.55
LysPhe: 5.86 ± 0.697
LysGly: 1.066 ± 0.234
LysHis: 1.066 ± 0.234
LysIle: 5.86 ± 1.391
LysLys: 2.93 ± 1.035
LysLeu: 4.795 ± 1.545
LysMet: 2.397 ± 0.871
LysAsn: 5.061 ± 2.107
LysPro: 2.93 ± 2.772
LysGln: 2.93 ± 0.574
LysArg: 2.664 ± 0.457
LysSer: 4.529 ± 1.113
LysThr: 5.061 ± 1.114
LysVal: 2.131 ± 1.085
LysTrp: 0.266 ± 0.414
LysTyr: 3.463 ± 0.921
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.127 ± 1.265
LeuCys: 0.533 ± 0.271
LeuAsp: 7.192 ± 0.937
LeuGlu: 7.192 ± 1.567
LeuPhe: 5.86 ± 4.419
LeuGly: 2.93 ± 0.741
LeuHis: 2.397 ± 0.716
LeuIle: 5.594 ± 2.171
LeuLys: 8.524 ± 1.962
LeuLeu: 12.52 ± 3.133
LeuMet: 3.197 ± 1.23
LeuAsn: 5.594 ± 0.651
LeuPro: 3.729 ± 0.674
LeuGln: 4.529 ± 1.48
LeuArg: 3.996 ± 0.889
LeuSer: 9.057 ± 0.877
LeuThr: 6.127 ± 1.193
LeuVal: 7.725 ± 1.014
LeuTrp: 0.799 ± 0.407
LeuTyr: 5.328 ± 1.253
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.332 ± 1.088
MetCys: 0.266 ± 0.136
MetAsp: 2.397 ± 0.773
MetGlu: 0.266 ± 0.136
MetPhe: 1.598 ± 0.477
MetGly: 1.332 ± 0.535
MetHis: 0.799 ± 0.492
MetIle: 2.131 ± 0.278
MetLys: 1.332 ± 0.517
MetLeu: 3.197 ± 0.17
MetMet: 0.266 ± 0.136
MetAsn: 0.266 ± 0.414
MetPro: 1.332 ± 0.678
MetGln: 0.533 ± 0.309
MetArg: 1.066 ± 0.543
MetSer: 3.197 ± 0.17
MetThr: 2.131 ± 0.469
MetVal: 1.865 ± 0.663
MetTrp: 0.533 ± 0.271
MetTyr: 0.266 ± 0.136
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.664 ± 1.356
AsnCys: 0.533 ± 0.271
AsnAsp: 3.996 ± 0.899
AsnGlu: 0.533 ± 0.309
AsnPhe: 3.463 ± 1.846
AsnGly: 1.332 ± 0.678
AsnHis: 0.799 ± 0.407
AsnIle: 4.795 ± 1.496
AsnLys: 2.93 ± 1.035
AsnLeu: 6.66 ± 1.941
AsnMet: 1.066 ± 0.619
AsnAsn: 2.131 ± 0.377
AsnPro: 1.332 ± 0.3
AsnGln: 0.533 ± 0.271
AsnArg: 1.865 ± 0.663
AsnSer: 4.529 ± 0.413
AsnThr: 4.262 ± 0.493
AsnVal: 5.594 ± 0.095
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.197 ± 0.804
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.598 ± 0.993
ProCys: 0.799 ± 0.407
ProAsp: 3.197 ± 0.17
ProGlu: 1.066 ± 0.543
ProPhe: 2.93 ± 0.491
ProGly: 1.865 ± 0.95
ProHis: 0.266 ± 0.414
ProIle: 1.332 ± 0.678
ProLys: 2.397 ± 0.783
ProLeu: 4.529 ± 1.11
ProMet: 1.332 ± 0.259
ProAsn: 0.799 ± 0.492
ProPro: 1.865 ± 0.259
ProGln: 1.066 ± 0.486
ProArg: 1.598 ± 0.928
ProSer: 3.996 ± 1.487
ProThr: 2.397 ± 0.242
ProVal: 3.729 ± 1.227
ProTrp: 0.266 ± 0.136
ProTyr: 1.865 ± 1.534
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.598 ± 0.305
GlnCys: 0.799 ± 0.239
GlnAsp: 2.397 ± 0.353
GlnGlu: 1.598 ± 0.814
GlnPhe: 0.799 ± 0.239
GlnGly: 0.799 ± 0.407
GlnHis: 1.598 ± 0.477
GlnIle: 1.598 ± 0.477
GlnLys: 2.131 ± 0.644
GlnLeu: 3.996 ± 1.378
GlnMet: 0.266 ± 0.136
GlnAsn: 2.131 ± 0.469
GlnPro: 1.332 ± 0.395
GlnGln: 1.598 ± 0.993
GlnArg: 2.664 ± 0.599
GlnSer: 1.332 ± 1.279
GlnThr: 0.533 ± 0.271
GlnVal: 1.598 ± 0.984
GlnTrp: 0.266 ± 0.136
GlnTyr: 2.397 ± 0.521
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.397 ± 0.353
ArgCys: 0.533 ± 0.271
ArgAsp: 2.664 ± 0.457
ArgGlu: 2.131 ± 1.085
ArgPhe: 2.397 ± 0.748
ArgGly: 2.131 ± 1.012
ArgHis: 1.332 ± 0.3
ArgIle: 1.332 ± 0.678
ArgLys: 2.397 ± 0.773
ArgLeu: 5.594 ± 2.377
ArgMet: 0.799 ± 0.407
ArgAsn: 3.463 ± 0.55
ArgPro: 0.799 ± 0.627
ArgGln: 1.332 ± 0.3
ArgArg: 2.131 ± 0.879
ArgSer: 3.729 ± 0.906
ArgThr: 3.197 ± 0.861
ArgVal: 2.664 ± 0.903
ArgTrp: 0.266 ± 0.603
ArgTyr: 1.598 ± 0.402
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.996 ± 1.487
SerCys: 1.066 ± 0.234
SerAsp: 4.262 ± 1.538
SerGlu: 3.996 ± 0.52
SerPhe: 5.594 ± 0.579
SerGly: 2.664 ± 0.457
SerHis: 3.463 ± 0.921
SerIle: 5.594 ± 0.788
SerLys: 6.66 ± 1.721
SerLeu: 10.655 ± 2.029
SerMet: 2.397 ± 0.651
SerAsn: 3.729 ± 1.899
SerPro: 2.93 ± 0.681
SerGln: 3.197 ± 0.703
SerArg: 3.463 ± 1.301
SerSer: 7.192 ± 1.611
SerThr: 5.594 ± 1.384
SerVal: 4.529 ± 0.413
SerTrp: 0.266 ± 0.136
SerTyr: 4.529 ± 0.438
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.463 ± 0.795
ThrCys: 1.598 ± 0.402
ThrAsp: 1.865 ± 0.95
ThrGlu: 2.397 ± 0.871
ThrPhe: 3.463 ± 0.919
ThrGly: 2.397 ± 1.586
ThrHis: 1.865 ± 0.453
ThrIle: 4.529 ± 1.392
ThrLys: 3.996 ± 1.082
ThrLeu: 5.594 ± 1.67
ThrMet: 1.598 ± 0.477
ThrAsn: 3.463 ± 1.301
ThrPro: 1.332 ± 1.017
ThrGln: 1.865 ± 0.259
ThrArg: 3.729 ± 0.816
ThrSer: 6.127 ± 2.268
ThrThr: 6.127 ± 4.142
ThrVal: 1.598 ± 0.402
ThrTrp: 0.799 ± 0.239
ThrTyr: 3.729 ± 0.518
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.729 ± 1.142
ValCys: 1.598 ± 0.402
ValAsp: 5.328 ± 1.438
ValGlu: 1.865 ± 0.95
ValPhe: 5.061 ± 2.792
ValGly: 2.397 ± 1.476
ValHis: 0.533 ± 0.309
ValIle: 2.397 ± 0.994
ValLys: 3.996 ± 0.928
ValLeu: 5.328 ± 0.795
ValMet: 1.066 ± 1.191
ValAsn: 2.93 ± 1.492
ValPro: 3.197 ± 0.611
ValGln: 1.598 ± 0.814
ValArg: 1.865 ± 0.663
ValSer: 6.926 ± 3.053
ValThr: 3.197 ± 1.168
ValVal: 5.061 ± 1.676
ValTrp: 0.266 ± 0.136
ValTyr: 2.664 ± 0.685
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.533 ± 0.534
TrpCys: 0.266 ± 0.136
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.799 ± 0.239
TrpGly: 0.266 ± 0.136
TrpHis: 0.0 ± 0.0
TrpIle: 0.266 ± 0.136
TrpLys: 0.533 ± 0.534
TrpLeu: 0.799 ± 0.239
TrpMet: 0.266 ± 0.136
TrpAsn: 0.533 ± 0.309
TrpPro: 0.0 ± 0.0
TrpGln: 0.266 ± 0.414
TrpArg: 0.266 ± 0.414
TrpSer: 0.799 ± 0.718
TrpThr: 0.266 ± 0.136
TrpVal: 0.266 ± 0.136
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.865 ± 0.663
TyrCys: 1.332 ± 0.3
TyrAsp: 5.061 ± 2.107
TyrGlu: 2.397 ± 0.748
TyrPhe: 3.729 ± 2.667
TyrGly: 1.865 ± 0.512
TyrHis: 1.066 ± 0.506
TyrIle: 3.197 ± 1.295
TyrLys: 2.664 ± 0.109
TyrLeu: 5.061 ± 1.193
TyrMet: 2.93 ± 0.491
TyrAsn: 3.996 ± 1.186
TyrPro: 1.865 ± 0.519
TyrGln: 1.598 ± 0.477
TyrArg: 1.865 ± 0.95
TyrSer: 3.729 ± 0.956
TyrThr: 4.262 ± 0.3
TyrVal: 2.664 ± 0.685
TyrTrp: 0.533 ± 0.309
TyrTyr: 3.729 ± 0.2
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3755 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
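Protein-level bootstrapping can be illustrated with the sketch below. It is only an approximation of the idea: the helper names `bootstrap_error` and `leu_leu_per_mille`, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions, since the page does not say which spread measure is used.

```python
import random
import statistics


def bootstrap_error(proteins, statistic, n_resamples=100, seed=0):
    """Estimate the spread of `statistic` by resampling whole proteins.

    Each resample draws len(proteins) sequences with replacement, so the
    protein (not the residue) is the unit being resampled.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(n_resamples):
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(statistic(sample))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.stdev(values)


def leu_leu_per_mille(proteins):
    """Frequency of the Leu-Leu dipeptide in per mille (overlapping pairs)."""
    pairs = [seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)]
    return 1000.0 * pairs.count("LL") / len(pairs) if pairs else 0.0


# Hypothetical toy proteome of three sequences, not the real data.
toy_proteome = ["MKLLA", "ALLKLL", "GGSGG"]
point_estimate = leu_leu_per_mille(toy_proteome)
error = bootstrap_error(toy_proteome, leu_leu_per_mille)
```

With only 3 proteins in the proteome, each resample can differ strongly from the original set, which is one plausible reason why some errors in the table are as large as the frequencies themselves.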

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski